A dataset of the corpus files containing metata for 440 speakers of American English.

sdac_speaker_meta

Format

A data frame with 543 rows and 14 variables:

speaker_id

Unique speaker identification code

pin

Access code

target

...

sex

Sex of the speaker

birth_year

Year that the speaker was born

dialect_area

Region from the US where the speaker spent first 10 years

education

Highest educational level attained

ti

...

payment_type

Form of payment for participation

amt_pd

Payment amount for participation

con

...

remarks

Misc. comments

calls_deleted

...

speaker_partition

...

Source

https://catalog.ldc.upenn.edu/docs/LDC97S62/