A dataset containing the 15,475 utterances by 44 speakers of American English.
sbc
A data frame with 15,475 rows and 13 variables:
ID for each speaker
Name of each speaker
Gender of the speaker
Age of the speaker at recording
Dialect self-assessment for each speaker
State where each speaker was raised
State of residence for each speaker at recording
Highest educational degree obtained
Number of years in the educational setting
Occupation of the speaker at recording
Ethnicity self-assessment for each speaker
Annotated transcription of a speaker's utterance
Simplified transcription of a speaker's utterance
http://www.linguistics.ucsb.edu/research/santa-barbara-corpus