A dataset built from the corpus files, containing the 1,155 conversations of 440 speakers of American English.
sdac_files
A data frame with 223,606 rows and 7 variables:
ID for each conversation document
DAMSL dialog act annotation labels
Label for each speaker in the conversation
Number of contiguous utterance turns for a given speaker
The cumulative number of utterances in the conversation
The actual dialog utterance
Unique speaker identification code
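
The seven variables above can be pictured as one record per utterance. The following is a minimal Python sketch, not the package's own code: the field names (doc_id, damsl_tag, speaker, turn_num, utterance_num, utterance_text, speaker_id) are hypothetical stand-ins for the variable names, which are not given here. It also assumes that a turn is a run of consecutive utterances by the same speaker, and that the utterance counter runs cumulatively over the whole conversation.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Utterance:
    """One row of the data frame. All field names are hypothetical."""
    doc_id: str          # ID for the conversation document
    damsl_tag: str       # DAMSL dialog act annotation label
    speaker: str         # label for the speaker within the conversation
    turn_num: int        # turn counter (assumed: increments on speaker change)
    utterance_num: int   # cumulative utterance counter in the conversation
    utterance_text: str  # the utterance itself
    speaker_id: str      # unique speaker identification code


def number_utterances(speakers):
    """Return (turn_num, utterance_num) pairs for a sequence of speaker labels.

    Assumption: consecutive utterances by the same speaker share one turn.
    """
    numbered = []
    utterance_num = 0
    # groupby yields one group per run of identical consecutive labels
    for turn_num, (_, run) in enumerate(groupby(speakers), start=1):
        for _ in run:
            utterance_num += 1
            numbered.append((turn_num, utterance_num))
    return numbered
```

For example, under these assumptions the speaker sequence A, A, B, A gives turn numbers 1, 1, 2, 3 and utterance numbers 1, 2, 3, 4.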