Correia R., Mamede N., Baptista J., Eskenazi M.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2014

pp 262

-
269

Abstract:

This paper describes the supervised classification of four metadiscursive functions in English. Training data is collected using crowdsourcing to label a corpus of TED talks transcripts with occurrences of Introductions, Conclusions, Examples, and Emphasis. Using decision trees and lexical features, we report classification accuracy.