CSAIL Event Calendar: Previous Series

Summarizing Speech

Speaker: Gerald Penn , University of Toronto
Date: November 5 2007
Time: 3:00PM to 4:00PM
Location: 32-D463 Stata Center - Star Conference Room
Host: Regina Barzilay, MIT CSAIL

Contact: Marcia Davidson, 617-253-3049, marcia@csail.mit.edu
Relevant URL:

Speech is arguably the most basic, most natural form of communication that we engage in, so it should come as no surprise that there has been a consistent pressure to deliver spoken audio content on web pages that, in principle, can be searched through. Even once the search problem is solved, however, the low-bandwidth, non-visual, traditional delivery of spoken audio makes it much more difficult to browse through. This makes the automated summarization of speech particularly attractive: given a number N, prepare a summary of a spoken "document" that contains the most important or salient content that is N seconds long, or N utterances long, or N percent of the original document's length.

This talk will present a (human-prepared) summary of our research on summarizing speech. We'll talk about how speech summarization is usually evaluated, including some of the appropriate baselines in this area, the dependence of genre on the performance and tuning of summarizers, the role of automated speech transcription in summarization, and the usefulness of some of the acoustic, untranscribed features of the speech signal.

See other events that are part of Artificial Intelligence Seminar Series 2007/2008

See other events happening in November 2007


About Us Research News Resources Directory