About
Education
Research
People
News/Events
Contacts

Course Title: Self-Paced Lab: Speech Recognition (11-754)
Department: Language Technologies Institute (LTI)
Units: 6
Semester: Both
Instructor: Alex Rudnicky, Alan Black
Prerequisite: Speech Recognition (11-751) or permission of instructor

Course Description:

This course will teach participants how to implement a complete spoken language system while providing opportunities to explore research topics of interest in the context of a functioning system. The course will produce a complete implementation of a system to access and manipulate email through voice only, for example to allow users to interact with the mail system over a telephone while away from their computer. In doing so the class will address the component activities of spoken language system building. These include, but are not limited to, task analysis and language design, application-specific acoustic and language modeling, grammar design, task design, dialog management, language generation and synthesis. The course will place particular emphasis on issues in task design and dialog management and on issues in language generation and synthesis.

For Fall 2001, we will implement a simple telephone-based information access application. The domain is bus schedules (see http://www.speech.cs.cmu.edu/BusLine for a web-based interface to this domain) and the goal will be to create one or more usable applications that can provide a real service and can be deployed for actual use by the University community. 

Participants will chose individual components of the system to concentrate on and will collaborate to put together the entire system. It is perfectly acceptable for several individuals to concentrate on a single component, particularly if their work will exemplify alternative approaches to the same problem.


Webmaster: ehn@cs.cmu.edu



LTI is part of the School of Computer Science at Carnegie Mellon University.
This page is maintained by ehn@cs.cmu.edu.