CSE542


Course CSE542
Title Speech Processing
Description CSE542 will cover three main topics, each to be covered over a 4-week
period:

Introduction to the collection and analysis of speech data for speech processing
Includes a brief introduction to corpus linguistics. Students will learn the range and types of spoken language collections, and will learn how to analyze speech data using the Praat tool.

Introduction to speech recognition.
Students will learn basic technologies for speech recognition, using the Hidden Markov Model Toolkit (HTK).

Introduction to concatenative text-to-speech synthesis
Students will learn the basics of text-to-speech synthesis (TTS), as well as current technologies for concatenative TTS. The TTS system Festival (or its Java version, FreeTTS) will be used.

Integration of speech recognition and TTS into other technologies (by means of, e.g., VoiceXML and/or the speech SDKs under development by Microsoft, Sun (Java), and IBM) will also be discussed.

Prerequisite or Corequisite Knowledge of C/C++, Java, and/or Perl preferred. Knowledge of phonetics a plus.
Credit Information 3 - credits
Course Webpage http://www.cs.sunysb.edu/~cse542
Course Coordinator Dr. Amanda Stent