SPEECH TECHNOLOGY
Answers to some Questions
SPEECH TECHNOLOGY
WHAT IS SPEECH TECHNOLOGY
ABOUT ??
SPEECH TECHNOLOGY IS ABOUT
PROCESSING HUMAN SPEECH
as SIGNAL
as a form of LANGUAGE
SPEECH TECHNOLOGY
WHAT ALL IS INVOLVED IN
PROCESSING SPEECH ??
MULTI-DISCIPLINARY FIELD
•Linguistics •Statistics
•Physiology Pattern Recognition
•Psychology •Communication Theory
•Signal Processing •Computer Science: A.I.
•Acoustics (Physics) Heuristics / Machine Learning
Speech Technology
SPEECH TECHNOLOGY
WHERE DOES
SPEECH TECHNOLOGY
FIND ITS
APPLICATIONS ??
SPEECH TECHNOLOGY
WHERE DOES SPEECH TECHNOLOGY FIND ITS
APPLICATIONS ??
1. MAN MACHINE INTERFACES
TALKING MACHINES (TERMINATOR)
TELEPHONIC INTERFACES
SPEECH ENABLED WEB INTERFACES
INTERFACES FOR THE DISABLED
SPEECH TECHNOLOGY
WHERE DOES SPEECH TECHNOLOGY FIND ITS
APPLICATIONS ??
2. COMMUNICATIONS
SPEECH CODING / COMPRESSION
SPEECH ENHANCEMENT
(in noisy environments)
SPEECH TECHNOLOGY
WHERE DOES SPEECH TECHNOLOGY FIND ITS
APPLICATIONS ??
3. BIOMETRICS
SPEAKER IDENTIFICATION
SPEECH TECHNOLOGY
WHERE DOES SPEECH TECHNOLOGY FIND ITS
APPLICATIONS ??
4. ENTERTAINMENT TECHNOLOGY
SINGING MACHINES
VOICE CONVERSION
SPEECH TECHNOLOGY
CHALLENGES
High Performance Speech Recognition Systems
• Automatic Speech Recognizers for Indian
languages
(no major systems available)
• Audio-Visual Speech recognition
• Speech Recognition in tiny / mobile devices
[ Robots, Watches (Bond Stuff) ]
SPEECH TECHNOLOGY
CHALLENGES
Naturally Speaking Synthesis Systems
• Indian Language T. T. S.
(Improvements and extensions to various Indian Languages)
• Customizable T.T.S.
• Ability to produce Emotional Speech
SPEECH TECHNOLOGY
WHAT ARE WE DOING CURRENTLY ??
1. INDIAN LANGUAGE SPEECH SYNTHESIS
http://speech.iiit.net/
SYNTHESIZED SPEECH 1
SYNTHESIZED SPEECH 2
SPEECH TECHNOLOGY
WHAT ARE WE DOING CURRENTLY ??
2. SPEECH RECOGNITION
3. PROSODY MANIPULAITON
MANIPULATED SPEECH 1
MANIPULATED SPEECH 2
SPEECH TECHNOLOGY
WHAT ARE WE DOING CURRENTLY ??
4. SPEECH ENABLED INTERFACES (Web, PDAs)
5. READING AID FOR VISUALLY IMPAIRED
SPEECH TECHNOLOGY
WHO WILL YOU BE WORKING WITH ??
• S. P. Kishore (Ph.D. @ CMU, Scientist @ IIIT)
• Prof. Rajeev Sangal
• Dr. Vasudeva Varma
• Close Association with Faculty Members
of Speech Group at CMU
• Me (Rohit Kumar)
SPEECH TECHNOLOGY
QUESTIONS
[ speech@iiit.net ]