1. Introduction:
1.1 Topic
1.2 Organization
2. Applications:
Dictation systems
Voice-based communication in tele-banking, voice mail, database query
systems, information retrieval systems, etc.
System control in automobiles, robotics, airplanes, etc.
Security systems for speaker verification
3. Objective:
Recognise 10 English words (speaker independent) with at least 90% accuracy in a noisy
environment.
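The objective does not say how noisy the environment is. One common way to test it, sketched here purely as an assumption, is to add white Gaussian noise to clean recordings at a chosen signal-to-noise ratio (SNR) before recognition. The function names, the 10 dB value and the sine-wave test signal are illustrative and are not taken from the proposal.

```python
import math
import random


def power(samples):
    """Mean squared amplitude of a sample sequence."""
    return sum(x * x for x in samples) / len(samples)


def add_noise(signal, snr_db, seed=0):
    """Return signal plus white Gaussian noise scaled to the requested SNR in dB."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Scale the noise so that power(signal) / power(scaled noise) = 10^(snr_db / 10).
    target_noise_power = power(signal) / (10 ** (snr_db / 10))
    scale = math.sqrt(target_noise_power / power(noise))
    return [s + scale * n for s, n in zip(signal, noise)]


# Hypothetical clean signal: 800 samples of a sine wave standing in for speech.
clean = [math.sin(2 * math.pi * 5 * t / 800) for t in range(800)]
noisy = add_noise(clean, snr_db=10)
print(len(noisy))
```

Running the recogniser on copies of the test set at several SNR values would show where accuracy falls below the 90% target.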
4. Methodology:
5. Project Schedule:
January 2008
o Processing of audio signals
o Feature extraction from the chosen training database
o Pattern recognition and signature extraction from the features
o Training the HMM with the training set
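Once an HMM has been trained for each word, isolated-word recognition usually means scoring an utterance against every word model and picking the most likely one. The sketch below does this with the forward algorithm for discrete observation symbols. The two toy models, their probabilities, and the names `forward_likelihood` and `recognise` are illustrative assumptions. A real system would estimate the models from the training database (for example with Baum-Welch) and would more likely use continuous features.

```python
def forward_likelihood(obs, start, trans, emit):
    """Return P(obs | model) for a discrete HMM using the forward algorithm."""
    n_states = len(start)
    # Initialisation: probability of starting in each state and emitting obs[0].
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    # Induction: propagate through the transition matrix for each later symbol.
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][symbol]
            for s in range(n_states)
        ]
    # Termination: sum over all final states.
    return sum(alpha)


def recognise(obs, models):
    """Pick the word whose HMM gives the observation sequence the highest likelihood."""
    return max(models, key=lambda word: forward_likelihood(obs, *models[word]))


# Hypothetical two-state, left-to-right models over a two-symbol alphabet (0 and 1).
# Each entry is (start probabilities, transition matrix, emission matrix).
MODELS = {
    "yes": ([1.0, 0.0], [[0.7, 0.3], [0.0, 1.0]], [[0.9, 0.1], [0.2, 0.8]]),
    "no": ([1.0, 0.0], [[0.7, 0.3], [0.0, 1.0]], [[0.1, 0.9], [0.8, 0.2]]),
}

print(recognise([0, 0, 1, 1], MODELS))
```

For longer utterances the products of probabilities underflow, so a practical implementation would work in the log domain or rescale `alpha` at each step.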
February 2008
o Processing of video signals
o Feature extraction from the chosen training database
o Pattern recognition and signature extraction from the features
March 2008
o Synchronize audio and video features for pattern recognition
o Extension of training data set to 10 words
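Audio and video features normally arrive at different frame rates, so they must be aligned in time before joint pattern recognition. One simple option, sketched here as an assumption rather than the project's chosen method, is to interpolate the slower video features up to the audio frame rate and concatenate the two vectors frame by frame. The rates of 100 audio frames and 25 video frames per second and the function names are hypothetical.

```python
def resample_linear(frames, src_rate, dst_rate, n_out):
    """Linearly interpolate feature frames from src_rate to dst_rate (frames per second)."""
    out = []
    for i in range(n_out):
        # Position of output frame i on the source frame axis, clamped to the last frame.
        pos = min(i * src_rate / dst_rate, len(frames) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(frames) - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out


def fuse(audio, video, audio_rate=100.0, video_rate=25.0):
    """Concatenate each audio frame with the time-aligned, interpolated video frame."""
    video_up = resample_linear(video, video_rate, audio_rate, len(audio))
    return [a + v for a, v in zip(audio, video_up)]


# Hypothetical one-dimensional features: 8 audio frames and 2 video frames.
audio_feats = [[float(i)] for i in range(8)]
video_feats = [[0.0], [4.0]]
fused = fuse(audio_feats, video_feats)
print(len(fused), len(fused[0]))
```

Concatenating features in this way is often called early fusion. Combining the scores of separate audio and video recognisers is a common alternative.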
April 2008
o Upgrade the system for speaker-independent applications
o Performance analysis by comparing results of audio-only approach with that of
joint audio-visual approach
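The comparison between the audio-only and joint audio-visual approaches can be reduced to word accuracy on a common test set, checked against the 90% target from the objective. The following is a minimal sketch. The ten digit words and the recogniser outputs are made-up placeholders, not project results, and the function names are assumptions.

```python
def accuracy(predicted, reference):
    """Fraction of test utterances whose recognised word matches the reference label."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("need two non-empty label lists of equal length")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return correct / len(reference)


def compare(audio_only, audio_visual, reference, target=0.90):
    """Return {system name: (accuracy, meets target)} for both approaches."""
    results = {}
    for name, predicted in (("audio-only", audio_only), ("audio-visual", audio_visual)):
        acc = accuracy(predicted, reference)
        results[name] = (acc, acc >= target)
    return results


# Placeholder labels for illustration only; the proposal does not list the ten words.
REFERENCE = ["zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine"]
AUDIO_ONLY = ["zero", "nine", "two", "three", "five", "five", "six", "seven", "eight", "nine"]
AUDIO_VISUAL = ["zero", "one", "two", "three", "five", "five", "six", "seven", "eight", "nine"]

for system, (acc, ok) in compare(AUDIO_ONLY, AUDIO_VISUAL, REFERENCE).items():
    print(system, acc, "meets target" if ok else "below target")
```

Running both systems on the same noisy test utterances keeps the comparison fair, since any difference in accuracy then comes from the added visual features.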
May 2008
o Documentation
References:
1. Tsuhan Chen, "Audiovisual Speech Processing: Lip Reading and Lip
Synchronization", IEEE Signal Processing Magazine, January 2001.
2. R. Chellappa, C. L. Wilson and S. Sirohey, "Human and Machine
Recognition of Faces: A Survey", Proceedings of the IEEE, vol. 83, no. 5, May
1995.