Lip contour description based on orthogonal transform in visual driven speech synthesis system

[in Chinese]; [in Chinese]; [in Chinese]; [in Chinese]

[2] RAO R C T.Joint audio-video processing for multimedia[C].Proceedings of 22nd International Conference on Industrial Electronics,Control,and Instrumentation,Los Alamitos,USA:IEEE,1996,1:548-553.

[3] ZHANG X.,MERSEREAU R M,BROUN C C,et al..Visual speech feature extraction for improved speech recognition[C].Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing,Pis-cataway,NJ,USA:LEEE,2002,2:1993-1996.

[5] KAYNAK M N,QIZ,CHEOK A D,et al..Audio visual modeling for bimodat speech recognition[C].Proceedings of IEEE International Conference on Systems,Man,and Cybernetics,Piscataway,NJ,USA:IEEE,2001,1:181-186.

[6] SCANLON P,REILLY R.Feature analysis for automatic speechreading[C].Proceedings of IEEE Fourth Workshop on Multimedia Signal Processing,Piscataway,NJ,USA:LEEE,2001:625-6304.

[7] MATTHEWS I,COOTES T F,BANGHAM J A,et al..Extraction of visual features for lipreading[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(2):198-213.

[8] SEGUIER R,CLADEL N.Multiobjectives genetic snakes:application on audio-visual speech recognition[C].Proceedings of 4th EURASIP Con ference focused on Video/Image Processing and Multimedia Communications,Groatia,Zagreb:Faculty of Electrical Engineering and wmputing,2003,2:625-630.

[9] CHANDRAMOHAN D,SILSBEE P L.A multiple deformable template approach for visual speech recognition[C].Proceedings of 4th International Confefence on Spoken Language,Processing New York,USA:IEEE,1996,1:50-53.

[10] LIE W N,HSIEH H C.Lips detection by morphological image processing[C].Proceedings of 4th International Confefence on Signal Processing,Piscataway,NJ,USA:IEEE,1998,2:1084-1087.

[11] GRAF H P,COSATTO E,POTAMIANOS M.Robust recognition of faces and facial features with a multi-modal systerm[c].Proceedings of International Conference on Systems,Man,andCybernetics,Piscataway,NJ,USA:IEEE,1997,3:2034-2039.

[13] LI G,WANG M J,LIN L.Extracting lip parameters in speech synthesis system driven by visual-speech[c].Proceedings of IEEE first International Conference on Innovative Computing,Information and Control,Los Alamitos,USA:IEEE-CS,2006,2:346-349.

[14] PERSOON E,FU K S.Shape discrimination using Fourier descriptors[J].IEEE Transactions on System,Man and Cybernetics,1977,7(2):170-179.

[16] WILLIAMS J J,KATSAGGELOS A K,RANDOLPH MA.A hidden Markov model based visual speech synthesizer[C].Proceedings of International Conference on Acoustics,Speech,and Signal Processing,Piscataway,NJ,USA:IEEE,2000,4:2393-2396.

微信扫一扫：分享

微信扫一扫：分享