• Optics and Precision Engineering
  • Vol. 15, Issue 7, 1117 (2007)
1, 1, 1, and 1,2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
    [in Chinese], [in Chinese], [in Chinese], [in Chinese]. Lip contour description based on orthogonal transform in visual driven speech synthesis system[J]. Optics and Precision Engineering, 2007, 15(7): 1117
    References

    [2] RAO R C T. Joint audio-video processing for multimedia[C]. Proceedings of 22nd International Conference on Industrial Electronics, Control, and Instrumentation, Los Alamitos, USA: IEEE, 1996, 1: 548-553.

    [3] ZHANG X, MERSEREAU R M, BROUN C C, et al. Visual speech feature extraction for improved speech recognition[C]. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Piscataway, NJ, USA: IEEE, 2002, 2: 1993-1996.

    [5] KAYNAK M N, QI Z, CHEOK A D, et al. Audio-visual modeling for bimodal speech recognition[C]. Proceedings of IEEE International Conference on Systems, Man, and Cybernetics, Piscataway, NJ, USA: IEEE, 2001, 1: 181-186.

    [6] SCANLON P, REILLY R. Feature analysis for automatic speechreading[C]. Proceedings of IEEE Fourth Workshop on Multimedia Signal Processing, Piscataway, NJ, USA: IEEE, 2001: 625-630.

    [7] MATTHEWS I, COOTES T F, BANGHAM J A, et al. Extraction of visual features for lipreading[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(2): 198-213.

    [8] SEGUIER R, CLADEL N. Multiobjectives genetic snakes: application on audio-visual speech recognition[C]. Proceedings of 4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications, Zagreb, Croatia: Faculty of Electrical Engineering and Computing, 2003, 2: 625-630.

    [9] CHANDRAMOHAN D, SILSBEE P L. A multiple deformable template approach for visual speech recognition[C]. Proceedings of 4th International Conference on Spoken Language Processing, New York, USA: IEEE, 1996, 1: 50-53.

    [10] LIE W N, HSIEH H C. Lips detection by morphological image processing[C]. Proceedings of 4th International Conference on Signal Processing, Piscataway, NJ, USA: IEEE, 1998, 2: 1084-1087.

    [11] GRAF H P, COSATTO E, POTAMIANOS M. Robust recognition of faces and facial features with a multi-modal system[C]. Proceedings of International Conference on Systems, Man, and Cybernetics, Piscataway, NJ, USA: IEEE, 1997, 3: 2034-2039.

    [13] LI G, WANG M J, LIN L. Extracting lip parameters in speech synthesis system driven by visual-speech[C]. Proceedings of IEEE First International Conference on Innovative Computing, Information and Control, Los Alamitos, USA: IEEE-CS, 2006, 2: 346-349.

    [14] PERSOON E, FU K S. Shape discrimination using Fourier descriptors[J]. IEEE Transactions on Systems, Man, and Cybernetics, 1977, 7(2): 170-179.

    [16] WILLIAMS J J, KATSAGGELOS A K, RANDOLPH M A. A hidden Markov model based visual speech synthesizer[C]. Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Piscataway, NJ, USA: IEEE, 2000, 4: 2393-2396.
