Automatic recognition of the acoust

Automatic recognition of the acoustic speech signal alone is inaccurate and computationally expensive. Additional sources of speech information, such as lipreading (or speechreading), should enhance automatic speech recognition, just as lipreading is used by humans to enhance speech recognition when the acoustic signal is degraded. This paper describes an automatic lipreading system which has been developed. A commercial device performs the acoustic speech recognition independently of the lipreading system.
The recognition domain is restricted to isolated utterances and speaker dependent recognition. The speaker faces a solid state camera which sends digitized video to a minicomputer system with custom video processing hardware. The video data is sampled during an utterance and then reduced to a template consisting of visual speech parameter time sequences. The distances between the incoming template and all of the trained templates for each utterance in the vocabulary are computed and a visual recognition candidate is obtained. The combination of the acoustic and visual recognition candidates is shown to yield a final recognition accuracy which greatly exceeds the acoustic recognition accuracy alone. Practical considerations and the possible enhancement of speaker independent and continuous speech recognition systems are also discussed.

0/5000

From: -

To: -

Results (Thai) 1: [Copy]

Copied!

Automatic recognition of the acoustic speech signal alone is inaccurate and computationally expensive. Additional sources of speech information, such as lipreading (or speechreading), should enhance automatic speech recognition, just as lipreading is used by humans to enhance speech recognition when the acoustic signal is degraded. This paper describes an automatic lipreading system which has been developed. A commercial device performs the acoustic speech recognition independently of the lipreading system.The recognition domain is restricted to isolated utterances and speaker dependent recognition. The speaker faces a solid state camera which sends digitized video to a minicomputer system with custom video processing hardware. The video data is sampled during an utterance and then reduced to a template consisting of visual speech parameter time sequences. The distances between the incoming template and all of the trained templates for each utterance in the vocabulary are computed and a visual recognition candidate is obtained. The combination of the acoustic and visual recognition candidates is shown to yield a final recognition accuracy which greatly exceeds the acoustic recognition accuracy alone. Practical considerations and the possible enhancement of speaker independent and continuous speech recognition systems are also discussed.

Being translated, please wait..

Results (Thai) 2:[Copy]

Copied!

รับรู้โดยอัตโนมัติของสัญญาณเสียงอะคูสติกอย่างเดียวที่ไม่ถูกต้องและมีราคาแพงคอมพิวเตอร์ แหล่งข้อมูลเพิ่มเติมคำพูดเช่น lipreading (หรือ speechreading) ควรเพิ่มการรู้จำเสียงพูดอัตโนมัติเช่นเดียวกับ lipreading ถูกนำมาใช้โดยมนุษย์เพื่อเพิ่มการรับรู้คำพูดเมื่อสัญญาณอะคูสติกที่มีการสลายตัว กระดาษนี้จะอธิบายระบบ lipreading อัตโนมัติซึ่งได้รับการพัฒนา อุปกรณ์เชิงพาณิชย์ดำเนินการรู้จำเสียงอะคูสติกเป็นอิสระจากระบบ lipreading.
โดเมนได้รับการยอมรับถูก จำกัด ให้คำพูดที่แยกและลำโพงขึ้นอยู่กับการรับรู้ ลำโพงใบหน้ากล้องของรัฐที่มั่นคงซึ่งจะส่งวิดีโอดิจิตอลกับระบบมินิคอมพิวเตอร์ฮาร์ดแวร์ประมวลผลวิดีโอที่กำหนดเอง ข้อมูลวิดีโอเป็นตัวอย่างในระหว่างคำพูดและจากนั้นลดลงไปที่ประกอบด้วยแม่แบบของการพูดภาพลำดับเวลาพารามิเตอร์ ระยะทางระหว่างแม่แบบที่เข้ามาและทั้งหมดของแม่แบบการฝึกอบรมสำหรับคำพูดคำศัพท์แต่ละคำนวณและการรับรู้ของผู้สมัครจะได้รับภาพ การรวมกันของผู้สมัครการรับรู้อะคูสติกและภาพแสดงให้เห็นว่าผลผลิตความถูกต้องได้รับการยอมรับเป็นครั้งสุดท้ายที่มากเกินกว่าความถูกต้องของการรับรู้อะคูสติกเพียงอย่างเดียว การพิจารณาการปฏิบัติและเป็นไปได้ของการเพิ่มประสิทธิภาพของลำโพงที่เป็นอิสระและอย่างต่อเนื่องระบบรู้จำเสียงพูดยังจะกล่าวถึง

Being translated, please wait..

Results (Thai) 3:[Copy]

Copied!

การรับรู้โดยอัตโนมัติของอะคูสติกสัญญาณเสียงพูดอย่างเดียวไม่ถูกต้องและ computationally แพง แหล่งข่าวกล่าวเพิ่มเติม เช่น lipreading ( หรือ speechreading ) ควรเพิ่มประสิทธิภาพการรู้จำเสียงพูดอัตโนมัติ เช่นเดียวกับ lipreading ถูกใช้โดยมนุษย์เพื่อเพิ่มประสิทธิภาพการรู้จำเสียงพูดเมื่อสัญญาณมีคุณภาพต่ำบทความนี้อธิบายถึง lipreading อัตโนมัติระบบที่ได้รับการพัฒนา . อุปกรณ์เชิงพาณิชย์จะมีการรู้จำเสียงพูดเสียงอิสระของระบบ lipreading .
การจำกัดความ และโดเมนแยกลำโพงขึ้นอยู่กับการรับรู้ลำโพงหน้าแข็ง สภาพกล้องวิดีโอดิจิทัลที่ส่งระบบมินิคอมพิวเตอร์กับฮาร์ดแวร์การประมวลผลวิดีโอที่กำหนดเอง ข้อมูล วิดีโอ และในระหว่างที่แล้วลดลงเป็นแม่แบบซึ่งประกอบด้วยพารามิเตอร์ภาพ คำพูด เวลา ลำดับระยะทางระหว่างขาเข้าทั้งหมดของแม่แบบแม่แบบและฝึกคำศัพท์ที่ในแต่ละชั้น และการรับรู้ภาพ ผู้สมัครจะได้รับ การรวมกันของการรับรู้เสียงและภาพผู้สมัครแสดงผลผลิตสุดท้ายความถูกต้องในการรู้จำซึ่งมากเกินกว่าความจำเสียงคนเดียวข้อควรพิจารณาในทางปฏิบัติและการเพิ่มประสิทธิภาพที่สุดของลำโพงอิสระ และระบบจดจำเสียงพูดต่อเนื่องยังกล่าวถึง

Being translated, please wait..

Other languages

The translation tool support: Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Cebuano, Chichewa, Chinese, Chinese Traditional, Corsican, Croatian, Czech, Danish, Detect language, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Frisian, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Korean, Kurdish (Kurmanji), Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Myanmar (Burmese), Nepali, Norwegian, Odia (Oriya), Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Samoan, Scots Gaelic, Serbian, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tajik, Tamil, Tatar, Telugu, Thai, Turkish, Turkmen, Ukrainian, Urdu, Uyghur, Uzbek, Vietnamese, Welsh, Xhosa, Yiddish, Yoruba, Zulu, Language translation.