This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge
Citations: 0
Authors: 6
Year: 2024
Abstract
Lipreading, which infers spoken content solely from visual information such as lip movements, is crucial in multi-modal research, medicine, and human-computer interaction. We organized the Chat-scenario Chinese Lipreading (ChatCLR) challenge, focusing on unscripted chat scenarios among native Chinese speakers. The challenge emphasizes two tasks: wake word lipreading (WWLR) and target speaker lipreading (TSLR). These tasks address two needs: waking up smart home devices in household settings, and using video for speech recognition on those devices. For the WWLR task, we received submissions from 5 teams, with the top-performing system showing a 71.4% improvement over the baseline system. For the TSLR task, we received submissions from 6 teams, and the best system achieved a 22.1% improvement over the baseline system.
Similar works
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,781 citations
An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller
1999 · 5,633 citations
An experiment in linguistic synthesis with a fuzzy logic controller
1975 · 5,591 citations
A FRAMEWORK FOR REPRESENTING KNOWLEDGE
1988 · 4,551 citations
Opinion Paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy
2023 · 3,515 citations