Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups
10.240
Zitationen
11
Autoren
2012
Jahr
Abstract
Most current speech recognition systems use hidden Markov models (HMMs) to deal with the temporal variability of speech and Gaussian mixture models (GMMs) to determine how well each state of each HMM fits a frame or a short window of frames of coefficients that represents the acoustic input. An alternative way to evaluate the fit is to use a feed-forward neural network that takes several frames of coefficients as input and produces posterior probabilities over HMM states as output. Deep neural networks (DNNs) that have many hidden layers and are trained using new methods have been shown to outperform GMMs on a variety of speech recognition benchmarks, sometimes by a large margin. This article provides an overview of this progress and represents the shared views of four research groups that have had recent successes in using DNNs for acoustic modeling in speech recognition.
Ähnliche Arbeiten
AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale
2018 · 45.495 Zit.
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014 · 24.073 Zit.
A tutorial on hidden Markov models and selected applications in speech recognition
1989 · 22.674 Zit.
Efficient Estimation of Word Representations in Vector Space
2013 · 18.100 Zit.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
2001 · 12.995 Zit.