Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Librispeech: An ASR corpus based on public domain audio books
5.829
Zitationen
4
Autoren
2015
Jahr
Abstract
This paper introduces a new corpus of read English speech, suitable for training and evaluating speech recognition systems. The LibriSpeech corpus is derived from audiobooks that are part of the LibriVox project, and contains 1000 hours of speech sampled at 16 kHz. We have made the corpus freely available for download, along with separately prepared language-model training data and pre-built language models. We show that acoustic models trained on LibriSpeech give lower error rate on the Wall Street Journal (WSJ) test sets than models trained on WSJ itself. We are also releasing Kaldi scripts that make it easy to build these systems.
Ähnliche Arbeiten
AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale
2018 · 45.495 Zit.
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014 · 24.073 Zit.
A tutorial on hidden Markov models and selected applications in speech recognition
1989 · 22.674 Zit.
Efficient Estimation of Word Representations in Vector Space
2013 · 18.100 Zit.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
2001 · 12.995 Zit.