This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
openSMILE
2010 · 3 authors · 2,519 citations
Abstract
We introduce the openSMILE feature extraction toolkit, which unites feature extraction algorithms from the speech processing and the Music Information Retrieval communities. Audio low-level descriptors such as CHROMA and CENS features, loudness, Mel-frequency cepstral coefficients, perceptual linear predictive cepstral coefficients, linear predictive coefficients, line spectral frequencies, fundamental frequency, and formant frequencies are supported. Delta regression and various statistical functionals can be applied to the low-level descriptors. openSMILE is implemented in C++ with no third-party dependencies for the core functionality. It is fast, runs on Unix and Windows platforms, and has a modular, component based architecture which makes extensions via plug-ins easy. It supports on-line incremental processing for all implemented features as well as off-line and batch processing. Numeric compatibility with future versions is ensured by means of unit tests. openSMILE can be downloaded from http://opensmile.sourceforge.net/.
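To illustrate what "delta regression" and "statistical functionals" over a low-level descriptor can look like, here is a minimal Python sketch. It is not openSMILE code and does not use the openSMILE API. It assumes the widely used regression formula for delta coefficients, d_t = sum_{n=1..N} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..N} n^2), with edge frames padded by repetition. The function names `delta_regression` and `functionals`, the default window size, and the chosen set of functionals are hypothetical choices made for this example.

```python
from statistics import mean, pstdev


def delta_regression(values, window=2):
    """Delta coefficients of a 1-D descriptor contour.

    Assumption: standard regression formula with a symmetric window of
    `window` frames; frames beyond the edges are replaced by the first
    or last frame (repetition padding).
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    n_frames = len(values)
    denom = 2 * sum(n * n for n in range(1, window + 1))
    deltas = []
    for t in range(n_frames):
        acc = 0.0
        for n in range(1, window + 1):
            ahead = values[min(t + n, n_frames - 1)]
            behind = values[max(t - n, 0)]
            acc += n * (ahead - behind)
        deltas.append(acc / denom)
    return deltas


def functionals(values):
    """A few illustrative statistical functionals over a non-empty contour."""
    lo, hi = min(values), max(values)
    return {
        "mean": mean(values),
        "stddev": pstdev(values),  # population standard deviation
        "min": lo,
        "max": hi,
        "range": hi - lo,
    }


if __name__ == "__main__":
    # Hypothetical per-frame loudness values, for demonstration only.
    contour = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
    print(delta_regression(contour))
    print(functionals(delta_regression(contour)))
```

In this sketch the functionals are applied to the delta contour, which mirrors the processing order the abstract describes: low-level descriptors first, then deltas, then summary statistics over a segment.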
Similar works
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
2014 · 10,764 citations
Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups
2012 · 10,244 citations
Speech recognition with deep recurrent neural networks
2013 · 8,784 citations
LSTM: A Search Space Odyssey
2016 · 6,700 citations
Librispeech: An ASR corpus based on public domain audio books
2015 · 5,832 citations