Sciweavers

512 search results - page 40 / 103
» Signal Processing for Robust Speech Recognition
Sort
View
ICASSP
2010
IEEE
15 years 6 months ago
Subspace Gaussian Mixture Models for speech recognition
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the...
Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar A...
ICASSP
2011
IEEE
14 years 9 months ago
Structured precision modelling with Cholesky Basis Superposition for speech recognition
Structured precision modelling is an important approach to improve the intra-frame correlation modelling of the standard HMM, where Gaussian mixture model with diagonal covariance...
Lei Jia, Kai Yu, Bo Xu
MIR
2003
ACM
161views Multimedia» more  MIR 2003»
15 years 11 months ago
Highlight scene extraction in real time from baseball live video
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
Yasuo Ariki, Masahito Kumano, Kiyoshi Tsukada
ICASSP
2009
IEEE
16 years 22 days ago
A flat direct model for speech recognition
We introduce a direct model for speech recognition that assumes an unstructured, i.e., flat text output. The flat model allows us to model arbitrary attributes and dependences o...
Georg Heigold, Geoffrey Zweig, Xiao Li, Patrick Ng...
TSD
2004
Springer
15 years 11 months ago
Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec
This paper presents a new unit selection process for Very Low Bit Rate speech encoding around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technol...
Marc Padellini, François Capman, Genevi&egr...