Sciweavers

135
Voted
INTERSPEECH
2010
15 years 23 days ago
Roles of the average voice in speaker-adaptive HMM-based speech synthesis
In speaker-adaptive HMM-based speech synthesis, there are a few speakers whose synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptat...
Junichi Yamagishi, Oliver Watts, Simon King, Bela ...
162
Voted
INTERSPEECH
2010
15 years 23 days ago
Sparse component analysis for speech recognition in multi-speaker environment
Sparse Component Analysis is a relatively young technique that relies upon a representation of signal occupying only a small part of a larger space. Mixtures of sparse components ...
Afsaneh Asaei, Hervé Bourlard, Philip N. Ga...
153
Voted
INTERSPEECH
2010
15 years 23 days ago
Speaker and language adaptive training for HMM-based polyglot speech synthesis
This paper proposes a technique for speaker and language adaptive training for HMM-based polyglot speech synthesis. Language-specific context-dependencies in the system are captur...
Heiga Zen
152
Voted
INTERSPEECH
2010
15 years 23 days ago
Speech dominoes and phonetic convergence
Interlocutors are known to mutually adapt during conversation. Recent studies have questioned the adaptation of phonological representations and kinematics of phonetic variables s...
Gérard Bailly, Amélie Lelong
163
Voted
INTERSPEECH
2010
15 years 23 days ago
Acoustic feature diversity and speaker verification
We present a new method for speaker verification that uses the diversity of information from multiple feature representations. The principle behind the method is that certain feat...
R. Padmanabhan, Hema A. Murthy
140
Voted
INTERSPEECH
2010
15 years 23 days ago
Can tongue be recovered from face? the answer of data-driven statistical models
This study revisits the face-to-tongue articulatory inversion problem in speech. We compare the Multi Linear Regression method (MLR) with two more sophisticated methods based on H...
Atef Ben Youssef, Pierre Badin, Gérard Bail...