This paper describes a complete low-complexity imaging system based on a single MEMS scanning mirror and a single photodetector, together with customized image enhancement algorit...
Ganchi Zhang, Li Li, Vladimir Stankovic, Lina Stan...
We present a variational Bayesian algorithm that enhances the log spectra of noisy speech using speaker dependent priors. This algorithm extends prior work by Frey et al. where th...
We propose a novel semi-supervised method for building a statistical model that represents the relationship between sounds and text labels (“tags”). The proposed method, named...
Jun Takagi, Yasunori Ohishi, Akisato Kimura, Masas...
Speech translation (ST) is an enabling technology for cross-lingual oral communication. A ST system consists of two major components: an automatic speech recognizer (ASR) and a ma...
We propose a new statistical model, named Hierarchical Topic Trajectory Model (HTTM), for acquiring a dynamically changing topic model that represents the relationship between vid...