Sciweavers

142 search results - page 21 / 29
» Contemporaneous text as side-information in statistical lang...
CORR
2000
Springer
15 years 5 months ago
Prosody-Based Automatic Segmentation of Speech into Sentences and Topics
A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and topic units. Speech segm...
Elizabeth Shriberg, Andreas Stolcke, Dilek Z. Hakk...
INEX
2005
Springer
15 years 11 months ago
Parameter Estimation for a Simple Hierarchical Generative Model for XML Retrieval
This paper explores the possibility of using a modified Expectation-Maximization algorithm to estimate parameters for a simple hierarchical generative model for XML retr...
Paul Ogilvie, Jamie Callan
IDEAL
2000
Springer
15 years 9 months ago
An Off-Line Recognizer for Hand-Written Chinese Characters
An off-line hand-written Chinese character recognizer supporting a vocabulary of 4,616 Chinese characters, alphanumerics and punctuation symbols has been reported. Traine...
P. K. Wong
ACL
2003
15 years 7 months ago
Unsupervised Learning of Arabic Stemming Using a Parallel Corpus
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
Monica Rogati, J. Scott McCarley, Yiming Yang
IAT
2009
IEEE
16 years 24 days ago
An Intelligent Agent That Autonomously Learns How to Translate
We describe the design of an autonomous agent that can teach itself how to translate from a foreign language, by first assembling its own training set, then using it to improve...
Marco Turchi, Tijl De Bie, Nello Cristianini