Sciweavers

5647 search results - page 92 / 1130
» A word from the editor
COLING
2008
15 years 7 months ago
Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation
Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...
Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...
LREC
2008
15 years 7 months ago
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Parallel corpora are critical resources for machine translation research and development since parallel corpora contain translation equivalences of various granularities. Manual a...
Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing...
NAACL
2007
15 years 7 months ago
Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages
We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and ...
Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo...
CLIN
2004
15 years 7 months ago
Syntactic Contexts for Finding Semantically Related Words
Finding semantically related words is a first step in the direction of automatic ontology building. Guided by the view that similar words occur in similar contexts, we looked at t...
Lonneke van der Plas, Gosse Bouma
TCS
2008
15 years 6 months ago
Parikh matrices and amiable words
Using the fact that the Parikh matrix mapping is not an injective mapping, the paper investigates some properties of the set of words with the same Parikh matrix; these words are ...
Adrian Atanasiu, Radu Atanasiu, Ion Petre