Sciweavers

184
Voted
COLING
2010
15 years 24 days ago
Multiword Expressions in the wild? The mwetoolkit comes in handy
The mwetoolkit is a tool for automatic extraction of Multiword Expressions (MWEs) from monolingual corpora. It both generates and validates MWE candidates. The generation is based...
Carlos Ramisch, Aline Villavicencio, Christian Boi...
177
Voted
ECIR
2010
Springer
15 years 7 months ago
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III
154
Voted
ACL
2008
15 years 7 months ago
Learning Bilingual Lexicons from Monolingual Corpora
We present a method for learning bilingual translation lexicons from monolingual corpora. Word types in each language are characterized by purely monolingual features, such as con...
Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatric...
149
Voted
ICASSP
2009
IEEE
16 years 17 days ago
Incorporating monolingual corpora into bilingual latent semantic analysis for crosslingual LM adaptation
The major limitation in bilingual latent semantic analysis (bLSA) is the requirement of parallel training corpora. Motivated by semi-supervised learning, we propose a clusterbased...
Yik-Cheung Tam, Tanja Schultz