Sciweavers

2452 search results - page 183 / 491
» A Language Modeling Approach to Information Retrieval
Sort
View
ANLP
1997
116views more  ANLP 1997»
15 years 7 months ago
A Maximum Entropy Approach to Identifying Sentence Boundaries
We present a trainable model for identifying sentence boundaries in raw text. Given a corpus annotated with sentence boundaries, our model learns to classify each occurrence of., ...
Jeffrey C. Reynar, Adwait Ratnaparkhi
ICASSP
2008
IEEE
16 years 27 days ago
An iterative unsupervised learning method for information distillation
Information distillation techniques are used to analyze and interpret large volumes of speech and text archives in multiple languages and produce structured information of interes...
Kamand Kamangar, Dilek Hakkani-Tür, Gökh...
CIKM
2009
Springer
16 years 1 months ago
Automatic generation of topic pages using query-based aspect models
We investigate the automatic generation of topic pages as an alternative to the current Web search paradigm. We describe a general framework, which combines query log analysis to ...
Niranjan Balasubramanian, Silviu Cucerzan
LREC
2008
141views Education» more  LREC 2008»
15 years 7 months ago
A Comparative Study on Language Identification Methods
In this paper we present two experiments conducted for comparison of different language identification algorithms. Short words-, frequent words- and n-gram-based approaches are co...
Lena Grothe, Ernesto William De Luca, Andreas N&uu...
SIGIR
2010
ACM
15 years 10 months ago
Multilingual PRF: english lends a helping hand
In this paper, we present a novel approach to Pseudo-Relevance Feedback (PRF) called Multilingual PRF (MultiPRF). The key idea is to harness multilinguality. Given a query in a la...
Manoj Kumar Chinnakotla, Karthik Raman, Pushpak Bh...