Sciweavers

2929 search results - page 244 / 586
» Models of English Text
Sort
View
SIGIR
2010
ACM
15 years 1 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
ICDAR
2011
IEEE
14 years 6 months ago
Segmentation and Normalisation in Grapheme Codebooks
Abstract—The grapheme codebook is a high-performing technique for offline writer identification. This paper considers whether the de facto standards for initial grapheme extrac...
Tara Gilliam, Richard C. Wilson, John A. Clark
ACL
2012
13 years 9 months ago
ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora
The lack of parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated translation. A possible...
Marcis Pinnis, Radu Ion, Dan Stefanescu, Fangzhong...
ICCV
2011
IEEE
14 years 6 months ago
Learning Cross-modality Similarity for Multinomial Data
Many applications involve multiple-modalities such as text and images that describe the problem of interest. In order to leverage the information present in all the modalities, on...
Yangqing Jia, Mathieu Salzmann, Trevor Darrell
CLEF
2003
Springer
15 years 11 months ago
ITC-irst at CLEF 2003: Monolingual, Bilingual, and Multilingual Information Retrieval
This paper reports on the participation of ITC-irst in the Cross Language Evaluation Forum 2003; in particular, in the monolingual, bilingual, small multilingual, and spoken docum...
Nicola Bertoldi, Marcello Federico