Sciweavers

3705 search results - page 270 / 741
» Building Documentation Generators
Sort
View
ICDAR
2009
IEEE
15 years 4 months ago
An Open Source Tesseract Based Optical Character Recognizer for Bangla Script
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the Center for Research on Bangla Language Pr...
Md. Abul Hasnat, Muttakinur Rahman Chowdhury, Mumi...
204
Voted
WWW
2006
ACM
16 years 7 months ago
Large-scale text categorization by batch mode active learning
Large-scale text categorization is an important research topic for Web data mining. One of the challenges in large-scale text categorization is how to reduce the amount of human e...
Steven C. H. Hoi, Rong Jin, Michael R. Lyu
WWW
2004
ACM
16 years 7 months ago
Efficient web change monitoring with page digest
The Internet and the World Wide Web have enabled a publishing explosion of useful online information, which has produced the unfortunate side effect of information overload: it is...
David Buttler, Daniel Rocco, Ling Liu
CLEF
2006
Springer
15 years 10 months ago
TALP at GeoCLEF 2006: Experiments Using JIRS and Lucene with the ADL Feature Type Thesaurus
This paper describes our experiments in Geographical Information Retrieval (GIR) in the context of our participation in the GeoCLEF 2006 Monolingual English task. The TALPGeoIR sy...
Daniel Ferrés, Horacio Rodríguez
ASC
2000
15 years 8 months ago
Knowledge Management through Content Interpretation
The improved performance of computer-based text analysis represents a major step forward for knowledge management. Reliable text interpretation allows focus to be placed upon the ...
R. R. Jones, Bernt A. Bremdal, C. Spaggiari, F. Jo...