Sciweavers

4645 search results - page 643 / 929
» Using Information Extraction to Improve Document Retrieval
Sort
View
CIKM
2010
Springer
15 years 4 months ago
Combining link and content for collective active learning
In this paper, we study a novel problem Collective Active Learning, in which we aim to select a batch set of "informative" instances from a networking data set to query ...
Lixin Shi, Yuhang Zhao, Jie Tang
CICLING
2008
Springer
15 years 8 months ago
Arabic/English Multi-document Summarization with CLASSY - The Past and the Future
Abstract. Automatic document summarization has become increasingly important due to the quantity of written material generated worldwide. Generating good quality summaries enables ...
Judith D. Schlesinger, Dianne P. O'Leary, John M. ...
ICDAR
1999
IEEE
15 years 11 months ago
Multifont Classification using Typographical Attributes
This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders a...
Min-Chul Jung, Yong-Chul Shin, Sargur N. Srihari
SMC
2010
IEEE
186views Control Systems» more  SMC 2010»
15 years 5 months ago
Semantic enrichment of text representation with wikipedia for text classification
—Text classification is a widely studied topic in the area of machine learning. A number of techniques have been developed to represent and classify text documents. Most of the t...
Hiroki Yamakawa, Jing Peng, Anna Feldman
ICDM
2007
IEEE
129views Data Mining» more  ICDM 2007»
16 years 1 months ago
Semi-supervised Clustering Using Bayesian Regularization
Text clustering is most commonly treated as a fully automated task without user supervision. However, we can improve clustering performance using supervision in the form of pairwi...
Zuobing Xu, Ram Akella, Mike Ching, Renjie Tang