Sciweavers

2771 search results - page 268 / 555
» Advances in Document Engineering
Sort
View
WIDM
1998
ACM
15 years 11 months ago
WebML: Querying the World-Wide Web for Resources and Knowledge
There is a massive increase of information available on electronic networks. This profusion of resources on the WorldWide Web gave rise to considerable interest in the research co...
Osmar R. Zaïane, Jiawei Han
WEBDB
2000
Springer
131views Database» more  WEBDB 2000»
15 years 10 months ago
Automatic Classification of Text Databases Through Query Probing
Many text databases on the web are "hidden" behind search interfaces, and their documents are only accessible through querying. Search engines typically ignore the conte...
Panagiotis G. Ipeirotis, Luis Gravano, Mehran Saha...
LREC
2008
169views Education» more  LREC 2008»
15 years 8 months ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
ACL
2003
15 years 8 months ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use...
Joohui An, Seungwoo Lee, Gary Geunbae Lee
IIS
2004
15 years 8 months ago
Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data
Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search hits list, returned from a search engine. In this paper we present t...
Stanislaw Osinski, Dawid Weiss