Sciweavers

2718 search results - page 375 / 544
» Querying the deep web
PODS
2004
ACM
16 years 6 months ago
The Lixto Data Extraction Project - Back and Forth between Theory and Practice
We present the Lixto project, which is both a research project in database theory and a commercial enterprise that develops Web data extraction (wrapping) and Web service definiti...
Georg Gottlob, Christoph Koch, Robert Baumgartner,...
ECIR
2009
Springer
16 years 3 months ago
Quality-Oriented Search for Depression Portals
The problem of low-quality information on the Web is nowhere more important than in the domain of health, where unsound information and misleading advice can have serious consequen...
Thanh Tin Tang, David Hawking, Ramesh S. Sankarana...
SIGIR
2009
ACM
16 years 1 month ago
Building enriched document representations using aggregated anchor text
It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....
Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...
ICDE
2008
IEEE
16 years 1 month ago
OntoNet: Scalable knowledge-based networking
Recent years have seen a proliferation of work on the Semantic Web, an initiative to enable intelligent agents to reason about and utilize World Wide Web content and services. Con...
Joseph B. Kopena, Boon Thau Loo
IPPS
2008
IEEE
16 years 1 month ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how to use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic Data Gathering, Analysis, and Retrieval system)....
Douglas A. Lyon