Sciweavers

874 search results - page 92 / 175
» Jedi: Extracting and Synthesizing Information from the Web
Sort
View
ICITA
2005
IEEE
15 years 12 months ago
Partition-Based Parallel PageRank Algorithm
A re-ranking technique,called “PageRank brings a successful story behind the search engine. Many studies focus on finding an way to compute the PageRank scores of a large web gr...
Arnon Rungsawang, Bundit Manaskasemsak
ITBAM
2010
15 years 4 months ago
MEDCollector: Multisource Epidemic Data Collector
This paper analyzes the requirements and presents a novel approach to the development of a system for epidemiological data collection and integration based on the principles of int...
João Zamite, Fabrício A. B. Silva, F...
WIDM
2004
ACM
15 years 11 months ago
Stylistic and lexical co-training for web block classification
Many applications which use web data extract information from a limited number of regions on a web page. As such, web page division into blocks and the subsequent block classifica...
Chee How Lee, Min-Yen Kan, Sandra Lai
IDEAS
2005
IEEE
142views Database» more  IDEAS 2005»
15 years 12 months ago
Automatically Maintaining Wrappers for Web Sources
A substantial subset of the web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or semantic information about the data it represents...
Juan Raposo, Alberto Pan, Manuel Álvarez, J...
ACL
2010
15 years 4 months ago
Learning 5000 Relational Extractors
Many researchers are trying to use information extraction (IE) to create large-scale knowledge bases from natural language text on the Web. However, the primary approach (supervis...
Raphael Hoffmann, Congle Zhang, Daniel S. Weld