Sciweavers

» Query by document
KDD
2008
ACM
120 views, Data Mining
16 years 6 months ago
Entity categorization over large document collections
Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...
Arnd Christian König, Rares Vernica, Venkates...
ACMICEC
2006
ACM
141 views, ECommerce
16 years 11 days ago
From HTML documents to web tables and rules
We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...
Kai Simon, Georg Lausen, Harold Boley
IDEAS
2003
IEEE
96 views, Database
15 years 11 months ago
Evaluating Nested Queries on XML Data
In the past few years, much attention has been paid to the study of semistructured data, i.e., data with irregular, possibly unstable, and rapidly changing structure, and, in part...
Carlo Sartiani
ICADL
2007
Springer
132 views, Education
16 years 16 days ago
On Building a Full-Text Digital Library of Historical Documents
The National Taiwan University Library has built a digital library of historical documents about Taiwan. The content is unique in that it covers about 80% of all primary Chinese hi...
Szu-Pei Chen, Jieh Hsiang, Hsieh-Chang Tu, Micha W...
VLDB
2005
ACM
126 views, Database
15 years 12 months ago
Hubble: An Advanced Dynamic Folder Technology for XML
A significant amount of information is stored in computer systems today, but people are struggling to manage their documents such that the information is easily found. XML is a de...
Ning Li, Joshua Hui, Hui-I Hsiao, Kevin S. Beyer