Sciweavers

2875 search results - page 269 / 575
» Information Retrieval in Structured Domains
Sort
View
SIGIR
2010
ACM
15 years 10 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
JODS
2008
424views Data Mining» more  JODS 2008»
15 years 6 months ago
Semantically Processing Parallel Colour Descriptions
Information integration and retrieval are useful tasks in many information systems. In these systems, it is far from an easy task to directly integrate information from natural lan...
Shenghui Wang, Jeff Z. Pan
DGO
2003
128views Education» more  DGO 2003»
15 years 8 months ago
A Study on Automatic Ontology Mapping of Categorical Information
Semantic heterogeneity of information is a major barrier of information and system interoperability. Defining ontology of data and mapping ontologies among heterogeneous informati...
Naijun Zhou
ICDE
2008
IEEE
166views Database» more  ICDE 2008»
16 years 8 months ago
A Clustered Index Approach to Distributed XPath Processing
Supporting top-k queries over distributed collections of schemaless XML data poses two challenges. While XML supports expressive query languages such as XPath and XQuery, these la...
Georgia Koloniari, Evaggelia Pitoura
IPSJ
1994
138views more  IPSJ 1994»
15 years 8 months ago
The TSIMMIS Project: Integration of Heterogeneous Information Sources
The goal of the Tsimmis Project is to develop tools that facilitate the rapid integration of heterogeneous information sources that may include both structured and unstructured da...
Sudarshan S. Chawathe, Hector Garcia-Molina, Joach...