Sciweavers

5107 search results - page 421 / 1022
» Data Mining and Information Retrieval
Sort
View
166
Voted
SEMWEB
2007
Springer
16 years 29 days ago
Sindice.com: Weaving the Open Linked Data
Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The “linked d...
Giovanni Tummarello, Renaud Delbru, Eyal Oren
186
Voted
SIGIR
2003
ACM
16 years 3 days ago
Decision-Theoretic Resource Selection for Different Data Types in MIND
In a federated digital library system, it is too expensive to query every accessible library. Resource selection is the task to decide to which libraries a query should be routed. ...
Henrik Nottelmann, Norbert Fuhr
189
Voted
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
16 years 26 days ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
185
Voted
SAC
1997
ACM
15 years 11 months ago
The RasDaMan approach to multidimensional database management
Multidimensional discrete data (MDD), i.e., arrays of arbitrary size, dimension, and base type, are receiving growing attention among the database community. MDD occur in a variet...
Peter Baumann, Paula Furtado, Roland Ritsch, Norbe...
154
Voted
PVLDB
2008
99views more  PVLDB 2008»
15 years 6 months ago
Industry-scale duplicate detection
Duplicate detection is the process of identifying multiple representations of a same real-world object in a data source. Duplicate detection is a problem of critical importance in...
Melanie Weis, Felix Naumann, Ulrich Jehle, Jens Lu...