Sciweavers

2513 search results - page 211 / 503
» Improving Generalization by Data Categorization
Sort
View
JCDL
2003
ACM
160views Education» more  JCDL 2003»
15 years 12 months ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
IQIS
2005
ACM
16 years 5 days ago
Exploiting relationships for object consolidation
Researchers in the data mining area frequently have to spend significant portion of their time on preprocessing the data in order to apply their algorithms to real-world datasets...
Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotr...
ICDE
2004
IEEE
127views Database» more  ICDE 2004»
16 years 8 months ago
Lazy Database Replication with Ordering Guarantees
Lazy replication is a popular technique for improving the performance and availability of database systems. Although there are concurrency control techniques which guarantee seria...
Khuzaima Daudjee, Kenneth Salem
ICML
2004
IEEE
16 years 7 months ago
Unifying collaborative and content-based filtering
Collaborative and content-based filtering are two paradigms that have been applied in the context of recommender systems and user preference prediction. This paper proposes a nove...
Justin Basilico, Thomas Hofmann
SIGMOD
2004
ACM
182views Database» more  SIGMOD 2004»
16 years 6 months ago
Efficient set joins on similarity predicates
In this paper we present an efficient, scalable and general algorithm for performing set joins on predicates involving various similarity measures like intersect size, Jaccard-coe...
Sunita Sarawagi, Alok Kirpal