Sciweavers

2080 search results - page 136 / 416
» Collections, Cardinalities, and Relations
WWW
2002
ACM
16 years 7 months ago
Parallel crawlers
In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...
Junghoo Cho, Hector Garcia-Molina
DEXAW
2008
IEEE
16 years 26 days ago
Proximity Estimation and Hardness of Short-Text Corpora
In this work, we investigate the relative hardness of short-text corpora in clustering problems and how this hardness relates to traditional similarity measures. Our appr...
Marcelo Luis Errecalde, Diego Ingaramo, Paolo Ross...
DIS
2007
Springer
16 years 17 days ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
RULEML
2007
Springer
16 years 16 days ago
Querying the Semantic Web with SWRL
The SWRLTab is a development environment for working with SWRL rules in Protégé-OWL. It supports the editing and execution of SWRL rules. It also provides mechanisms to allow int...
Martin J. O'Connor, Samson W. Tu, Csongor Nyulas, ...
ICSM
2006
IEEE
16 years 13 days ago
Working Session: Information Retrieval Based Approaches in Software Evolution
During software evolution, a collection of related artifacts with different representations is created. Some of these are composed of structured data (e.g., analysis data), some c...
Andrian Marcus, Andrea De Lucia, Jane Huffman Haye...