Sciweavers

298 search results - page 39 / 60
» An information-theoretic measure for document similarity
Sort
View
SPIRE
2004
Springer
15 years 11 months ago
Dealing with Syntactic Variation Through a Locality-Based Approach
To date, attempts for applying syntactic information in the document-based retrieval model dominant have led to little practical improvement, mainly due to the problems associated ...
Jesús Vilares Ferro, Miguel A. Alonso
CICLING
2010
Springer
15 years 10 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
ICPR
2006
IEEE
16 years 7 months ago
A General Framework for Agglomerative Hierarchical Clustering Algorithms
This paper presents a general framework for agglomerative hierarchical clustering based on graphs. Specifying an inter-cluster similarity measure, a subgraph of the similarity gra...
Reynaldo Gil-García, José Manuel Bad...
CAE
2007
15 years 8 months ago
Extracting the Essence from Sets of Images
We use a set of photographs showing similar scenes as a model for a single photograph this scene. A distance measure for this model is defined by correlating the neigborhoods of p...
Marc Alexa
SIGIR
2010
ACM
15 years 26 days ago
Three web-based heuristics to determine a person's or institution's country of origin
We propose three heuristics to determine the country of origin of a person or institution via text-based IE from the Web. We evaluate all methods on a collection of music artists ...
Markus Schedl, Klaus Seyerlehner, Dominik Schnitze...