Sciweavers

Search results for: Clustering by pattern similarity in large data sets
LREC
2010
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian
The theoretical characterisation of multiword expressions (MWEs) is tightly connected to their actual occurrences in data and to their representation in lexical resources. We pres...
Andrea Zaninello, Malvina Nissim
WWW
2008
ACM
Query-sets: using implicit feedback and query patterns to organize web documents
In this paper we present a new document representation model based on implicit user feedback obtained from search engine queries. The main objective of this model is to achieve be...
Barbara Poblete, Ricardo A. Baeza-Yates
SPIESR
2003
Media segmentation using self-similarity decomposition
We present a framework for analyzing the structure of digital media streams. Though our methods work for video, text, and audio, we concentrate on detecting the structure of digit...
Jonathan Foote, Matthew L. Cooper
SDM
2007
SIAM
On Demand Phenotype Ranking through Subspace Clustering
High throughput biotechnologies have enabled scientists to collect a large number of genetic and phenotypic attributes for a large collection of samples. Computational methods are...
Xiang Zhang, Wei Wang, Jun Huan
AI
2007
Springer
Fuzzy Clustering for Topic Analysis and Summarization of Document Collections
Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common an...
René Witte, Sabine Bergler