Sciweavers

298 search results - page 23 / 60
» An information-theoretic measure for document similarity
SAC
2010
ACM
16 years 29 days ago
Hypothesis generation and ranking based on event similarities
Accelerated by the technological advances in the domain, the size of the biomedical literature has been growing rapidly. As a result, it is not feasible for individual researchers...
Taiki Miyanishi, Kazuhiro Seki, Kuniaki Uehara
ISAAC
2005
Springer
15 years 11 months ago
On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm
In this paper, we prove for the first time that the learning complexity of Rocchio’s algorithm is O(d + d^2(log d + log n)) over the discretized vector space {0, ..., n − 1}^d,...
Zhixiang Chen, Bin Fu
CICLING
2007
Springer
16 years 8 days ago
Clustering Narrow-Domain Short Texts by Using the Kullback-Leibler Distance
Clustering short length texts is a difficult task itself, but adding the narrow domain characteristic poses an additional challenge for current clustering methods. We addressed thi...
David Pinto, José-Miguel Benedí, Pao...
SODA
2000
ACM
15 years 7 months ago
Communication complexity of document exchange
We address the problem of minimizing the communication involved in the exchange of similar documents. We consider two users, A and B, who hold documents x and y respectively. Neit...
Graham Cormode, Mike Paterson, Süleyman Cenk ...
DOCENG
2007
ACM
15 years 10 months ago
XML version detection
The problem of version detection is critical in many important application scenarios, including software clone identification, Web page ranking, plagiarism detection, and peer-to-...
Deise de Brum Saccol, Nina Edelweiss, Renata de Ma...