Sciweavers

2764 search results - page 444 / 553
» Information Retrieval by Semantic Similarity
Sort
View
HT
2010
ACM
15 years 3 months ago
Citation based plagiarism detection: a new approach to identify plagiarized work language independently
This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze docu...
Bela Gipp, Jöran Beel
ICML
1997
IEEE
16 years 7 months ago
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
The Rocchio relevance feedback algorithm is one of the most popular and widely applied learning methods from information retrieval. Here, a probabilistic analysis of this algorith...
Thorsten Joachims
ACL
2004
15 years 7 months ago
Discovering Relations among Named Entities from Large Corpora
Discovering the significant relations embedded in documents would be very useful not only for information retrieval but also for question answering and summarization. Prior method...
Takaaki Hasegawa, Satoshi Sekine, Ralph Grishman
RIAO
2004
15 years 7 months ago
Multilingual document clusters discovery
Cross Language Information Retrieval community has brought up search engines over multilingual corpora, and multilingual text categorization systems. In this paper, we focus on th...
Benoît Mathieu, Romaric Besançon, Chr...
NAACL
2003
15 years 7 months ago
Automating XML markup of text documents
We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Se...
Shazia Akhtar, Ronan G. Reilly, John Dunnion