Sciweavers

211 search results - page 23 / 43
» Texts semantic similarity detection based graph approach
Sort
View
KDD
2007
ACM
136views Data Mining» more  KDD 2007»
16 years 6 months ago
Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases
We now have incrementally-grown databases of text documents ranging back for over a decade in areas ranging from personal email, to news-articles and conference proceedings. While...
Benyah Shaparenko, Thorsten Joachims
ECAI
2008
Springer
15 years 7 months ago
On Cross-lingual Plagiarism Analysis using a Statistical Model
The automatic detection of plagiarism is a task that has acquired relevance in the Information Retrieval area and it becomes more complex when the plagiarism is made in a multiling...
Alberto Barrón-Cedeño, Paolo Rosso, ...
ICWSM
2009
15 years 3 months ago
Content Based Recommendation and Summarization in the Blogosphere
This paper presents a stochastic graph based method for recommending or selecting a small subset of blogs that best represents a much larger set. within a certain topic. Each blog...
Ahmed Hassan, Dragomir R. Radev, Junghoo Cho, Amru...
COLING
2010
15 years 29 days ago
Learning Summary Content Units with Topic Modeling
In the field of multi-document summarization, the Pyramid method has become an important approach for evaluating machine-generated summaries. The method is based on the manual ann...
Leonhard Hennig, Ernesto William De Luca, Sahin Al...
ASIAN
2007
Springer
102views Algorithms» more  ASIAN 2007»
16 years 5 days ago
A Static Birthmark of Binary Executables Based on API Call Structure
Abstract. A software birthmark is a unique characteristic of a program that can be used as a software theft detection. In this paper we suggest and empirically evaluate a static bi...
Seokwoo Choi, Heewan Park, Hyun-il Lim, Taisook Ha...