Search Sciweavers | Sciweavers

4971 search results - page 754 / 995

» On Scalable Information Retrieval Systems

CN
2006

A framework for mining evolving trends in Web data streams using dynamic learning and retrospective validation

15 years 6 months ago

Download webmining.spd.louisville.edu

The expanding and dynamic nature of the Web poses enormous challenges to most data mining techniques that try to extract patterns from Web data, such as Web usage and Web content....

Olfa Nasraoui, Carlos Rojas, Cesar Cardona

PVLDB
2008

Industry-scale duplicate detection

15 years 6 months ago

Download www.hpi.uni-potsdam.de

Duplicate detection is the process of identifying multiple representations of a same real-world object in a data source. Duplicate detection is a problem of critical importance in...

Melanie Weis, Felix Naumann, Ulrich Jehle, Jens Lu...

WWW
2005
ACM

Internet Technology

Automatically learning document taxonomies for hierarchical classification

16 years 7 months ago

Download www.ideal.ece.utexas.edu

While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...

Kunal Punera, Suju Rajan, Joydeep Ghosh

ICSM
2009
IEEE

Software Engineering

On the use of relevance feedback in IR-based concept location

16 years 1 month ago

Download menzies.us

Concept location is a critical activity during software evolution as it produces the location where a change is to start in response to a modification request, such as, a bug repo...

Gregory Gay, Sonia Haiduc, Andrian Marcus, Tim Men...

DOCENG
2009
ACM

Document Analysis

Object-level document analysis of PDF files

16 years 1 month ago

Download www.dbai.tuwien.ac.at

The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...

Tamir Hassan
