Search Sciweavers | Sciweavers

6974 search results - page 966 / 1395

» Querying Semi-Structured Data

WWW
2007
ACM

162 views, Internet Technology

Detecting near-duplicates for web crawling

16 years 7 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

KDD
2008
ACM

128 views, Data Mining

Scaling up text classification for large file systems

16 years 7 months ago

Download www.hpl.hp.com

We combine the speed and scalability of information retrieval with the generally superior classification accuracy offered by machine learning, yielding a two-phase text classifie...

George Forman, Shyamsundar Rajaram

VLDB
2007
ACM

121 views, Database

Ranked Subsequence Matching in Time-Series Databases

16 years 7 months ago

Download www.vldb.org

Existing work on similar sequence matching has focused on either whole matching or range subsequence matching. In this paper, we present novel methods for ranked subsequence match...

Wook-Shin Han, Jinsoo Lee, Yang-Sae Moon, Haifeng ...

SIGMOD
2004
ACM

174 views, Database

PIPES - A Public Infrastructure for Processing and Exploring Streams

16 years 7 months ago

Download dbs.mathematik.uni-marburg.de

PIPES is a flexible and extensible infrastructure providing fundamental building blocks to implement a data stream management system (DSMS). It is seamlessly integrated into the J...

Bernhard Seeger, Jürgen Krämer

EDBT
2004
ACM

131 views, Database

Declustering Two-Dimensional Datasets over MEMS-Based Storage

16 years 7 months ago

Download www.eecs.northwestern.edu

Due to the large difference between seek time and transfer time in current disk technology, it is advantageous to perform large I/O using a single sequential access rather than mu...

Hailing Yu, Divyakant Agrawal, Amr El Abbadi
