Search Sciweavers | Sciweavers

336 search results - page 61 / 68

» Content-based language models for spoken document retrieval

164

click to vote

CICLING
2010
Springer

174views Natural Language Processing» more CICLING 2010»

Word Length n-Grams for Text Re-use Detection

15 years 10 months ago

Download users.dsic.upv.es

Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...

Alberto Barrón-Cedeño, Chiara Basile...

claim paper

Read More »

173

click to vote

IJCNLP
2005
Springer

138views Natural Language Processing» more IJCNLP 2005»

Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora

15 years 11 months ago

Download www.cs.ust.hk

Abstract. We present a new implication of Wu’s (1997) Inversion Transduction Grammar (ITG) Hypothesis, on the problem of retrieving truly parallel sentence translations from larg...

Dekai Wu, Pascale Fung

claim paper

Read More »

163

click to vote

SIGIR
2009
ACM

136views Information Technology» more SIGIR 2009»

Estimating query performance using class predictions

16 years 13 days ago

Download research.microsoft.com

We investigate using topic prediction data, as a summary of document content, to compute measures of search result quality. Unlike existing quality measures such as query clarity ...

Kevyn Collins-Thompson, Paul N. Bennett

claim paper

Read More »

187

click to vote

SIGIR
2009
ACM

101views Information Technology» more SIGIR 2009»

Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization

16 years 13 days ago

Download eprints.pascal-network.org

This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization. At the ﬁrst stage, the proposed approach identiﬁes topic th...

Massih-Reza Amini, Nicolas Usunier

claim paper

Read More »

173

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

16 years 27 days ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

« Prev « First page 61 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers