Search Sciweavers | Sciweavers

ICDE
2012
IEEE

205 views, Database

Optimizing Statistical Information Extraction Programs over Evolving Text

13 years 9 months ago

Statistical information extraction (IE) programs are increasingly used to build real-world IE systems such as Alibaba, CiteSeer, Kylin, and YAGO. Current statistical IE approach...

Fei Chen, Xixuan Feng, Christopher Re, Min Wang

WSDM
2009
ACM

138 views, Data Mining

Adaptive subjective triggers for opinionated document retrieval

16 years 1 month ago

Download www.ai.cs.kobe-u.ac.jp

This paper proposes a novel application of a statistical language model to opinionated document retrieval targeting weblogs (blogs). In particular, we explore the use of the trigg...

Kazuhiro Seki, Kuniaki Uehara

CIKM
2008
Springer

138 views, Information Technology

Identifying table boundaries in digital documents via sparse line detection

15 years 8 months ago

Download chemxseer.ist.psu.edu

Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...

Ying Liu, Prasenjit Mitra, C. Lee Giles

WWW
2009
ACM

213 views, Internet Technology

Extracting article text from the web with maximum subsequence segmentation

16 years 7 months ago

Download www2009.org

Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...

Jeff Pasternack, Dan Roth

KDD
2002
ACM

170 views, Data Mining

Enhanced word clustering for hierarchical text classification

16 years 7 months ago

Download www.cs.utexas.edu

In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering"...

Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...
