Search Sciweavers | Sciweavers

1127 search results - page 39 / 226

» Web-scale extraction of structured data

144

click to vote

SIGMOD
2001
ACM

145views Database» more SIGMOD 2001»

Automatic Segmentation of Text into Structured Records

16 years 6 months ago

Download www.it.iitb.ac.in

In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...

Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...

claim paper

Read More »

192

click to vote

WWW
2010
ACM

188views Internet Technology» more WWW 2010»

Exploiting content redundancy for web information extraction

15 years 6 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

claim paper

Read More »

194

click to vote

IPM
2006

171views more IPM 2006»

Automatic extraction of bilingual word pairs using inductive chain learning in various languages

15 years 5 months ago

Download sig.media.eng.hokudai.ac.jp

In this paper, we propose a new learning method for extracting bilingual word pairs from parallel corpora in various languages. In cross-language information retrieval, the system...

Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi

claim paper

Read More »

167

click to vote

CICLING
2009
Springer

140views Natural Language Processing» more CICLING 2009»

Business Specific Online Information Extraction from German Websites

16 years 6 months ago

Download www.cis.uni-muenchen.de

This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...

Yeong Su Lee, Michaela Geierhos

claim paper

Read More »

149

click to vote

SAINT
2005
IEEE

120views Internet Technology» more SAINT 2005»

Learning Logic Wrappers for Information Extraction from the Web

15 years 11 months ago

Download software.ucv.ro

This paper discusses a methodology for applying general-purpose ﬁrst-order inductive learning to extract information from Web documents structured as unranked ordered trees. The...

Costin Badica, Elvira Popescu, Amelia Badica

claim paper

Read More »

« Prev « First page 39 / 226 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers