Search Sciweavers | Sciweavers

2190 search results - page 342 / 438

» Unweaving a web of documents

164

click to vote

DRR
2008

141views Document Analysis» more DRR 2008»

Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms

15 years 7 months ago

Download lhncbc.nlm.nih.gov

There is a strong demand for developing automated tools for extracting pertinent information from the biomedical literature that is a rich, complex, and dramatically growing resou...

In-Cheol Kim, Daniel X. Le, George R. Thoma

claim paper

Read More »

153

click to vote

DGO
2006

134views Education» more DGO 2006»

Next steps in near-duplicate detection for eRulemaking

15 years 7 months ago

Download www.cs.cmu.edu

Large volume public comment campaigns and web portals that encourage the public to customize form letters produce many near-duplicate documents, which increases processing and sto...

Hui Yang, Jamie Callan, Stuart W. Shulman

claim paper

Read More »

159

click to vote

WWW
2005
ACM

116views Internet Technology» more WWW 2005»

A search engine for natural language applications

16 years 7 months ago

Download turing.cs.washington.edu

Many modern natural language-processing applications utilize search engines to locate large numbers of Web documents or to compute statistics over the Web corpus. Yet Web search e...

Michael J. Cafarella, Oren Etzioni

claim paper

Read More »

155

click to vote

EDBT
2002
ACM

159views Database» more EDBT 2002»

Cut-and-Pick Transactions for Proxy Log Mining

16 years 6 months ago

Download www.cs.ust.hk

Web logs collected by proxy servers, referred to as proxy logs or proxy traces, contain information about Web document accesses by many users against many Web sites. This "man...

Wenwu Lou, Guimei Liu, Hongjun Lu, Qiang Yang

claim paper

Read More »

161

click to vote

WWW
2010
ACM

220views Internet Technology» more WWW 2010»

Not so creepy crawler: easy crawler generation with standard xml queries

16 years 1 months ago

Download www2.pms.ifi.lmu.de

Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...

Franziska von dem Bussche, Klara A. Weiand, Benedi...

claim paper

Read More »

« Prev « First page 342 / 438 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers