Sciweavers

4234 search results - page 427 / 847
» A Method for Web Information Extraction
FSKD
2007
Springer
16 years 1 month ago
Using Fuzzy-Word Correlation Factors to Compute Document Similarity Based on Phrase Matching
One of the Web Information Retrieval (IR) problems these days is to identify redundant information that exists in (replicated) Web documents. These documents can easily be found in...
Jun won Lee, Yiu-Kai Ng
ECIR
2010
Springer
15 years 8 months ago
Mining Neighbors' Topicality to Better Control Authority Flow
Web pages are often recognized by others through contexts. These contexts determine how linked pages influence and interact with each other. When differentiating such interactions,...
Na Dai, Brian D. Davison, Yaoshuang Wang
WWW
2006
ACM
16 years 7 months ago
Towards practical genre classification of web documents
Classification of documents by genre is typically done either using linguistic analysis or term frequency based techniques. The former provides better classification accuracy than...
George Ferizis, Peter Bailey
ICASSP
2011
IEEE
14 years 10 months ago
Fast identification of JPEG 2000 images for digital cinema profiles
A method for identifying JPEG 2000 images with different coding parameters, such as DWT-filters, code-block sizes, quantization step sizes, and resolution levels, is presented. ...
Osamu Watanabe, Takahiro Fukuhara, Hitoshi Kiya
NAACL
2004
15 years 8 months ago
Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization
We consider the problem of modeling the content structure of texts within a specific domain, in terms of the topics the texts address and the order in which these topics appear. W...
Regina Barzilay, Lillian Lee