Sciweavers

» Web Document Modeling
HT
2009
ACM
16 years 1 month ago
The redocumentation process of computer mediated activity traces: a general framework
The digital world enables the creation of personalized documents. In this paper we are interested in describing a computer mediated activity by a person throughout a semi-automati...
Leila Yahiaoui, Yannick Prié, Zizette Boufa...
LAWEB
2003
IEEE
15 years 11 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
IPM
2006
15 years 6 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
WWW
2003
ACM
16 years 7 months ago
Dynamic maintenance of web indexes using landmarks
Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...
Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...
EUROMICRO
2003
IEEE
15 years 11 months ago
Web Service Engineering with DIWE
A Web service is frequently defined as browser-less access to content on a Web site. The industry’s focus to date has been on providing easy-to-use low-level libraries, tools a...
Engin Kirda, Clemens Kerer, Christopher Krüge...