Sciweavers

» Querying the World Wide Web
DEEC
2007
IEEE
16 years 21 days ago
DeepBot: a focused crawler for accessing hidden web content
The crawler engines of today cannot reach most of the information contained in the Web. A great amount of valuable information is "hidden" behind the query forms of onli...
Manuel Álvarez, Juan Raposo, Alberto Pan, F...
ECIR
2008
Springer
15 years 7 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
CIDR
2003
15 years 7 months ago
Crossing the Structure Chasm
It has frequently been observed that most of the world’s data lies outside database systems. The reason is that database systems focus on structured data, leaving the unstructur...
Alon Y. Halevy, Oren Etzioni, AnHai Doan, Zachary ...
CSMR
1997
IEEE
15 years 10 months ago
Incremental Redocumentation with Hypertext
Redocumentation is the recovery and recording of software comprehension. Since software comprehension is the most expensive part of software maintenance, redocumentation is the ke...
Vaclav Rajlich
DBSEC
2007
15 years 7 months ago
XACML Policies for Exclusive Resource Usage
The extensible access control markup language (XACML) is the standard access control policy specification language of the World Wide Web. XACML does not provide exclusive accesse...
Vijayant Dhankhar, Saket Kaushik, Duminda Wijeseke...