Sciweavers

8316 search results - page 407 / 1664
» Web Document Modeling
Sort
View
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
16 years 25 days ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
ACSD
2005
IEEE
153views Hardware» more  ACSD 2005»
16 years 14 days ago
BoPi - A Distributed Machine for Experimenting Web Services Technologies
BoPi is a programming language with a runtime support that allows the distribution and the execution of programs over the network. The language is a process calculus with XML valu...
Samuele Carpineti, Cosimo Laneve, Paolo Milazzo
WWW
2007
ACM
16 years 7 months ago
Web projections: learning from contextual subgraphs of the web
Graphical relationships among web pages have been leveraged as sources of information in methods for ranking search results. To date, specific graphical properties have been used ...
Jure Leskovec, Susan T. Dumais, Eric Horvitz
DEXAW
1999
IEEE
109views Database» more  DEXAW 1999»
15 years 11 months ago
Detection of Polygonal Frames in Complex Document Images
A robust method for the localization of frames within document images is presented. It aims at detecting regions delimited by closed polygonal lines or edges in complex color, gra...
Stefano Messelodi, Carla Maria Modena
EUROCAST
2009
Springer
116views Hardware» more  EUROCAST 2009»
15 years 10 months ago
Complete Sets of Hamiltonian Circuits for Classification of Documents
The calculation of Hamiltonian Circuits is an NP-complete task. This paper uses slightly modified complete sets of Hamiltonian circuits for the classification of documents. The sol...
Bernd Steinbach, Christian Posthoff