Sciweavers

» Advances in Document Engineering
LAWEB
2003
IEEE
16 years 10 hour ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
DFG
2003
Springer
15 years 12 months ago
Inter-organizational Business Process Management with XML Nets
Due to the fast growth of internet based electronic business activities, languages for modeling as well as methods for analyzing and executing distributed business processes are be...
Kirsten Lenz, Andreas Oberweis
ISMIS
2003
Springer
15 years 12 months ago
MetaNews: An Information Agent for Gathering News Articles on the Web
This paper presents MetaNews, an information gathering agent for news articles on the Web. MetaNews reads HTML documents from online news sites and extracts article information fro...
Dae-Ki Kang, Joongmin Choi
CIKM
2004
Springer
15 years 10 months ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang
ESWS
2008
Springer
15 years 8 months ago
Hybrid Search: Effectively Combining Keywords and Semantic Searches
This paper describes hybrid search, a search method supporting both document and knowledge retrieval via the flexible combination of ontology-based search and keyword-based matching...
Ravish Bhagdev, Sam Chapman, Fabio Ciravegna, Vita...