Sciweavers

8701 search results - page 255 / 1741
» Protecting information on the Web
Sort
View
CEAS
2006
Springer
15 years 10 months ago
Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically
Just as email spam has negatively impacted the user messaging experience, the rise of Web spam is threatening to severely degrade the quality of information on the World Wide Web....
Steve Webb, James Caverlee, Calton Pu
ESWS
2008
Springer
15 years 8 months ago
Semantic Sitemaps: Efficient and Flexible Access to Datasets on the Semantic Web
Increasing amounts of RDF data are available on the Web for consumption by Semantic Web browsers and indexing by Semantic Web search engines. Current Semantic Web publishing practi...
Richard Cyganiak, Holger Stenzhorn, Renaud Delbru,...
CIIT
2007
133views Communications» more  CIIT 2007»
15 years 8 months ago
A unified interface for visual and interactive web search
The interfaces used by the top Web search engines have changed very little since the early days of Web search. These interfaces follow the traditional model of information retriev...
Orland Hoeber, Xue Dong Yang
EMNLP
2008
15 years 8 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
AAAI
2006
15 years 8 months ago
Using Semantics to Identify Web Objects
Many common web tasks can be automated by algorithms that are able to identify web objects relevant to the user's needs. This paper presents a novel approach to web object id...
Nathanael Chambers, James F. Allen, Lucian Galescu...