Sciweavers

3530 search results - page 539 / 706
» Technology of Text Mining
CEAS
2007
Springer
15 years 10 months ago
Learning Fast Classifiers for Image Spam
Recently, spammers have proliferated "image spam", emails which contain the text of the spam message in a human-readable image instead of the message body, making detect...
Mark Dredze, Reuven Gevaryahu, Ari Elias-Bachrach
AIRWEB
2006
Springer
15 years 10 months ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
AWIC
2003
Springer
15 years 10 months ago
A Natural Language Interface for Information Retrieval on Semantic Web Documents
We present a dialogue system that enables natural-language access to a web information retrieval system. We use a Web Semantic Language to model the knowledge conv...
Paulo Quaresma, Irene Pimenta Rodrigues
HT
1991
ACM
15 years 10 months ago
What's Eliza Doing in the Chinese Room? Incoherent Hyperdocuments - and How to Avoid Them
Research on understanding linear texts has shown that comprehension and navigation mainly depend on the reader’s ability to construct a coherent mental representation. While the...
Manfred Thüring, Jörg M. Haake, Jör...
AIRWEB
2008
Springer
15 years 8 months ago
Cleaning search results using term distance features
The presence of Web spam in query results is one of the critical challenges facing search engines today. While search engines try to combat the impact of spam pages on their resul...
Josh Attenberg, Torsten Suel