Cloaking and redirection are two possible search engine spamming techniques. In order to understand cloaking and redirection on the Web, we downloaded two sets of Web pages while ...
Web spiders are a widely used means of obtaining information for search engines. As the size of the Web grows, it becomes a natural choice to parallelize the spider’s crawling proc...
The number of vertical search engines and portals has increased rapidly in recent years, making the importance of a topic-driven (focused) crawler evident. In this paper, we de...
We present the INSYSE method for the annotation of texts, based on extraction of semantic relations from syntactic structures. We apply this method to a corpus of 5000 Medline abstracts ab...
Laurent Alamarguy, Rose Dieng-Kuntz, Catherine Far...
Gazetteer services are an important component in a wide variety of systems, including geographic search engines and question answering systems. Unfortunately, the footprints provid...
Steven Schockaert, Martine De Cock, Etienne E. Ker...