Sciweavers

1008 search results - page 127 / 202
» Statistical search on the Semantic Web
Sort
View
MIE
2008
123views Healthcare» more  MIE 2008»
15 years 7 months ago
Searching Related Resources in a Quality Controlled Health Gateway: a Feasibility Study
Objective: The neighbors of a document are those documents in a corpus that are most similar to it. The objective of this paper is to develop and evaluate the related resources alg...
Tayeb Merabti, Suzanne Pereira, Catherine Letord, ...
SIGIR
2004
ACM
15 years 11 months ago
The document as an ergodic markov chain
In recent years, statistical language models are being proposed as alternative to the vector space model. Viewing documents as language samples introduces the issue of defining a...
Eduard Hoenkamp, Dawei Song
WWW
2006
ACM
16 years 8 days ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
LREC
2008
117views Education» more  LREC 2008»
15 years 7 months ago
A Suite to Compile and Analyze an LSP Corpus
This paper presents a series of tools for the extraction of specialized corpora from the web and its subsequent analysis mainly with statistical techniques. It is an integrated sy...
Rogelio Nazar, Jorge Vivaldi, Teresa Cabré
WWW
2006
ACM
16 years 7 months ago
Probabilistic models for discovering e-communities
The increasing amount of communication between individuals in e-formats (e.g. email, Instant messaging and the Web) has motivated computational research in social network analysis...
Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, H...