Sciweavers

279 search results - page 15 / 56
» Probability Based Clustering for Document and User Propertie...
ICDAR
2003
IEEE
15 years 11 months ago
Indexing and retrieval of words in old documents
This paper describes a system for efficient indexing and retrieval of words in collections of document images. The proposed method is based on two main principles: unsupervised pr...
Simone Marinai, Emanuele Marino, Giovanni Soda

Publication
203 views
15 years 6 months ago
Multigraph Sampling of Online Social Networks
State-of-the-art techniques for probability sampling of users of online social networks (OSNs) are based on random walks on a single social relation. While powerful, these methods ...
Minas Gjoka, Carter T. Butts, Maciej Kurant, Athin...
KDD
2007
ACM
237 views, Data Mining
16 years 6 months ago
Knowledge discovery of multiple-topic document using parametric mixture model with Dirichlet prior
Documents, such as those seen on Wikipedia and Folksonomy, have tended to be assigned with multiple topics as a meta-data. Therefore, it is more and more important to analyze a re...
Issei Sato, Hiroshi Nakagawa
AGENTS
1998
Springer
15 years 10 months ago
WebACE: A Web Agent for Document Categorization and Exploration
We propose an agent for exploring and categorizing documents on the World Wide Web based on a user profile. The heart of the agent is an automatic categorization of a set of docume...
Eui-Hong Han, Daniel Boley, Maria L. Gini, Robert ...
COLING
2010
15 years 28 days ago
Open Entity Extraction from Web Search Query Logs
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mi...
Alpa Jain, Marco Pennacchiotti