Sciweavers

2382 search results - page 208 / 477
» Generating Seeded Trees from Data Sets
Sort
View
NIPS
2003
15 years 8 months ago
Hierarchical Topic Models and the Nested Chinese Restaurant Process
We address the problem of learning topic hierarchies from data. The model selection problem in this domain is daunting—which of the large collection of possible trees to use? We...
David M. Blei, Thomas L. Griffiths, Michael I. Jor...
ISMB
1998
15 years 8 months ago
Phylogenetic Inference in Protein Superfamilies: Analysis of SH2 Domains
This workfocuses on the inference of evolutionary relationships in protein superfamilies, and the uses of these relationships to identify keypositions in the structure, to infer a...
Kimmen Sjölander
SIGMOD
2008
ACM
164views Database» more  SIGMOD 2008»
16 years 6 months ago
Finding frequent items in probabilistic data
Computing statistical information on probabilistic data has attracted a lot of attention recently, as the data generated from a wide range of data sources are inherently fuzzy or ...
Qin Zhang, Feifei Li, Ke Yi
DCC
2007
IEEE
16 years 6 months ago
Nonuniform Compression in Databases with Haar Wavelet
Data synopsis is a lossy compressed representation of data stored into databases that helps the query optimizer to speed up the query process, e.g. time to retrieve the data from ...
S. Chen, A. Nucci
KDD
1998
ACM
148views Data Mining» more  KDD 1998»
15 years 10 months ago
Group Bitmap Index: A Structure for Association Rules Retrieval
Discovery of association rules from large databases of item sets is an important data mining problem. Association rules are usually stored in relational databases for future use i...
Tadeusz Morzy, Maciej Zakrzewicz