Sciweavers

19892 search results - page 379 / 3979
» The POSTGRES Data Model
Sort
View
JCST
2008
121views more  JCST 2008»
15 years 6 months ago
Clustering Text Data Streams
Abstract Clustering text data streams is an important issue in data mining community and has a number of applications such as news group filtering, text crawling, document organiza...
Yubao Liu, Jiarong Cai, Jian Yin, Ada Wai-Chee Fu
ICDE
2012
IEEE
277views Database» more  ICDE 2012»
13 years 9 months ago
Aggregate Query Answering on Possibilistic Data with Cardinality Constraints
— Uncertainties in data arise for a number of reasons: when the data set is incomplete, contains conflicting information or has been deliberately perturbed or coarsened to remov...
Graham Cormode, Divesh Srivastava, Entong Shen, Ti...
192
Voted
WWW
2006
ACM
16 years 7 months ago
Time-dependent semantic similarity measure of queries using historical click-through data
It has become a promising direction to measure similarity of Web search queries by mining the increasing amount of clickthrough data logged by Web search engines, which record the...
Qiankun Zhao, Steven C. H. Hoi, Tie-Yan Liu, Soura...
GECCO
2008
Springer
137views Optimization» more  GECCO 2008»
15 years 7 months ago
Informative sampling for large unbalanced data sets
Selective sampling is a form of active learning which can reduce the cost of training by only drawing informative data points into the training set. This selected training set is ...
Zhenyu Lu, Anand I. Rughani, Bruce I. Tranmer, Jos...
BMCBI
2008
122views more  BMCBI 2008»
15 years 6 months ago
Generating samples for association studies based on HapMap data
Background: With the completion of the HapMap project, a variety of computational algorithms and tools have been proposed for haplotype inference, tag SNP selection and genome-wid...
Jing Li, Yixuan Chen