Sciweavers

17688 search results - page 468 / 3538
» Data Set Balancing
Sort
View
WWW
2005
ACM
16 years 7 months ago
Web data cleansing for information retrieval using key resource page selection
With the page explosion of WWW, how to cover more useful information with limited storage and computation resources becomes more and more important in web IR research. Using web p...
Yiqun Liu, Canhui Wang, Min Zhang, Shaoping Ma
ICDE
2010
IEEE
199views Database» more  ICDE 2010»
16 years 6 months ago
Fuzzy Matching of Web Queries to Structured Data
Recognizing the alternative ways people use to reference an entity, is important for many Web applications that query structured data. In such applications, there is often a mismat...
Tao Cheng, Hady Wirawan Lauw, Stelios Paparizos
SAC
2009
ACM
16 years 1 months ago
Parameterless outlier detection in data streams
Outlyingness is a subjective concept relying on the isolation level of a (set of) record(s). Clustering-based outlier detection is a field that aims to cluster data and to detect...
Alice Marascu, Florent Masseglia
ISMDA
2005
Springer
16 years 10 days ago
Relevance, Redundancy and Differential Prioritization in Feature Selection for Multiclass Gene Expression Data
The large number of genes in microarray data makes feature selection techniques more crucial than ever. From various ranking-based filter procedures to classifier-based wrapper tec...
Chia Huey Ooi, Madhu Chetty, Shyh Wei Teng
PAM
2005
Springer
16 years 10 days ago
Analysis of Communities of Interest in Data Networks
Abstract. Communities of interest (COI) have been applied in a variety of environments ranging from characterizing the online buying behavior of individuals to detecting fraud in t...
William Aiello, Charles R. Kalmanek, Patrick Drew ...