Search Sciweavers | Sciweavers

2513 search results - page 171 / 503

» Improving Generalization by Data Categorization

187

click to vote

GECCO
2008
Springer

137views Optimization» more GECCO 2008»

Informative sampling for large unbalanced data sets

15 years 7 months ago

Download www.cs.uvm.edu

Selective sampling is a form of active learning which can reduce the cost of training by only drawing informative data points into the training set. This selected training set is ...

Zhenyu Lu, Anand I. Rughani, Bruce I. Tranmer, Jos...

claim paper

Read More »

190

click to vote

CGA
1999

150views Computational Geometry» more CGA 1999»

Visualizing Large Telecommunication Data Sets

15 years 6 months ago

Download infovis.uni-konstanz.de

displays to abstract network data and let users interactwithit.Wehaveimplementedafull-scaleSwift3D prototype, which generated the examples we present here. Swift-3D We developed Sw...

Eleftherios Koutsofios, Stephen C. North, Daniel A...

claim paper

Read More »

190

click to vote

SIGMOD
2010
ACM

362views Database» more SIGMOD 2010»

Data warehousing and analytics infrastructure at facebook

15 years 1 months ago

Download borthakur.com

Scalable analysis on large data sets has been core to the functions of a number of teams at Facebook - both engineering and nonengineering. Apart from ad hoc analysis of data and ...

Ashish Thusoo, Zheng Shao, Suresh Anthony, Dhruba ...

claim paper

Read More »

168

click to vote

ICML
2003
IEEE

129views Machine Learning» more ICML 2003»

Learning on the Test Data: Leveraging Unseen Features

16 years 7 months ago

Download www.cis.upenn.edu

This paper addresses the problem of classification in situations where the data distribution is not homogeneous: Data instances might come from different locations or times, and t...

Benjamin Taskar, Ming Fai Wong, Daphne Koller

claim paper

Read More »

197

click to vote

KDD
2005
ACM

125views Data Mining» more KDD 2005»

Email data cleaning

16 years 7 months ago

Download research.microsoft.com

Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...

Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang

claim paper

Read More »

« Prev « First page 171 / 503 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers