Sciweavers

926 search results - page 91 / 186
» Large Scale Data Mining: Challenges and Responses
Sort
View
PKDD
2010
Springer
172views Data Mining» more  PKDD 2010»
15 years 4 months ago
Surprising Patterns for the Call Duration Distribution of Mobile Phone Users
How long are the phone calls of mobile users? What are the chances of a call to end, given its current duration? Here we answer these questions by studying the call duration distri...
Pedro O. S. Vaz de Melo, Leman Akoglu, Christos Fa...
PKDD
2005
Springer
101views Data Mining» more  PKDD 2005»
15 years 11 months ago
A Random Method for Quantifying Changing Distributions in Data Streams
In applications such as fraud and intrusion detection, it is of great interest to measure the evolving trends in the data. We consider the problem of quantifying changes between tw...
Haixun Wang, Jian Pei
KDD
2006
ACM
165views Data Mining» more  KDD 2006»
16 years 6 months ago
Training linear SVMs in linear time
Linear Support Vector Machines (SVMs) have become one of the most prominent machine learning techniques for highdimensional sparse data commonly encountered in applications like t...
Thorsten Joachims
WWW
2010
ACM
16 years 1 months ago
Large-scale bot detection for search engines
In this paper, we propose a semi-supervised learning approach for classifying program (bot) generated web search traffic from that of genuine human users. The work is motivated by...
Hongwen Kang, Kuansan Wang, David Soukal, Fritz Be...
PVLDB
2010
150views more  PVLDB 2010»
15 years 4 months ago
DataGarage: Warehousing Massive Performance Data on Commodity Servers
Contemporary datacenters house tens of thousands of servers. The servers are closely monitored for operating conditions and utilizations by collecting their performance data (e.g....
Charles Loboz, Slawek Smyl, Suman Nath