A precondition of existing ensemble-based distributed data mining techniques is the assumption that contributing data are identically and independently distributed. However, this a...
Yan Xing, Michael G. Madden, Jim Duggan, Gerard Ly...
Identification of significant differences in sets of data is a common task of data mining. This paper describes a novel visualization technique that allows the user to interactivel...
Decision support systems are important in leveraging information present in data warehouses in businesses like banking, insurance, retail and health-care among many others. The mu...
In this paper, we study an online data mining problem from streams of semi-structured data such as XML data. Modeling semi-structured data and patterns as labeled ordered trees, w...