The paper presents a method for pruning frequent itemsets based on background knowledge represented by a Bayesian network. The interestingness of an itemset is defined as the abso...
Abstract. Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
We introduce a new algorithm for mining sequential patterns. Our algorithm is especially efficient when the sequential patterns in the database are very long. We introduce a novel...
Jay Ayres, Jason Flannick, Johannes Gehrke, Tomi Y...
Class membership probability estimates are important for many applications of data mining in which classification outputs are combined with other sources of information for decisi...
Group work frequently involves transitions between periods of active collaboration and periods of individual activity. We aim to support this typical work practice by introducing ...