Document classification presents difficult challenges due to the sparsity and high dimensionality of text data, and to the complex semantics of natural language. The tradi...
The vast majority of earlier work has focused on graphs that are both connected (typically by ignoring all but the giant connected component) and unweighted. Here we study numer...
Knowledge Discovery in time series usually requires symbolic time series. Many discretization methods that convert numeric time series to symbolic time series ignore the temporal ...
Within the last decade, blogs have become an important element of popular culture, mass media, and the daily lives of countless Internet users. Despite the medium's interacti...
This research explores how ideas occur in creative work and the strategies and tools used to represent and develop them. We describe the analysis of an open questionnaire survey o...