Dynamic data streams are those whose underlying distribution changes over time. They occur in a number of application domains, and mining them is important for these applications....
Statistical machine learning techniques for data classification usually assume that all entities are i.i.d. (independent and identically distributed). However, real-world entities...
Duplicate detection determines different representations of real-world objects in a database. Recent research has considered the use of relationships among object representations t...
We present a framework for automatically summarizing social group activity over time. The problem is important in understanding large-scale online social networks, which have dive...
In contextual advertising, estimating the number of impressions of an ad is critical in planning and budgeting advertising campaigns. However, producing this forecast, even within...
Xuerui Wang, Andrei Z. Broder, Marcus Fontoura, Va...