The Companions project is a 4 year, EU funded Framework Programme 6 project involving a consortium of 16 partners across 8 countries. Its aim is to develop a personalised conversa...
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of whi...
The Enron Email Corpus provides "Real World" text in the business email domain, which is a target domain for many speech and language applications. We present a section ...
Several years of consulting with online community hosts and managers have highlighted a variety of issues that recur across many online community development efforts. We summarize...
Danyel Fisher, Tammara Combs Turner, Marc A. Smith
This paper replicates and extends Observed Trends in Spam Construction Techniques: A Case Study of Spam Evolution. A corpus of 169,274 spam email was collected over a period of fi...