Sciweavers

ISSA
2008
15 years 8 months ago
Spam Construction Trends
This paper replicates and extends Observed Trends in Spam Construction Techniques: A Case Study of Spam Evolution. A corpus of 169,274 spam emails was collected over a period of fi...
Barry Irwin, Blake Friedman
LREC
2008
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
LREC
2008
Tools for Collocation Extraction: Preferences for Active vs. Passive
We present and partially evaluate procedures for the extraction of noun+verb collocation candidates from German text corpora, along with their morphosyntactic preferences, especia...
Ulrich Heid, Marion Weller
LREC
2008
The Italian Particle "ne": Corpus Construction and Analysis
The Italian particle ne exhibits interesting anaphoric properties that have not yet been explored in depth from a corpus and computational linguistic perspective. We provide: (i) ...
Malvina Nissim, Sara Perboni
LREC
2008
Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN
The DAM-LR project aims at virtually integrating various European language resource archives that allow users to navigate and operate in a single unified domain of language resour...
Daan Broeder, David Nathan, Sven Strömqvist, ...