Statistical machine translation systems are usually trained on large amounts of bilingual text (used to learn a translation model), and also large amounts of monolingual text in th...
We propose a novel self-training method for a parser which uses a lexicalised grammar and supertagger, focusing on increasing the speed of the parser rather than its accuracy. The...
Jonathan K. Kummerfeld, Jessika Roesner, Tim Dawbo...
Inducing a grammar directly from text is one of the oldest and most challenging tasks in Computational Linguistics. Significant progress has been made for inducing dependency gram...
This paper concerns the fundamental problem of identifying the content nature of a flow, namely text, binary, or encrypted, for the first time. We propose Iustitia, a tool for ide...
All books entitled “Learn … with 1000 exercises” have in common the same basic principle. They aim to supply enough material to students so that they may better understand t...
Enrique Lazcorreta, Federico Botella, Antonio Fern...