Obtaining high-quality and up-to-date labeled data can be difficult in many real-world machine learning applications, especially for Internet classification tasks like review spam...
We describe a novel semi-supervised method called Word-Codebook Learning (WCL), and apply it to the task of bio-named entity recognition (bioNER). Typical bioNER systems can be seen...
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...
To search corpora written in two or more languages, the simplest and most efficient approach is to translate the submitted query into the required language(s). To achieve...
Type Run Description MAP/mean infAP HLF Official H utcwiprimw146 Our prelimina...
Robin Aly, Djoerd Hiemstra, Arjen P. de Vries, Hen...