Sciweavers

11538 search results - page 316 / 2308
» From Text to Knowledge
Sort
View
LREC
2010
208views Education» more  LREC 2010»
15 years 8 months ago
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
We report about tools for the extraction of German multiword expressions (MWEs) from text corpora; we extract word pairs, but also longer MWEs of different patterns, e.g. verb-nou...
Marion Weller, Ulrich Heid
IJDAR
2008
92views more  IJDAR 2008»
15 years 6 months ago
Mobile Retriever: access to digital documents from their physical source
In this paper we describe an image based document retrieval system which runs on camera enabled mobile devices. "Mobile Retriever" aims to seamlessly link physical and di...
Xu Liu, David S. Doermann
ICDAR
2009
IEEE
15 years 4 months ago
A Unified Framework Based on the Level Set Approach for Segmentation of Unconstrained Double-Sided Document Images Suffering fro
A novel method for the segmentation of double-sided ancient document images suffering from bleed-through effect is presented. It takes advantage of the level set framework to prov...
Reza Farrahi Moghaddam, David Rivest-Hénaul...
CORR
2012
Springer
167views Education» more  CORR 2012»
14 years 2 months ago
Multidimensional counting grids: Inferring word order from disordered bags of words
Models of bags of words typically assume topic mixing so that the words in a single bag come from a limited number of topics. We show here that many sets of bag of words exhibit a...
Nebojsa Jojic, Alessandro Perina
ICML
2007
IEEE
16 years 7 months ago
Self-taught learning: transfer learning from unlabeled data
We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabele...
Rajat Raina, Alexis Battle, Honglak Lee, Benjamin ...