Abstract. In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain; strategies are thus needed for maximizing t...
: This is an ongoing project, which is scheduled until 2005. So I decided together with Richard Wang to set up Sup-Projects that could be published earlier. The main objective is t...
Abstract. Clickthrough data has been the subject of increasing popularity as an implicit indicator of user feedback. Previous analysis has suggested that user click behaviour is su...
Falk Scholer, Milad Shokouhi, Bodo Billerbeck, And...
To efficiently compress rasterized compound documents, an encoder must be content-adaptive. Content adaptivity may be achieved by using a layered approach. In such an approach, a ...
George Pavlidis, Sofia Tsekeridou, Christodoulos C...
In this paper the problem of off-line handwritten cursive text recognition is considered. A method for expanding the set of available training textlines by applying random perturb...