Sciweavers

2827 search results - page 307 / 566
» Marking Text Documents
Sort
View
DCC
1998
IEEE
15 years 11 months ago
Lossy Compression of Partially Masked Still Images
Books and magazines often contain pages containing audacious mixtures of color images and text. Our problem consists in coding the background colors of a such documents without wa...
Léon Bottou, Steven Pigeon
ESWS
2008
Springer
15 years 8 months ago
Exploring the Knowledge in Semi Structured Data Sets with Rich Queries
Semantics can be integrated in to search processing during both document analysis and querying stages. We describe a system that incorporates both, semantic annotations of Wikipedi...
Jürgen Umbrich, Sebastian Blohm
NAACL
2004
15 years 8 months ago
Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization
We consider the problem of modeling the content structure of texts within a specific domain, in terms of the topics the texts address and the order in which these topics appear. W...
Regina Barzilay, Lillian Lee
NIPS
2000
15 years 8 months ago
A PAC-Bayesian Margin Bound for Linear Classifiers: Why SVMs work
We investigate how the normalization of vectors influences the result of SVMs. 1 Normalization For the theoretical background, please refer to [1]. 2 Experiments We empirically co...
Ralf Herbrich, Thore Graepel
CLEF
2010
Springer
15 years 7 months ago
A Plagiarism Detector for Intrinsic Plagiarism - Lab Report for PAN at CLEF 2010
In this paper, we describe the algorithm that has been used to carry out our plagiarism detection within the context of PAN10 competition. Our system is based on the LempelZiv dist...
Pablo Suárez, José Carlos Gonz&aacut...