Sciweavers

» Post-Analysis of Learned Rules
CLIN
2003
15 years 7 months ago
Reduction of Dutch Sentences for Automatic Subtitling
We compare machine learning approaches for sentence length reduction for automatic generation of subtitles for deaf and hearing-impaired people with a method which relies on hand-...
Erik F. Tjong Kim Sang, Walter Daelemans, Anja H&o...
EACL
2003
ACL Anthology
15 years 7 months ago
Empirical Methods for Compound Splitting
Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...
Philipp Koehn, Kevin Knight
ANLP
1997
15 years 7 months ago
A Maximum Entropy Approach to Identifying Sentence Boundaries
We present a trainable model for identifying sentence boundaries in raw text. Given a corpus annotated with sentence boundaries, our model learns to classify each occurrence of ., ...
Jeffrey C. Reynar, Adwait Ratnaparkhi
ICML
2008
IEEE
16 years 7 months ago
Multi-classification by categorical features via clustering
We derive a generalization bound for multiclassification schemes based on grid clustering in categorical parameter product spaces. Grid clustering partitions the parameter space i...
Yevgeny Seldin, Naftali Tishby
ICML
2004
IEEE
16 years 7 months ago
Boosting grammatical inference with confidence oracles
In this paper we focus on the adaptation of boosting to grammatical inference. We aim at improving the performances of state merging algorithms in the presence of noisy data by us...
Jean-Christophe Janodet, Richard Nock, Marc Sebban...