Sciweavers

ANLP
2000
92views more  ANLP 2000»
15 years 7 months ago
Tagging Sentence Boundaries
In this paper we tackle sentence boundary disambiguation through a part-of-speech (POS) tagging framework. We describe necessary changes in text tokenization and the implementatio...
Andrei Mikheev
SIGIR
2000
ACM
15 years 10 months ago
Document centered approach to text normalization
In this paper we present an approach to tackle three important problems of text normalization: sentence boundary disambiguation, disambiguation of capitalized words when they are ...
Andrei Mikheev