Position information has been proved to be very effective in document summarization, especially in generic summarization. Existing approaches mostly consider the information of se...
This paper describes recent advances in hidden Markov model (HMM) based OCR for machine-printed Arabic documents. A combination of scriptindependent and script-specific techniques...
This paper presents the results of a feasability study that was carried out to evaluate the construction of Use Case Models by comparing the models with groups that used the GUCCRA...
Anderson Belgamo, Sandra Camargo Pinto Ferraz Fabb...
Manual categorisation of documents is a time-consuming task that has been significantly alleviated with the deployment of automatic and machine-aided text categorisation systems. ...
Statistical language models can learn relationships between topics discussed in a document collection and persons, organizations and places mentioned in each document. We present a...
David Newman, Chaitanya Chemudugunta, Padhraic Smy...