Although documents have hundreds of thousands of unique words, only a small number of words are significantly useful for intelligent services. For this reason, feature extraction ...
In today’s knowledge-intensive engineering environment, information management is an important and essential activity. However, existing researches of Engineering Information Man...
The paper presents a framework for publishing relational databases in textual documents such as mails, HTML pages, LATEX or BibTex files, plain texts, etc. The publication proces...
Research on linear text segmentation has been an on-going focus in NLP for the last decade, and it has great potential for a wide range of applications such as document summarizati...
Jingbo Zhu, Na Ye, Xinzhi Chang, Wenliang Chen, Be...
This communication deals with the Writer Identification task. Our previous work has shown the interest of using the graphemes as features for describing the individual properties ...