This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
In this study, unlike previous studies where participants were instructed to pay attention to the advertisements, we set up a more naturalistic situation of reading magazine. Five ...
Geometric layout analysis plays an important role in document image understanding. Many algorithms known in literature work well on standard document images, achieving high text l...
Faisal Shafait, Joost van Beusekom, Daniel Keysers...
Instant intercommunion techniques such as Instant Messaging (IM) are widely popularized. Aiming at such kind of large scale masscommunication media, clustering on its text conte...
The automatic detection of plagiarism is a task that has acquired relevance in the Information Retrieval area and it becomes more complex when the plagiarism is made in a multiling...