"This book is primarily intended to be a text for the programming component in an introductory two semester computer science course (some materials are a little advanced and m...
In this paper, we describe the design, architecture, and the lessons learned from the implementation of a fast regular expression indexing engine FREE. FREE uses a prebuilt index ...
In this investigation, we propose a probabilistic approach for estimating the ages of Blog authors by means of Naive Bayesian Classifier. We can learn context of characteristic wor...
In this paper, an evaluation is presented of a framework that supports flexible content repurposing. Unlike the usual practice where content components, such as slides, images, def...
Number and date expressions are essential information items in corpora and therefore play a major role in various text mining applications. However, so far number expressions were ...