—This paper proposes and uses multivariate methods as a tool to evaluate performances of the hardware of microcomputers using their performance data, speed and price. The evaluat...
In this work we study some probabilistic models for the random generation of words over a given alphabet used in the literature in connection with pattern statistics. Our goal is t...
The concepts of similarity and distance are crucial in data mining. We consider the problem of defining the distance between two data sets by comparing summary statistics compute...
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
In this paper, we describe WikiPop service, a system designed to detect significant increase of popularity of topics related to users’ interests. We exploit Wikipedia page view...