This paper proposes a new bootstrapping approach to unsupervised part-of-speech induction. In comparison to previous bootstrapping algorithms developed for this problem, our appro...
Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed ...
In this paper we introduce the MeanNN approach for estimating the main information-theoretic measures, such as differential entropy, mutual information, and divergence. As opposed to...
We introduce the possibility of combining lexical association measures and present empirical results for several methods used in automatic collocation extraction. First, we pre...
This technical report describes the XML data integration framework being built within the AutoMed heterogeneous data integration system. It presents a description of the overall f...