- Crawling web pages written in Arabic or any other language with limited content in the web may, at first, seem to parallel the process of crawling the English content. However, t...
An empirical study has been conducted investigating the relationship between the performance of a generative language model in terms of perplexity and the corresponding informatio...
Leif Azzopardi, Mark Girolami, Keith van Rijsberge...
We present a method which, given a few words defining a concept in some language, retrieves, disambiguates and extends corresponding terms that define a similar concept in another...
Performance of n-gram language models depends to a large extent on the amount of training text material available for building the models and the degree to which this text matches...
We measure the effects of a weak language model, estimated from as little as 100k words of text, on unsupervised acoustic model training and then explore the best method of using ...