Most applications accessible through the Web lack support for adapting to the different information needs that different users may have regardi...
We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use...
Intelligence analysts construct hypotheses from large volumes of data, but are often limited by social and organizational norms and their own preconceptions and biases. The use of...
Understanding query ambiguity in web search remains an important open problem. In this paper we reexamine query ambiguity by analyzing result clickthrough data. Previously pro...
This paper gives an account of practical experience gained in generating specialized statistical information from web server logs. It emphasizes the problem of combining different dat...
Ernst Georg Haffner, Uwe Roth, Andreas Heuer 0002,...