Sciweavers

627 search results - page 62 / 126
» Privacy-Preserving k-NN for Small and Large Data Sets
Sort
View
PR
2008
113views more  PR 2008»
15 years 6 months ago
Do unbalanced data have a negative effect on LDA?
For two-class discrimination, Ref. [1] claimed that, when covariance matrices of the two classes were unequal, a (class) unbalanced dataset had a negative effect on the performanc...
Jing-Hao Xue, D. Mike Titterington
IJCNLP
2005
Springer
15 years 11 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
ACL
2004
15 years 7 months ago
Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm
This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods: the perceptron algorithm, and a...
Brian Roark, Murat Saraclar, Michael Collins, Mark...
CSL
2007
Springer
15 years 6 months ago
Discriminative n-gram language modeling
This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods: the perceptron algorithm, and a...
Brian Roark, Murat Saraclar, Michael Collins
SIGIR
2009
ACM
16 years 20 days ago
Approximating true relevance distribution from a mixture model based on irrelevance data
Pseudo relevance feedback (PRF), which has been widely applied in IR, aims to derive a distribution from the top n pseudo relevant documents D. However, these documents are often ...
Peng Zhang, Yuexian Hou, Dawei Song