Sciweavers

5107 search results - page 317 / 1022
» Data Mining and Information Retrieval
Sort
View
SDM
2007
SIAM
204views Data Mining» more  SDM 2007»
15 years 8 months ago
Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach
k-anonymity is a popular measure of privacy for data publishing: It measures the risk of identity-disclosure of individuals whose personal information are released in the form of ...
Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Meh...
SIGIR
2002
ACM
15 years 6 months ago
Effective collection metasearch in a hierarchical environment: global vs. localized retrieval performance
We compare standard global IR searching with user-centric localized techniques to address the database selection problem. We conduct a series of experiments to compare the retriev...
Jack G. Conrad, Changwen Yang, Joanne S. Claussen
TREC
2007
15 years 7 months ago
On Retrieving Legal Files: Shortening Documents and Weeding Out Garbage
This paper describes our participation in the TREC Legal experiments in 2007. We have applied novel normalization techniques that are designed to slightly favor longer documents i...
Scott Kulp, April Kontostathis
ICSM
2009
IEEE
15 years 4 months ago
An empirical study on the risks of using off-the-shelf techniques for processing mailing list data
Mailing list repositories contain valuable information about the history of a project. Research is starting to mine this information to support developers and maintainers of longl...
Nicolas Bettenburg, Emad Shihab, Ahmed E. Hassan
WWW
2010
ACM
15 years 6 months ago
Cross-domain sentiment classification via spectral feature alignment
Sentiment classification aims to automatically predict sentiment polarity (e.g., positive or negative) of users publishing sentiment data (e.g., reviews, blogs). Although traditio...
Sinno Jialin Pan, Xiaochuan Ni, Jian-Tao Sun, Qian...