Sciweavers

3530 search results - page 252 / 706
» Technology of Text Mining
Sort
View
SIGIR
2004
ACM
15 years 12 months ago
Constructing a text corpus for inexact duplicate detection
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. The goal of this work i...
Jack G. Conrad, Cindy P. Schriber
CIKM
2004
Springer
15 years 12 months ago
Indexing text data under space constraints
An important class of queries is the LIKE predicate in SQL. In the absence of an index, LIKE queries are subject to performance degradation. The notion of indexing on substrings (...
Bijit Hore, Hakan Hacigümüs, Balakrishna...
COOPIS
2003
IEEE
15 years 11 months ago
Automatic Expansion of Manual Email Classifications Based on Text Analysis
The organization of documents is a task that we face as computer users daily. This is particularly true for management of email. Typically email documents are organized in director...
Enrico Giacoletto, Karl Aberer
HICSS
2003
IEEE
164views Biometrics» more  HICSS 2003»
15 years 11 months ago
On a Text-Processing Approach to Facilitating Autonomous Deception Detection
Abstract—Current techniques towards information security have limited capabilities to detect and counter attacks that involve different kinds of masquerade and spread of misinfor...
Therani Madhusudan
SIGIR
2003
ACM
15 years 11 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann