Sciweavers

3530 search results - page 415 / 706
» Technology of Text Mining
Sort
View
ICWE
2003
Springer
15 years 12 months ago
Genre and Domain Processing in an Information Retrieval Perspective
Abstract. The massive amount of textual data on the Web raises numerous classification problems. Although the notion of domain is widely acknowledged in the IR field, the applica...
Céline Poudat, Guillaume Cleuziou
IDA
2003
Springer
15 years 12 months ago
Very Predictive Ngrams for Space-Limited Probabilistic Models
In sequential prediction tasks, one repeatedly tries to predict the next element in a sequence. A classical way to solve these problems is to fit an order-n Markov model to the da...
Paul R. Cohen, Charles A. Sutton
CLEF
2001
Springer
15 years 11 months ago
Minimalistic Test Runs of the Eidetica Indexer
Participating in a text retrieval conference for the first time, Eidetica has run six minimalistic tests with its t·repository indexer, doing as little tuning as possible, in ord...
Teresita Frizzarin, Annius Groenink
181
Voted
IDA
2001
Springer
15 years 11 months ago
An Algorithm for Segmenting Categorical Time Series into Meaningful Episodes
This paper describes an unsupervised algorithm for segmenting categorical time series. The algorithm first collects statistics about the frequency and boundary entropy of ngrams, t...
Paul R. Cohen, Niall M. Adams
NLPRS
2001
Springer
15 years 11 months ago
Modality Expressions in Japanese and Their Automatic Paraphrasing
It is important for future NLP systems to formulate the semantic equivalence (and more generally, the semantic similarity) of natural language expressions. In particular, paraphra...
Toshifumi Tanabe, Kenji Yoshimura, Kosho Shudo