Sciweavers

» On the robustness of primitive words
CIKM
2011
Springer
14 years 6 months ago
Simultaneous joint and conditional modeling of documents tagged from two perspectives
This paper explores correspondence and mixture topic modeling of documents tagged from two different perspectives. There has been ongoing work in topic modeling of documents with...
Pradipto Das, Rohini K. Srihari, Yun Fu
IRI
2006
IEEE
16 years 10 days ago
Integration of low level linguistic information for clinical document semantic tagging
We propose a semantic tagger that provides high level concept information for phrases based on several kinds of low level information about words in clinical narrative texts. The ...
Hyeju Jang, Yun Jin, Sung-Hyon Myaeng
CEAS
2005
Springer
15 years 12 months ago
Spam Deobfuscation using a Hidden Markov Model
To circumvent spam filters, many spammers attempt to obfuscate their emails by deliberately misspelling words or introducing other errors into the text. For example, viagra may be...
Honglak Lee, Andrew Y. Ng
LREC
2010
15 years 7 months ago
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
In this paper we bring to light a novel intersection between corpus linguistics and behavioral data that can be employed as an evaluation metric for resources for low-density lang...
Jerid Francom, Amy LaCross, Adam Ussishkin
ACL
2003
15 years 7 months ago
Parametric Models of Linguistic Count Data
It is well known that occurrence counts of words in documents are often modeled poorly by standard distributions like the binomial or Poisson. Observed counts vary more than simpl...
Martin Jansche