We describe a class of translation model in which a set of input variants encoded as a context-free forest is translated using a finite-state translation model. The forest structur...
In a corpus of expert tutoring dialogue, conversation that is considered to be "off topic" (non-pedagogical) according to a previous coding scheme is explored for its va...
Most work on language acquisition treats word segmentation--the identification of linguistic segments from continuous speech--and word learning--the mapping of those segments to me...
In this paper we examine different linguistic features for sentimental polarity classification, and perform a comparative study on this task between blog and review data. We found...
We are interested in diacritizing Semitic languages, especially Syriac, using only diacritized texts. Previous methods have required the use of tools such as part-of-speech tagger...