Although Inversion Transduction Grammar (ITG) has regained attention in recent years, speed remains a major obstacle. We propose a discriminative ITG...
Identifying background (context) information in scientific articles can help scholars understand major contributions in their research area more easily. In this paper, we propose ...
The pipeline of most Phrase-Based Statistical Machine Translation (PB-SMT) systems starts from an automatically word-aligned parallel corpus. But word appears to be too fine-grained ...
The Manually Annotated Sub-Corpus (MASC) project provides data and annotations to serve as the base for a community-wide annotation effort of a subset of the American National Corp...
Nancy Ide, Collin F. Baker, Christiane Fellbaum, R...
This paper presents a novel filtration criterion to restrict rule extraction for the hierarchical phrase-based translation model, where a bilingual but relaxed well-formed depe...