Every day the global media system produces an abundance of news stories, all containing many references to people. An important task is to automatically generate reliable lists of ...
This paper extends previous work on extracting parallel sentence pairs from comparable data (Munteanu and Marcu, 2005). For a given source sentence S, a maximum entropy (ME) class...
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into...
Alkis Simitsis, Panos Vassiliadis, Timos K. Sellis
Between-Pathway Models (BPMs) are network motifs consisting of pairs of putative redundant pathways. In this paper, we show how adding another source of high-throughput d...
Benjamin J. Hescott, Mark D. M. Leiserson, Lenore ...
We describe our early experience building and optimizing GOOG-411, a fully automated, voice-enabled business finder. We show how taking an iterative approach to system developme...