This paper discusses our system’s results at the Spanish Question Answering task of CLEF 2007. Our system is centered in a full data-driven approach that combines information ret...
The sheer volume and complexity of government regulations make any attempt to locate, understand and interpret the information a daunting task. Other factors, such as the scattere...
User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error rob...
Current methods for finding relevant content, especially in media-rich web environments, suggest that metadata is critical for accurate and efficient information retrieval. We des...
End-user interactive concept learning is a technique for interacting with large unstructured datasets, requiring insights from both human-computer interaction and machine learning...
Saleema Amershi, James Fogarty, Ashish Kapoor, Des...