Sciweavers

4772 search results - page 428 / 955
» Annotations in Data Streams
Sort
View
ICPR
2010
IEEE
15 years 4 months ago
Text Separation from Mixed Documents Using a Tree-Structured Classifier
In this paper, we propose a tree-structured multiclass classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured cla...
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...
ACL
2011
14 years 10 months ago
Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction
Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not en...
Shasha Liao, Ralph Grishman
SDM
2011
SIAM
307views Data Mining» more  SDM 2011»
14 years 9 months ago
Block-LDA: Jointly modeling entity-annotated text and entity-entity links
We present a model that improves entity entity link modeling in a mixed membership stochastic block model, by jointly modeling links with text about the entities that are linked i...
Ramnath Balasubramanyan, William W. Cohen
SIGMOD
2004
ACM
92views Database» more  SIGMOD 2004»
16 years 7 months ago
Online Maintenance of Very Large Random Samples
Random sampling is one of the most fundamental data management tools available. However, most current research involving sampling considers the problem of how to use a sample, and...
Chris Jermaine, Abhijit Pol, Subramanian Arumugam
PVLDB
2010
106views more  PVLDB 2010»
15 years 5 months ago
Just-in-time Data Integration in Action
Today’s data integration systems must be flexible enough to support the typical iterative and incremental process of integration, and may need to scale to hundreds of data sour...
Martin Hentschel, Laura M. Haas, Renée J. M...