Sciweavers

242 search results - page 20 / 49
» Japanese Named Entity Extraction Evaluation - Analysis of Re...
Sort
View
APWEB
2010
Springer
15 years 4 months ago
ECON: An Approach to Extract Content from Web News Page
Abstract--This paper provides a simple but effective approach, named ECON, to fully-automatically extract content from Web news page. ECON uses a DOM tree to represent the Web news...
Yan Guo, Huifeng Tang, Linhai Song, Yu Wang 0009, ...
DEXA
2009
Springer
173views Database» more  DEXA 2009»
16 years 20 days ago
Incremental Ontology-Based Extraction and Alignment in Semi-structured Documents
SHIRI 1 is an ontology-based system for integration of semistructured documents related to a specific domain. The system’s purpose is to allow users to access to relevant parts ...
Mouhamadou Thiam, Nacéra Bennacer, Nathalie...
ICASSP
2009
IEEE
16 years 25 days ago
Automatic named identification of speakers using diarization and ASR systems
In this paper, we consider the extraction of speaker identity from audio records of broadcast news without a priori acoustic information about speakers. Using an automatic speech ...
Vincent Jousse, Simon Petit-Renaud, Sylvain Meigni...
RIAO
2007
15 years 7 months ago
Extracting Useful Information from the Full Text of Fiction
In this paper, we describe some experiments in large-scale Information Extraction (IE) focusing on book texts. We investigate the scalability of IE techniques to full-sized books,...
Sharon Givon, Maria Milosavljevic
DGO
2007
192views Education» more  DGO 2007»
15 years 7 months ago
D-HOTM: distributed higher order text mining
We present D-HOTM, a framework for Distributed Higher Order Text Mining based on named entities extracted from textual data that are stored in distributed relational databases. Unl...
William M. Pottenger