Sciweavers

2827 search results - page 272 / 566
» Marking Text Documents
Sort
View
LREC
2008
106views Education» more  LREC 2008»
15 years 8 months ago
Producing an Encyclopedic Dictionary using Patent Documents
Although the World Wide Web has of late become an important source to consult for the meaning of words, a number of technical terms related to high technology are not found on the...
Atsushi Fujii
INTERSPEECH
2010
15 years 1 months ago
Using dependency parsing and machine learning for factoid question answering on spoken documents
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two app...
Pere Comas, Jordi Turmo, Lluís Màrqu...
ICML
2004
IEEE
16 years 7 months ago
Text categorization with many redundant features: using aggressive feature selection to make SVMs competitive with C4.5
Text categorization algorithms usually represent documents as bags of words and consequently have to deal with huge numbers of features. Most previous studies found that the major...
Evgeniy Gabrilovich, Shaul Markovitch
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
16 years 4 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
SBRN
2008
IEEE
16 years 29 days ago
Multi-label Text Categorization Using VG-RAM Weightless Neural Networks
In automated multi-label text categorization, an automatic categorization system should output a category set, whose size is unknown a priori, for each document under analysis. Ma...
Claudine Badue, Felipe Pedroni, Alberto Ferreira d...