Sciweavers

2190 search results - page 300 / 438
» Unweaving a web of documents
Sort
View
ELPUB
2007
ACM
15 years 10 months ago
Digitisation and Access to Archival Collections: A Case Study of the Sofia Municipal Government (1878-1879)
The paper presents in brief a project aimed at the development of a methodology and corresponding software tools intended for building of proper environments giving up means for s...
Maria Nisheva-Pavlova, Pavel Pavlov, Nikolay Marko...
LREC
2010
164views Education» more  LREC 2010»
15 years 8 months ago
Enhanced Infrastructure for Creation and Collection of Translation Resources
Statistical Machine Translation (MT) systems have achieved impressive results in recent years, due in large part to the increasing availability of parallel text for system trainin...
Zhiyi Song, Stephanie Strassel, Gary Krug, Kazuaki...
EMNLP
2008
15 years 8 months ago
Soft-Supervised Learning for Text Classification
We propose a new graph-based semisupervised learning (SSL) algorithm and demonstrate its application to document categorization. Each document is represented by a vertex within a ...
Amarnag Subramanya, Jeff Bilmes
LREC
2008
117views Education» more  LREC 2008»
15 years 8 months ago
A Suite to Compile and Analyze an LSP Corpus
This paper presents a series of tools for the extraction of specialized corpora from the web and its subsequent analysis mainly with statistical techniques. It is an integrated sy...
Rogelio Nazar, Jorge Vivaldi, Teresa Cabré
SEBD
2008
177views Database» more  SEBD 2008»
15 years 8 months ago
Using PageRank in Feature Selection
Abstract. Feature selection is an important task in data mining because it allows to reduce the data dimensionality and eliminates the noisy variables. Traditionally, feature selec...
Dino Ienco, Rosa Meo, Marco Botta