Sciweavers

3705 search results - page 267 / 741
» Building Documentation Generators
Sort
View
BIS
2006
106views Business» more  BIS 2006»
15 years 8 months ago
Expected Utility of Content Blocks in Web Content Extraction
In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. Aft...
Marek Kowalkiewicz
NIPS
2001
15 years 8 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
COLING
2002
15 years 6 months ago
Extracting Important Sentences with Support Vector Machines
Extracting sentences that contain important information from a document is a form of text summarization. The technique is the key to the automatic generation of summaries similar ...
Tsutomu Hirao, Hideki Isozaki, Eisaku Maeda, Yuji ...
ER
2003
Springer
119views Database» more  ER 2003»
15 years 12 months ago
Toward the Automatic Derivation of XML Transformations
Existing solutions to data and schema integration require user interaction/input to generate a data transformation between two different schemas. These approaches are not appropri...
Martin Erwig
ICDM
2007
IEEE
140views Data Mining» more  ICDM 2007»
16 years 1 months ago
Finding Cohesive Clusters for Analyzing Knowledge Communities
Documents and authors can be clustered into “knowledge communities” based on the overlap in the papers they cite. We introduce a new clustering algorithm, Streemer, which fin...
Vasileios Kandylas, S. Phineas Upham, Lyle H. Unga...