Search Sciweavers | Sciweavers

735 search results - page 122 / 147

» Corpora and data preparation

ACHI 2008 (IEEE), Human Computer Interaction, 83 views

Specification for User Modeling with Self-Observing Systems

15 years 8 months ago

Download alexandria.tue.nl

The complicated user interfaces and complex functionality of nowadays interactive products lead to a new class of failures: People do not understand their products and thus fail t...

Mathias Funk, Piet van der Putten, Henk Corporaal

CICLING 2008 (Springer), Natural Language Processing, 137 views

A Semantics-Enhanced Language Model for Unsupervised Word Sense Disambiguation

15 years 8 months ago

Download www.csie.ntu.edu.tw

An N-gram language model aims at capturing statistical word order dependency information from corpora. Although the concept of language models has been applied extensively to handl...

Shou-de Lin, Karin Verspoor

ACL 2008, Computational Linguistics, 126 views

Mining Wiki Resources for Multilingual Named Entity Recognition

15 years 7 months ago

Download aclweb.org

In this paper, we describe a system by which the multilingual characteristics of Wikipedia can be utilized to annotate a large corpus of text with Named Entity Recognition (NER) t...

Alexander E. Richman, Patrick Schone

ACL 2007, Computational Linguistics, 130 views

Randomised Language Modelling for Statistical Machine Translation

15 years 7 months ago

Download aclweb.org

A Bloom filter (BF) is a randomised data structure for set membership queries. Its space requirements are significantly below lossless information-theoretic lower bounds but it ...

David Talbot, Miles Osborne

CASCON 2007, Education, 112 views

Removing Manually Generated Boilerplate from Electronic Texts: Experiments with Project Gutenberg e-Books

15 years 7 months ago

Download www.archipel.uqam.ca

Collaborative work on unstructured or semistructured documents, such as in literature corpora or source code, often involves agreed upon templates containing metadata. These templ...

Owen Kaser, Daniel Lemire
