Sciweavers

1140 search results - page 84 / 228
» A Novel Ant-Based Clustering Approach for Document Clusterin...
Sort
View
BMCBI
2008
130views more  BMCBI 2008»
15 years 6 months ago
A novel series of compositionally biased substitution matrices for comparing Plasmodium proteins
Background: The most common substitution matrices currently used (BLOSUM and PAM) are based on protein sequences with average amino acid distributions, thus they do not represent ...
Kevin Brick, Elisabetta Pizzi
BMCBI
2010
161views more  BMCBI 2010»
15 years 3 months ago
LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes
Background: Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that...
Zeev Frenkel, Etienne Paux, David I. Mester, Cathe...
JCST
2008
121views more  JCST 2008»
15 years 6 months ago
Clustering Text Data Streams
Abstract Clustering text data streams is an important issue in data mining community and has a number of applications such as news group filtering, text crawling, document organiza...
Yubao Liu, Jiarong Cai, Jian Yin, Ada Wai-Chee Fu
ICDE
2007
IEEE
170views Database» more  ICDE 2007»
16 years 7 months ago
Tree-Pattern Similarity Estimation for Scalable Content-based Routing
With the advent of XML as the de facto language for data publishing and exchange, scalable distribution of XML data to large, dynamic populations of consumers remains an important...
Raphaël Chand, Pascal Felber, Minos N. Garofa...
WWW
2010
ACM
16 years 1 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han