Sciweavers

ISPASS
2010
IEEE
16 years 1 month ago
The Hadoop distributed filesystem: Balancing portability and performance
Hadoop is a popular open-source implementation of MapReduce for the analysis of large datasets. To manage storage resources across the cluster, Hadoop uses a distributed user-le...
Jeffrey Shafer, Scott Rixner, Alan L. Cox
CSCW
2006
ACM
16 years 19 days ago
What goes around comes around: an analysis of del.icio.us as social space
An emergent class of web applications blurs the boundary between single user application and online public space. Recently popular web applications like del.icio.us help manage in...
Kathy J. Lee
ICAIL
2007
ACM
15 years 10 months ago
Essential deduplication functions for transactional databases in law firms
As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...
Jack G. Conrad, Edward L. Raymond
KES
2007
Springer
16 years 24 days ago
Analysis of the Relation Between Stock Price Returns and Headline News Using Text Categorization
In this paper, we analyze the relation between stock price returns and Headline News. Headline News is a very important source of information in asset management, an...
Satoru Takahashi, Masakazu Takahashi, Hiroshi Taka...
INTERNET
2007
15 years 6 months ago
Workflow Planning on a Grid
...level of abstraction, we can represent a workflow as a directed graph with operators (or tasks) at the vertices (see Figure 1). Each operator takes inputs from data sources or from ...
Craig W. Thompson, Wing Ning Li, Zhichun Xiao