We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
With emerging decentralized technologies, peer-to-peer (P2P) content distribution arises as a new model for storage and transmission of data. In this scenario, one peer can be play...
We propose the implementation of a distributed system for DL storage and retrieval which relies on two earlier developed technologies: buckets and document linking based on user r...
Abstract. MKM has been defined as the quest for technologies to manage mathematical knowledge. MKM “in the small” is well-studied, so the real problem is to scale up to large,...
Michael Kohlhase, Florian Rabe, Vyacheslav Zholude...
There has been a lot of work on evaluating and improving the relevance of web search engines. In this paper, we suggest using human computation games to elicit data from players t...
Hao Ma, Raman Chandrasekar, Chris Quirk, Abhishek ...