We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Data dissemination in decentralized networks is often realized by using some form of swarming technique. Swarming enables nodes to gather dynamically in order to fulfill a certai...
Thomas Locher, Remo Meier, Roger Wattenhofer, Stef...
Abstract With the growing importance of XML in data exchange, much research tends to provide a compact labeling scheme and a flexible query facility to extract data from dynamic XM...
Mapping specification has been recognised as a critical bottleneck to the large scale deployment of data integration systems. A mapping is a description using which data structured...
Lu Mao, Khalid Belhajjame, Norman W. Paton, Alvaro...
XML is becoming the standard data exchange format. View transformation of XML data is important and frequent operation in XML data integration and publishing. In schema-based view ...
Daofeng Luo, Ting Chen, Tok Wang Ling, Xiaofeng Me...