The growth of the web has directly influenced the increase in the availability of relational data. One of the key problems in mining such data is computing the similarity between o...
Pradeep Muthukrishnan, Dragomir R. Radev, Qiaozhu ...
Tweets are the most up-to-date and inclusive stream of information and commentary on current events, but they are also fragmented and noisy, motivating the need for systems that c...
With the rapid expansion of the Internet, the implementation of agent technology in electronic commerce (e-commerce) has become very popular, which provides a promising field for the...
Hypertext is being used more and more for on-line course texts. But the navigational freedom offered by a rich link structure is a burden for students who need guidance throughout...
Taxonomies of the Web typically have hundreds of thousands of categories and skewed category distribution over documents. It is not clear whether existing text classification tech...
Tie-Yan Liu, Yiming Yang, Hao Wan, Qian Zhou, Bin ...