This paper proposes a license plate detection algorithm using both global statistical features and local Haar-like features. Classifiers using global statistical features are cons...
In many domains, a Bayesian network's topological structure is not known a priori and must be inferred from data. This requires a scoring function to measure how well a propo...
Clustering algorithms typically operate on a feature vector representation of the data and find clusters that are compact with respect to an assumed (dis)similarity measure betwee...
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can determine whether a pair of records refer t...
This paper proposes a technique for identifying program properties that indicate errors. The technique generates machine learning models of program properties known to result from...