Combined Generative-Discriminative Learning for Object Recognition using Local Image Descriptors

Abhikesh Nag; David J. Miller; Andrew P. Brown; Kevin Sullivan

doi:10.1109/mlsp.2007.4414333


JOURNAL ARTICLE


Year: 2007
Journal: Machine learning for signal processing ...
Vol: b 39
Pages: 360-365
Publisher: Institute of Electrical and Electronics Engineers


Abstract

We present a system for scale- and affine-invariant recognition of vehicular objects in video sequences. We use local descriptors (SIFT keypoints) from image frames to model the object. These features are claimed in the literature to be highly distinctive and invariant to rotation, scale, and affine transformations. However, since the SIFT keypoints that are extracted from an object are instance-specific (variable), they form a dynamic feature space. This presents certain challenges for classification techniques, which generally require use of the same set of features for every instance of an object to be classified. To resolve this difficulty, we associate the extracted keypoints with the components (representative keypoints) in a mixture model for each target class. While the extracted keypoints are variable, the mixture components are fixed. The mixture models the keypoint features, as well as the location and scale at which each keypoint was detected in the frame. Keypoint-to-component association is achieved via a switching optimization procedure that locally maximizes the joint likelihood of keypoints and their locations and scales, with the latter based on an affine transformation. To each mixture component from a class, we link a (first-layer) support vector machine (SVM) classifier, which votes for or against the hypothesis that the keypoint associated with the component belongs to the model's target class. A second-layer SVM pools the votes from the ensemble of SVM classifiers in the first layer and gives the final class decision.
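The central idea in the abstract, mapping a variable set of keypoints onto a fixed set of mixture components, can be illustrated with a minimal sketch. This is not the authors' switching optimization: it uses a plain hard assignment under isotropic Gaussians with a shared variance, and it leaves out the location and scale terms, the affine transformation, and both SVM layers. The names `log_gauss`, `associate`, `components`, `frame_a`, and `frame_b` are hypothetical, and the two-dimensional descriptors are toy values chosen for readability.

```python
import math

def log_gauss(x, mean, var):
    """Log-density of an isotropic Gaussian with variance `var` (an assumption)."""
    d = len(x)
    sq = sum((a - b) ** 2 for a, b in zip(x, mean))
    return -0.5 * (d * math.log(2 * math.pi * var) + sq / var)

def associate(keypoints, components, var=1.0):
    """Map a variable-length list of descriptors onto a fixed set of components.

    Returns one slot per component: (keypoint index, log-likelihood) for the
    best-matching keypoint, or None if no keypoint chose that component.
    """
    slots = [None] * len(components)
    for i, kp in enumerate(keypoints):
        scores = [log_gauss(kp, c, var) for c in components]
        k = max(range(len(components)), key=scores.__getitem__)
        # Keep only the most likely keypoint for each component.
        if slots[k] is None or scores[k] > slots[k][1]:
            slots[k] = (i, scores[k])
    return slots

# Toy "representative keypoints" (component means) for one class.
components = [(0.0, 0.0), (5.0, 5.0), (10.0, 0.0)]

# Two frames with different numbers of extracted keypoints.
frame_a = [(0.2, -0.1), (4.8, 5.3)]
frame_b = [(9.7, 0.4), (0.1, 0.0), (5.1, 4.9), (0.9, 0.8)]

slots_a = associate(frame_a, components)
slots_b = associate(frame_b, components)
```

Both frames yield exactly three slots, one per component, even though they contain two and four keypoints respectively. A fixed-length representation of this kind is what would let a separate classifier be attached to each component; an empty slot (`None`) marks a component with no matching keypoint in that frame.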

Keywords:

Artificial intelligence; Discriminative model; Pattern recognition (psychology); Scale-invariant feature transform; Affine transformation; Classifier (UML); Computer science; Cognitive neuroscience of visual object recognition; Support vector machine; Affine hull; Generative model; Feature extraction; Computer vision; Feature vector; Mathematics; Generative grammar; Affine space

Metrics

FWCI (Field Weighted Citation Impact): 0.60

Citation Normalized Percentile: 0.71

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition


Related Documents

Discriminative Learning of Local Image Descriptors

Multiple kernel learning with ICA: Local discriminative image descriptors for recognition

Object Recognition using image descriptors

Efficient discriminative local learning for object recognition

Object Recognition Using Local Descriptors: A Comparison