Jonathan KrauseHailin JinShuicheng YanLi Fei-Fei
Scaling up fine-grained recognition to all domains of fine-grained objects is a challenge the computer vision community will need to face in order to realize its goal of recognizing all object categories. Current state-of-the-art techniques rely heavily upon the use of keypoint or part annotations, but scaling up to hundreds or thousands of domains renders this annotation cost-prohibitive for all but the most important categories. In this work we propose a method for fine-grained recognition that uses no part annotations. Our method is based on generating parts using co-segmentation and alignment, which we combine in a discriminative mixture. Experimental results show its efficacy, demonstrating state-of-the-art results even when compared to methods that use part annotations during training.
Long ChenShengke WangKin‐Man LamHuiyu ZhouMuwei JianJunyu Dong
C B LiXiaokang LiuQi JiaJinyuan LiuZhiying JiangLiu FengYu LiuZhongxuan LuoXin Fan
C. E. GoeringErik RodnerAlexander FreytagJoachim Denzler
Jia DengJonathan KrauseLi Fei-Fei