Eye semantic segmentation is a fundamental task in many works such as identification and medical applications. In this study, three encoder-decoder architectures using convolutional neural network are applied to segment the eyes. A simple encoder-decoder architecture is capable of generating only coarse segmentation results. On the other hand, fine details like eyelashes can be achieved by U-net and SegNet architectures. However, they sometimes produce overall results worse than the simple one. To resolve this problem, we introduce a deep convolutional neural network-based ensemble technique for eye segmentation. The results from those architectures are combined in order to yield good results in both coarse-level and fine-level segmentation. In the proposed technique, a trainable mask function is applied to achieve an optimal ensemble of coarse-level and fine-level results. Our dataset comprises 64 eye images from different environments, camera settings, people, and eye conditions. Experimental results show that our ensemble technique can improve the results from the conventional architectures. The proposed ensemble method manages to reach the average accuracy of 96.33% for three-class segmentation.
Johan VertensAbhinav ValadaWolfram Burgard
V BhavadharshiniS MridulaB SakthipriyaJeffin Gracewell
Wenbin YangQuan ZhouYawen FanGuangwei GaoSongsong WuWeihua OuHuimin LuJie ChengLongin Jan Latecki
Ching‐Sheng ChangJin‐Fa LinMing-Ching LeeChristoph Palm
Haozhe JiaYong XiaWeidong CaiMichael FulhamDagan Feng