Abstract: Existing portrait segmentation methods are easily affected by the background. To address this challenge, we propose a simple convolution-based portrait segmentation algorithm to solve the problems of complex background, low accuracy and poor efficiency in real environments. Combined with multi-scale convolution, we design a multi-scale sandglass module structure (MultiScale-Sandglass-CA, MSC), which expands the depth and width of the network without increasing the number of parameters, and enhances the feature learning ability of the model for complex backgrounds; and the MSC module is used for feature extraction, and the introduction of coordinate attention mechanism not only improves the learning ability of the portrait features, but also enhances the network's ability to learn the key information. The introduction of the coordinate attention mechanism not only improves the learning ability of the portrait features, but also enhances the focusing of the network on the key information; at the same time, the introduction of the hybrid loss function, which helps to obtain clear portrait edges. Experiments are conducted on Matting and EG1800 public datasets, and the results show that the algorithm can significantly improve the mean intersection and merger ratio (mIoU) and pixel accuracy (Acc), etc. The mIoU can reach up to 97%, and the pixel accuracy can reach up to 10%. Among them, the mIoU index can reach 97.91% (Matting) and 94.72% (EG1800). Compared with the mIoU predicted by PortraitNet, it is 1.29% and 1.19% higher, respectively. In addition, experiments are conducted for the MSC structure proposed in this paper, which proves the superiority of the model in this paper.
Pengbo ZhaiHao YangTingting SongYu KangMa LongxiangXiangsheng Huang
Jianzhuang LinWenzhong YangSixiang Tan
Hanlin MoZhikai YangPengcheng LiQibiao Wang