Automatic image captioning has recently made significant strides by leveraging semantic concepts inferred from the image. Integration of deep learning with computer vision revealed wide applications and image captioning as being significant among them. This lays its foot in virtual assistants and is a boom to the visually challenged. It is a digital twin with computer vision (CV) and natural language processing (NLP). Image captioning combines the problem of image object recognition and semantic labeling . This chapter gives insight into various deep leaning (DL) models employed in object detection and also image captioning models.
Arunkumar GopuPratyush NishchalVishesh MittalKuna Srinidhi
Siddharth SrivastavaYash ChaudhariYash DamaniaParul Jadhav
Junaid Ahmad WaniSahilpreet Singh
Rizwan SayyedAkash SatputeTushar VarkhedePrasad ZorePriya Surana
Rupendra Kumar KaushikSushil Kumar Sharma