SemanTV: A Content-Based Video Retrieval Framework

Juan Miguel A. Mendoza; China Marie G. Lao; Antolin J. Alipio; Dan Michael A. Cortez; Anne Camille M. Maupay; Charito M. Molina; Criselle J. Centeno; Jonathan Morano

doi:10.1145/3533050.3533067

JOURNAL ARTICLE

Year: 2022 Pages: 105-110

Abstract

With the increased adoption of CCTV for surveillance, the challenges of footage retrieval have recently gained attention. Most surveillance video systems can retrieve footage only by its metadata (date, time, camera location, etc.), which limits the range of meaningful footage a user can retrieve. To address this, a content-based video retrieval framework is proposed that retrieves relevant videos by matching their content to the user's query. The framework comprises two (2) methods: a Video Content Extraction method that uses Google's Video Intelligence API for Optical Character Recognition and Label Detection, and a Video Retrieval method. Various setups of the Video Retrieval method are explored, including the use of SBERT and Okapi BM25. Each setup was tested against various text queries with corresponding test video results based on the MSVD dataset. Each setup's performance in terms of relevance was measured using Recall and Precision at K and Median and Mean Rank. It was concluded that the framework combining the Video Intelligence API with SBERT alone returned videos relevant to the user's text query more accurately than the other setups.
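The retrieval and evaluation steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that content extraction has already produced a bag of text labels per video (the toy `index` below is hypothetical and stands in for Video Intelligence API output), it scores queries with a common Okapi BM25 variant, and it omits SBERT because that would require an external model. The function names, the `k1` and `b` defaults, and the toy queries are illustrative assumptions.

```python
import math
from collections import Counter
from statistics import mean, median


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against a tokenized query (Okapi BM25)."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed IDF that stays positive for every term.
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            norm = tf[term] + k1 * (1 - b + b * len(d) / avgdl)
            s += idf * tf[term] * (k1 + 1) / norm
        scores.append(s)
    return scores


def rank_videos(query, index):
    """Return video ids sorted by descending BM25 score."""
    ids = list(index)
    scores = bm25_scores(query.lower().split(), [index[i] for i in ids])
    return [i for _, i in sorted(zip(scores, ids), key=lambda p: -p[0])]


def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant videos that appear in the top k."""
    return len(set(ranked[:k]) & relevant) / len(relevant)


def precision_at_k(ranked, relevant, k):
    """Fraction of the top k results that are relevant."""
    return len(set(ranked[:k]) & relevant) / k


def first_relevant_rank(ranked, relevant):
    """1-based rank of the first relevant video in the result list."""
    return next(r for r, vid in enumerate(ranked, start=1) if vid in relevant)


# Hypothetical extracted labels per video (assumption, not real API output).
index = {
    "vid_a": ["dog", "running", "grass", "park"],
    "vid_b": ["man", "guitar", "music", "stage"],
    "vid_c": ["cat", "sofa", "sleeping"],
}
# Hypothetical text queries mapped to their ground-truth videos.
queries = {
    "a dog running in a park": {"vid_a"},
    "man playing guitar": {"vid_b"},
}

ranks = [first_relevant_rank(rank_videos(q, index), rel) for q, rel in queries.items()]
mean_rank, median_rank = mean(ranks), median(ranks)
```

In this toy setup each query's ground-truth video is ranked first, so the mean and median rank are both 1. With SBERT, the BM25 scoring step would instead compare sentence embeddings of the query and of each video's extracted text, while the evaluation functions would stay the same.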

Keywords:

Computer science; Information retrieval; Metadata; Precision and recall; Video retrieval; Relevance (law); Rank (graph theory); Image retrieval; Artificial intelligence; Image (mathematics); World Wide Web

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.05

Topics

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
