论文标题

海洋视频套件:一种新的海洋视频数据集,用于基于内容的分析和检索

Marine Video Kit: A New Marine Video Dataset for Content-based Analysis and Retrieval

论文作者

Truong, Quang-Trung, Vu, Tuan-Anh, Ha, Tan-Sang, Jakub, Lokoc, Tim, Yue Him Wong, Joneja, Ajay, Yeung, Sai-Kit

论文摘要

对异常域特定视频收集的有效分析代表了一个重要的实际问题,最新的通用模型仍面临局限性。因此,希望设计基准数据集,以挑战具有其他约束的特定领域的新型强大模型。重要的是要记住,特定域数据可能更嘈杂(例如,内窥镜或水下视频),并且通常需要更多经验丰富的用户才能有效搜索。在本文中,我们专注于从水下环境中移动相机拍摄的单次视频,这构成了研究目的的非平凡挑战。提出了一个新的海洋视频套件数据集的第一个碎片,用于用于视频检索和其他计算机视觉挑战。我们的数据集在视频浏览器摊牌2023期间的特殊会话中使用。除了基本的元数据统计数据外,我们还基于低级功能以及所选键框的语义注释提供了一些见解。该分析还包含显示尊敬的通用模型的局限性的实验。我们的数据集和代码可在https://hkust-vgd.github.io/marinevideokit上公开获取。

Effective analysis of unusual domain specific video collections represents an important practical problem, where state-of-the-art general purpose models still face limitations. Hence, it is desirable to design benchmark datasets that challenge novel powerful models for specific domains with additional constraints. It is important to remember that domain specific data may be noisier (e.g., endoscopic or underwater videos) and often require more experienced users for effective search. In this paper, we focus on single-shot videos taken from moving cameras in underwater environments, which constitute a nontrivial challenge for research purposes. The first shard of a new Marine Video Kit dataset is presented to serve for video retrieval and other computer vision challenges. Our dataset is used in a special session during Video Browser Showdown 2023. In addition to basic meta-data statistics, we present several insights based on low-level features as well as semantic annotations of selected keyframes. The analysis also contains experiments showing limitations of respected general purpose models for retrieval. Our dataset and code are publicly available at https://hkust-vgd.github.io/marinevideokit.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源