将视频在时序维度(镜头 + 场景)进行理解, 相关公开数据集和benchmark:SoccerNet-v2、 Kinetics-GEBD、MovieNet
ViTT-AACL2020
镜头切分benchmark: ClipShots、TRECVID、SoccerNet-v2
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation [CVPR 2020]
A Unified Framework for Shot Type Classification Based on Subject Centric Lens[ECCV2020]
镜头拍摄风格识别
Deep Relationship Analysis in Video with Multimodal Feature Fusion [ACM MM 2020]
多模态场景理解
Shot Contrastive Self-Supervised Learning for Scene Boundary Detection [CVPR2021]
Amazon
BaSSL: Boundary-aware Self-supervised Learning for Video Scene Segmentation
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection
Scene Consistency Representation Learning for Video Scene Segmentation
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
博主太厉害了!