: Tracking and identifying actors across different scenes.
: 92,000 tags for cinematic styles (lighting, camera motion, view scale) and 65,000 tags for action and location. mvs movienet verified
MovieNet is the first comprehensive dataset that integrates multiple modalities—such as video, audio, and text—to help machines understand complex stories. It contains data from , featuring: : Tracking and identifying actors across different scenes