New Multilingual Video Description Dataset VATEX Receives Three Strong Accepts at ICCV
This paper presents VATEX, a new large-scale multilingual video description dataset containing over 41,250 videos and 825,000 captions in both Chinese and English.
AI Technology & Industry Review