ByteDance High-Resolution AMT System Achieves SOTA in Piano Note and Pedal Transcription
ByteDance introduces a high-resolution piano transcription system trained by regressing the precise onset and offset times of piano notes and pedals.
AI Technology & Industry Review
ByteDance introduces a high-resolution piano transcription system trained by regressing the precise onset and offset times of piano notes and pedals.
AMBERT (A Multigrained BERT) leverages both fine-grained and coarse-grained tokenizations to achieve SOTA performance on English and Chinese language tasks.
HoliCity, a city-scale dataset and all-in-one data platform for research into learning abstracted high-level holistic 3D structures derived from city CAD (computer-aided design) models.
Researchers from ByteDance AILab and Shanghai Jiao Tong University have introduced Xiaomingbot, a multilingual and multimodal news reporter.
This paper presents a new large-scale multilingual video description dataset, covering over 41,250 videos and 825,000 captions in both Chinese and English.
Although machine learning has achieved huge advances in speech recognition, gaming and many other applications, some critics still regard it as little more than glorified “curve fitting” that lacks high-level cognitive abilities and reasoning skills.
At a press conference in Beijing today US chipmaker Intel and Chinese tech pioneer ByteDance announced they will collaborate on setting up an AI research lab, talent training, and development of AI applications.