
Best NLP Model Ever? Google BERT Sets New Standards in 11 Language Tasks

The new Google AI paper BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding is receiving accolades from across the machine learning community. Google researchers present a deep bidirectional Transformer model that redefines the state of the art for 11 natural language processing tasks, even surpassing human performance in the challenging area of question answering.

The paper’s first author is Jacob Devlin, a Google senior research scientist whose primary research interest is developing deep learning models for natural language tasks. He previously led Microsoft Translate’s transition from phrase-based translation to neural machine translation (NMT) as a Principal Research Scientist at Microsoft Research from 2014 to 2017.

Google Brain Research Scientist Thang Luong enthusiastically tweeted “a new era of NLP has just begun a few days ago: large pre-training models (Transformer 24 layers, 1024 dim, 16 heads) + massive compute is all you need.”
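For readers curious what a model of the size cited in Luong’s tweet looks like, below is a minimal sketch of a bidirectional Transformer encoder with 24 layers, 1024 hidden dimensions, and 16 attention heads, built with PyTorch’s stock encoder modules. This is not Google’s released BERT implementation; the vocabulary size, maximum sequence length, and feed-forward width are assumptions for illustration only.

```python
import torch
import torch.nn as nn

# Hypothetical sketch of a BERT-large-sized bidirectional encoder
# (24 layers, hidden size 1024, 16 heads), matching the configuration
# cited in the tweet. NOT Google's released code; VOCAB_SIZE and
# MAX_LEN are assumptions made for this illustration.
VOCAB_SIZE = 30000   # assumed WordPiece vocabulary size
MAX_LEN = 512        # assumed maximum sequence length
HIDDEN = 1024        # hidden dimension from the tweet
LAYERS = 24          # Transformer layers from the tweet
HEADS = 16           # attention heads from the tweet

token_emb = nn.Embedding(VOCAB_SIZE, HIDDEN)
pos_emb = nn.Embedding(MAX_LEN, HIDDEN)

encoder_layer = nn.TransformerEncoderLayer(
    d_model=HIDDEN, nhead=HEADS, dim_feedforward=4 * HIDDEN,
    batch_first=True)
encoder = nn.TransformerEncoder(encoder_layer, num_layers=LAYERS)

# Bidirectional encoding: no causal mask is applied, so every token
# attends to context on both its left and right.
tokens = torch.randint(0, VOCAB_SIZE, (2, 128))   # (batch, seq_len)
positions = torch.arange(128).unsqueeze(0)
hidden_states = encoder(token_emb(tokens) + pos_emb(positions))
print(hidden_states.shape)   # torch.Size([2, 128, 1024])
```

Even this toy configuration makes the tweet’s point about “massive compute” concrete: a 24-layer, 1024-dimensional encoder has on the order of hundreds of millions of parameters before any pre-training data is considered.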

Baoxun Wang, Chief Scientist of Chinese AI startup Tricorn, also praised the Google paper as “a milestone” in his keynote address at this week’s Artificial Intelligence Industry Alliance conference in Suzhou, China. The paper leverages massive amounts of data, compute, and well-polished engineering work, representing what Wang calls “Google’s tradition of violent aesthetics.”

The pre-trained model and code will be released in the next two weeks. The paper is on arXiv.


Journalist: Tony Peng | Editor: Michael Sarazen
