‘Train Large, Then Compress’ – UC Berkeley BAIR Improves Large Transformer Model Training and Inference

Researchers from the Berkeley Artificial Intelligence Research (BAIR) Lab at UC Berkeley explored the effect of Transformer model size on training and inference efficiency.

by Synced

2020-03-09

Comments 46

In the current state of deep learning, methods that can be used to improve model accuracy basically come down to increasing model size, dataset size, or number of training steps. These methods however require large and very expensive compute resources. Optimizing computing efficiency has become a key goal for researchers when computing resources are limited. How to achieve higher accuracy with limited hardware support and training time?

To address this issue, researchers from the Berkeley Artificial Intelligence Research (BAIR) Lab at UC Berkeley explored the effect of Transformer model size on training and inference efficiency. Their new paper shows that with limited resources, training and inference efficiency can be improved by significantly increasing the size of the Transformer models and heavily compressing them.

Under the usual presumption that models are trained to convergence, only small models that are fast-to-execute are feasible in resource-constrained settings. The work shows that the most compute-efficient training scheme is instead to train very large models, stop them well short of convergence, and then heavily compress them to meet test-time constraints.

The researchers conducted several experiments and found that in a given time, the deeper RoBERTa model (RoBERTa is an optimized BERT pretraining approach) with more layers had lower perplexity than the model with fewer layers. The wider RoBERTa model also had lower perplexity.

Researchers also evaluated the validation BLEU score of models in different sizes when training an English-French transformer machine translation model. BLEU score is an automatic evaluation metric for machine translation (the higher, the better). In the same training time, deeper and wider models outperformed the smaller models. Researchers also found that increasing model width or depth resulted in faster training for RoBERTa pretraining, and that the wider model works better in machine translation tasks.

Although training a larger model can deliver higher efficiency, this also raises the computation and memory cost of inference, and the total cost of inference is much higher than the training cost in most practical applications. The “Train Large, Then Compress” approach can solve this problem. Researchers used compression techniques such as quantization and pruning, both of which can reduce inference latency and memory requirements.

In the case of RoBERTa, the researchers first pretrained different size RoBERTa models with the same given time, then fine-tuned these models on a downstream text classification task and applied pruning or quantization methods for compression. It was found that in a given test time, increasing model size and then applying heavy compression worked best.

Researchers conducted a preliminary investigation of their findings limited to the field of natural language processing, and say their conclusions could be further explored in the other fields in the future.

The paper Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers is on arXiv.

Author: Herin Zhao | Editor: Michael Sarazen

46 comments on “‘Train Large, Then Compress’ – UC Berkeley BAIR Improves Large Transformer Model Training and Inference”

Ziane

2020-03-11

Thanks for you.

Loading...

Reply
Zoulikha

2020-03-11

Nice topic.

Loading...

Reply
Ziane.ziane

2020-03-11

Thanks for you.

Loading...

Reply
Ziane

2020-03-14

Best wishesI enjoyed reading the topic and thank you for sharing it with us, Best Regards

Loading...

Reply
zianezo

2020-07-12

Thank you very much

Loading...

Reply
zianezo

2020-07-12

This topic good

Loading...

Reply
zianezo

2020-07-12

This article good

Loading...

Reply
zianezo

2020-07-12

I like to thanks

Loading...

Reply
zianezo

2020-07-12

Good like

Loading...

Reply
zianezo

2020-07-12

I love the article

Loading...

Reply
zianezo

2020-07-12

I love this topic

Loading...

Reply
zianezo

2020-07-12

Yes ,i will to

Loading...

Reply
zianezo

2020-07-12

Goood and nice

Loading...

Reply
zianezo

2020-07-12

Good

Loading...

Reply
zianezo

2020-07-12

Wi can you and Thanks

Loading...

Reply
zianezo

2020-07-12

I uesed to

Loading...

Reply
ziane

2020-10-30

Thank you very good……….

Loading...

Reply
ziane

2020-10-30

Good good good

Loading...

Reply
ziane

2020-11-02

Thank you very much

Loading...

Reply
ziane

2020-11-02

Good good good good topic

Loading...

Reply
ziane

2020-11-02

Thank you this topic good

Loading...

Reply
ziane

2020-11-02

Very nice ….

Loading...

Reply
ziane

2020-11-02

Mérci pour article

Loading...

Reply
ziane

2020-11-02

Very very very very niiiiice

Loading...

Reply
ziane

2020-11-15

Good article and thank you

Loading...

Reply
ziane

2020-11-18

Good good article…

Loading...

Reply
ziane

2020-11-18

Very very niiiiiice

Loading...

Reply
ziane

2020-11-18

Good
Good good

Loading...

Reply
ziane

2020-11-18

Very very nice..

Loading...

Reply
ziane

2020-11-18

Very very niiiiiiiiiiiiiiiiiiice

Loading...

Reply
ziane

2020-11-18

Bien bien bien article

Loading...

Reply
ziane

2020-11-18

Mèrci pour article

Loading...

Reply
ziane

2020-11-18

Very very very nice

Loading...

Reply
ziane

2020-11-18

Mèrci mèrci..

Loading...

Reply
ziane

2020-11-18

Topic is good.

Loading...

Reply
ziane

2020-11-18

Good good very good

Loading...

Reply
ziane

2020-11-18

Bien bien article

Loading...

Reply
ziane

2020-11-18

Good good topic.

Loading...

Reply
ziane

2020-11-18

Veeeery niiiiiiiiiiiice

Loading...

Reply
ziane

2020-11-18

Good
Thank you very much.

Loading...

Reply
ziane

2020-11-18

Very very niiiiiiice
And good
Article

Loading...

Reply
ziane

2020-11-18

Nice nice and gooooooooood

Loading...

Reply
ziane

2020-11-18

Bien article mèrci

Loading...

Reply
ziane

2020-11-18

Good article very nice
…………

Loading...

Reply
ziane

2020-11-18

Bien bien article good very nice…………….

Loading...

Reply
ziane

2020-11-18

Niiiiiice niiiiiice good

Loading...

Reply

‘Train Large, Then Compress’ – UC Berkeley BAIR Improves Large Transformer Model Training and Inference

Like this:

46 comments on “‘Train Large, Then Compress’ – UC Berkeley BAIR Improves Large Transformer Model Training and Inference”

Leave a Reply Cancel reply

Related

Share this:

Like this:

46 comments on “‘Train Large, Then Compress’ – UC Berkeley BAIR Improves Large Transformer Model Training and Inference”

Leave a Reply Cancel reply

Related