Bloomberg & JHU’s BloombergGPT: ‘A Best-in-Class LLM for Financial NLP’

In the new paper BloombergGPT: A Large Language Model for Finance, a research team from Bloomberg and Johns Hopkins University presents BloombergGPT, a 50 billion parameter language model trained on a 700 billion token dataset that significantly outperforms current benchmark models on financial tasks.

by Synced

2023-04-04

Comments 21

Large language models (LLMs) popularized by the GPT family have shown impressive language processing, understanding and generating capabilities across diverse domains. Many industry researchers are now exploring ways to improve the task-specific performance of such models and integrate them into their workflows, with US firm Bloomberg emerging as a first-mover in the financial domain.

In the new paper BloombergGPT: A Large Language Model for Finance, a research team from Bloomberg and Johns Hopkins University presents BloombergGPT, a 50 billion parameter language model trained on a 700 billion token dataset that significantly outperforms current benchmark models on financial tasks.

Bloomberg’s goal was to train an LLM capable of achieving best results across a wide range of financial tasks while maintaining competitive performance on general-purpose LLM benchmarks. To this end, the team first leveraged Bloomberg’s extensive data sources to compile what they believe to be the largest-ever finance-specific dataset, comprising 363 billion tokens. This was augmented with various public datasets to reach a total of 700 billion tokens and used to train their 50 billion parameter BloombergGPT model.

BloombergGPT is a decoder-only causal LLM based on the BLOOM (Scao et al., 2022) architecture, comprising 70 layers of transformer decoder blocks with multi-head self-attention, layer-normalization, and a feed-forward network with one hidden layer.

The team used the Amazon AWS SageMaker service for model training and evaluation and the proprietary SageMaker Model Parallelism (SMP) for efficient parallel computing.

In their empirical study, the team compared BloombergGPT with larger baseline models — GPT-NeoX (Black et al., 2022), OPT66B (Zhang et al., 2022a) and BLOOM176B (Scao et al., 2022) — on finance-specific and general-purpose benchmarks.

In the experiments, BloombergGPT achieved the best performance on most financial tasks and comparable or better performance on the general-purpose benchmarks.

“We see tremendous value in having developed the first LLM focused on the financial domain,” says Bloomberg Chief Technology Officer Shawn Edwards, “BloombergGPT will enable us to tackle many new types of applications, while it delivers much higher performance out-of-the-box than custom models for each application, at a faster time-to-market.”

The paper BloombergGPT: A Large Language Model for Finance is on arXiv.

Author: Hecate He | Editor: Michael Sarazen

We know you don’t want to miss any news or research breakthroughs. Subscribe to our popular newsletter Synced Global AI Weekly to get weekly AI updates.

21 comments on “Bloomberg & JHU’s BloombergGPT: ‘A Best-in-Class LLM for Financial NLP’”

Avatar Game

2023-04-19

Large language models and GPT are the outstanding topics that people discuss recently. I am really curious about it but I haven’t practice or test.

Loading...

Reply
Amanda The Adventurer

2023-06-25

Bloomberg’s initiative in developing BloombergGPT demonstrates the potential for large language models to enhance and streamline processes within specific domains. As further research and development take place, we can expect to see more industry-specific language models that cater to the unique needs of various sectors.

Loading...

Reply
SienJoel

2023-06-26

To be honest, the topic of using such a system is very poorly disclosed. What tasks will this AI help to solve? Finance is a rather conservative model even if we consider crypto. See what binary options offer – https://www.binaryoptions.com/tools/ As you can see, the list of popular tools is quite limited. But this is what brings in the main income. That’s why I’m skeptical about the trendy machine learning now. I see that an ordinary person can be successful without using such crutches in trading.

Loading...

Reply
Pingback: Massive Language Fashions LLMs vs. Small Language Fashions SLMs for Monetary Establishments: A 2025 Sensible Enterprise AI Information -
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - Data Shield
Pingback: Giant Language Fashions LLMs vs. Small Language Fashions SLMs for Monetary Establishments: A 2025 Sensible Enterprise AI Information - The News92
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - juicytalk.now
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide — itinai content
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - TechAiReports
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide – The Future Tech
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - Annapoorna Infotech
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - Crypto Simmba
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide – The TechBriefs
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide – CryptoKeeperCanada
Pingback: The 2025 Enterprise AI Guide: Large Language Models (LLMs) vs. small language models (SLMs) for Financial Institutions - AI-trends.today
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - Trending News92
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide – Swisscryptodaily.ch
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide - The Crypto Post
Pingback: Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide – SuperWow Tech News
Pingback: Modelos de idiomas grandes LLM versus modelos de idiomas pequeños SLM para instituciones financieras: una guía práctica de IA de la empresa 2025 - 7 minutos
Pingback: LLMs vs. SLMs: Financial Institutions Enterprise AI 2025 Guide

Bloomberg & JHU’s BloombergGPT: ‘A Best-in-Class LLM for Financial NLP’

Like this:

21 comments on “Bloomberg & JHU’s BloombergGPT: ‘A Best-in-Class LLM for Financial NLP’”

Leave a Reply Cancel reply

Related

Share this:

Like this:

21 comments on “Bloomberg & JHU’s BloombergGPT: ‘A Best-in-Class LLM for Financial NLP’”

Leave a Reply Cancel reply

Related