Training on model-generated synthetic data is a promising approach for fine-tuning Large Language Models (LLMs). However, opinions among researchers are divided. Some highlight the benefits of synthetic data, while others caution that it can negatively impact model performance.
In a new paper RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold, a research team from Carnegie Mellon University, Google DeepMind and MultiOn provides insights into how synthetic data affects performance. Their findings suggest that a specific schema can achieve consistent gains over using only positive data, achieving performance equivalent to an eightfold increase in synthetic data volume.

The researchers aim to understand synthetic data’s impact on LLM capabilities via a study on math reasoning, a prevalent scenario where synthetic data is used. They derive scaling laws for positive and negative data on common reasoning benchmarks such as GSM8K and MATH.


The researchers focused on understanding the impact of synthetic data on LLM capabilities through a study on math reasoning, a common application for synthetic data. They derived scaling laws for both positive and negative data using reasoning benchmarks like GSM8K and MATH. Their key observations include:
Training on positive synthetic data from capable models results in significantly slower scaling rates compared to standard empirical risk minimization.
- Using model-generated positive synthetic data can improve sample efficiency by 2× but also increases spurious correlations.
- Constructing learner-specific negative data with a focus on critical steps leads to performance gains equivalent to an eightfold increase in positive data.
- Training with negative data helps unlearn spurious correlations.
- They present a conceptual model inspired by reinforcement learning (RL) to explain these observations and the generalization benefits of synthetic data.
Overall, this study provides valuable insights and conceptual models to understand the role of synthetic data in reasoning tasks. It validates that consistent gains can be achieved over using only positive data, and that training on per-step negatives can help unlearn spurious correlations, offering robustness benefits similar to those of reinforcement learning.
The paper RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold is on arXiv.
Author: Hecate He | Editor: Chain Zhang

Our mission is to provide high-quality cleaning solutions that meet the unique needs of our clients, ensuring their complete satisfaction. Our cleaning service nyc is a premier cleaning company dedicated to delivering exceptional cleaning services to both residential and commercial clients. With years of experience in the industry, we have built a reputation for reliability, professionalism, and outstanding customer service.
Best MLM Software in india at Cryptocurrency MLM Software
We provide crypto MLM software development in India as complete software solutions for Blockchain & Cryptocurrency, Mobile Application, Wallet Development.
The synthesis based on real technology data analysis source brings quality research coreball information. Learn more to apply it effectively. Exploit and explore the markets to enhance the experience together.
The “Spellie Wordle game for kids online” provides a child-centered word puzzle experience with intuitive design and positive reinforcement that makes spelling practice delightful for young learners.
Access the “latest tech stories from DualMedia News” for up-to-the-minute coverage of major tech IPOs, cybersecurity breach disclosures, and groundbreaking hardware announcements from international tech expos.
Access the “latest tech stories from DualMedia News” for up-to-the-minute coverage of major tech IPOs, cybersecurity breach disclosures, and groundbreaking hardware announcements from international tech expos.
This blog series is interesting. Articles are coherent with individual highlights.
very nicely written i appreciate the author for bringing up this topic and write it soo elegantly.
good guide the things that are mentioned in this are not mentioned anywhere else really well researched.
This latest research on using synthetic data to boost LLM performance is fascinating! It offers great insights for those exploring AI advancements. Speaking of innovative experiences, check out this unique game that blends music creation with horror elements: スプランキー.