model training

by Synced 2024-07-01 10

Achieving 8× Performance Gains with Reinforcement Learning on Synthetic Data in Large Language Models

In a new paper RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold, a research team provides insights into how synthetic data affects performance, suggesting that a specific schema can achieve consistent gains over using only positive data, achieving performance by 8× in synthetic data volume.

by Synced 2023-08-25 4

AI Machine Learning & Data Science Research

DeepMind & Toulouse U Contribute Composable Function Preserving Transformations to Boost Transformer Training

In a new paper Composable Function-preserving Expansions for Transformer Architectures, a research team from Google DeepMind and University of Toulouse introduces parameter expansion transformations for transformer-based neural networks while preserving functionality, enabling the expansion of the capability of the model as needed.

by Synced 2021-11-10 3

AI Machine Learning & Data Science Research

Microsoft India Proposes Varuna: Scalable, Low-Cost Training of Massive Deep Learning Models

A Microsoft Research India team presents Varuna, a system for training massive deep learning models on commodity networking that eliminates the need for specialized hyperclusters and alleviates the cost, scale, and resource utilization challenges of deep learning model training.

by Synced 2021-10-05 1

AI Machine Learning & Data Science Research

DeepMind’s FIRE PBT: Automated Hyperparameter Tuning With Faster Model Training and Better Final Performance

A DeepMind research team proposes Faster Improvement Rate PBT (FIRE PBT) for Population Based Training (PBT), an automated hyperparameter tuning method for neural network training. The novel approach achieves faster improvement rates and better long-term performance.

by Synced 2020-02-06 1

AI Machine Learning & Data Science Research

Radioactive Data: Facebook AI Knows Where You Got Your Training Dataset

Researchers proposed a “radioactive data” technique for subtly marking images in a dataset to help researchers later determine whether they were used to train a particular model.

by Synced 2018-06-29 5

AI Emerging Company

Seattle’s OneClick.ai — Bringing AI a Click Closer

OneClick.ai, a startup founded by two former Microsoft engineers in Seattle, is on a mission to make AI more accessible to businesses. “We design, build and deploy custom AI models as a scalable API that can be accessed from anywhere. Just prepare your data, and we’ll take care of the rest,” says co-founder and CTO Ning Jiang.