Supervised Learning

by Synced 2023-10-19

NVIDIA’s STEERLM Approach: Empowering User-Steerable Language Models

In the new paper SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF, an NVIDIA research team introduces STEERLM, a supervised fine-tuning method that lets end users control model responses at inference time and that outperforms state-of-the-art baselines, including RLHF models such as ChatGPT-3.5.

by Synced 2022-11-30

AI Machine Learning & Data Science Research

DeepMind Studies Process- vs Outcome-based Model Supervision, Significantly Reducing Reasoning Errors on Math Word Problems

In the new paper Solving Math Word Problems With Process- and Outcome-based Feedback, a DeepMind research team conducts the first comprehensive comparison between process- and outcome-based model supervision. The two approaches achieve comparable final-answer error rate improvements on math word problems, while the process-based method significantly reduces reasoning errors from 14.0 to just 3.4 percent.

by Synced 2019-03-27

AI Research

New Method Applies Monte Carlo Neural Fictitious Self-Play to Texas Hold’em

Designed for incomplete-information environments, the asynchronous neural fictitious self-play (ANFSP) method allows AI to learn to make optimal decisions across multiple virtual environments. The approach has performed well in Texas Hold’em and multiplayer FPS video games.

by Synced 2019-03-26

AI Research

Snorkel DryBell Exploits the Strength of Weakly Supervised ML for Information Integration

Snorkel DryBell is an experimental internal system that leverages the open-source Snorkel framework to harness existing organizational knowledge resources and generate training data for web-scale machine learning models.

by Synced 2018-12-29

AI Research

Explore, Exploit, and Explode — The Time for Reinforcement Learning is Coming

Reinforcement learning (RL) has produced spectacular achievements, including Atari games, AlphaGo, AlphaGo Zero, AlphaZero, DeepStack, Libratus, OpenAI Five, Dactyl, DeepMimic, Capture the Flag, learning to dress, data center cooling, chemical synthesis, and drug design.

by Synced 2017-06-12

Industry Research

Epic’s Tim Sweeney: Deep Learning A.I. Will Open New Frontiers in Game Design

Tim Sweeney predicts that video game companies will ultimately adopt advanced AI techniques once VR/AR games (so-called “Metaverse” games) become popular thanks to advances in cameras and displays.

by Synced 2017-05-19

AI Research

Big Picture Machine Learning: Classifying Text with Neural Networks and TensorFlow

TensorFlow is one of the most popular open-source AI libraries. Its high computing efficiency and rich development resources have led to wide adoption by companies and individual developers.

by Synced 2017-02-28

Industry

AI in News Reporting: Machines are Now Writing Dialogues, Q&As, and News Articles

Chief Scientist Lei Li of Toutiao discusses how to apply machine learning to natural language understanding and to producing machine-written news articles.

by Synced 2017-02-25

Research

Yann LeCun: Predicting under Uncertainty, the Next Frontier in AI

Professor Yann LeCun discussed “Predicting under Uncertainty: The Next Frontier in AI” in a lecture at the University of Edinburgh.