Synced | Synced

by Synced 2024-10-09 3

Scaling Multi-Objective Optimization: Meta & FAIR’s CGPO Advances General-purpose LLMs

In a new paper The Perfect Blend: Redefining RLHF with Mixture of Judges, a research team from Meta GenAI and FAIR developed Constrained Generative Policy Optimization (CGPO), which offers a more structured approach to RLHF, advancing the performance of general-purpose LLMs.

by Synced 2024-10-07 3

AI Machine Learning & Data Science Research

Instant 3D Vision: Apple’s Depth Pro Delivers High-Precision Depth Maps in 0.3 Seconds

Apple introduces Depth Pro, a state-of-the-art foundation model designed for zero-shot metric monocular depth estimation. This model can generate high-resolution depth maps with exceptional clarity and fine detail, producing a 2.25-megapixel depth map in just 0.3 seconds on a standard GPU.

by Synced 2024-10-03 8

AI Machine Learning & Data Science Research

Law of the Weakest Link: Advancing Large Language Models Through Cross-Capability

A joint research team from Meta and the University of Illinois Urbana-Champaign introduces CrossEval, a benchmark designed to assess both individual and cross capabilities. Their findings demonstrate that LLMs often adhere to the “Law of the Weakest Link”—where performance on complex tasks is limited by the weakest capability.

by Synced 2024-09-30 14

AI Machine Learning & Data Science Research

Google’s Zero-Shot Cross-Lingual Voice Transfer for Dysarthric Speakers

In a new paper Zero-shot Cross-lingual Voice Transfer for TTS, a Google research team presents a new VT module that seamlessly integrates into a multilingual TTS system, enabling voice transfer across languages.

by Synced 2024-09-28 11

AI Machine Learning & Data Science Research

Practical Lossless Text Compression: FineZip Delivers 54x Speed Boost via Large Language Models

In a new paper FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression, a research team from UC Berkeley and NYU introduces FineZip, a novel LLM-based compression system designed to significantly reduce compression time.

by Synced 2024-09-24 5

AI Asia China Computer Vision & Graphics Global News Press Release Research

ByteDance Disrupts Video Generation Race with Breakthrough in Multi-Subject Interaction

On September 24, ByteDance’s technology arm, Volcano Engine, introduced two state-of-the-art video generation models, PixelDance and Seaweed, which significantly enhanceContinue Reading

by Synced 2024-09-23 8

AI Machine Learning & Data Science Research

Microsoft’s MarS: A Game-Changer in Financial Market Simulations Powered by Generative AI

A Microsoft Research Asia research team introduces MarS, a financial market simulation engine powered by a Large Market Model, which addresses the unique demands of modeling the market impact of orders while enabling highly realistic, controllable simulations.

by Synced 2024-09-20 10

AI Machine Learning & Data Science Research

MIT’s SciAgents: Automating Scientific Discovery with AI-Powered Graph Reasoning

A research team presents SciAgents which aims to automate the process of scientific discovery by revealing hidden interdisciplinary relationships that traditional research methods often overlook. SciAgents operates on a scale, precision, and exploratory power that far surpasses human-driven approaches.

by Synced 2024-09-17 4

AI Machine Learning & Data Science Research

Stanford’s Landmark Study: AI-Generated Ideas Rated More Novel Than Expert Concepts

A Sandford U’s research team introduces an experimental framework aimed at evaluating LLMs’ ability to generate research ideas. This study, the first of its kind, compares the ideation capabilities of over 100 expert NLP researchers against an LLM-based ideation system.

by Synced 2024-09-13 7

AI Machine Learning & Data Science Research

Revolutionizing Autonomous Agents: Salesforce’s xLAM Outperforms GPT-4

A Salesforce AI Research team presents the xLAM series, a collection of large action models designed to enhance the performance of open-source LLMs for autonomous AI agents. This work aims to accelerate innovation in the field and make high-performance models for agent tasks more accessible.

by Synced 2024-09-11 16

AI Machine Learning & Data Science Research

Outperforming Giants: TinyAgent’s Edge-Based Solution Surpasses GPT-4-Turbo

A research team introduces TinyAgent, a framework designed to train and deploy small, task-specific language models capable of performing function calls for agentic systems at the edge, which outperforms larger models such as GPT-4-Turbo in this specific function-calling ability.

by Synced 2024-09-09 742

AI Machine Learning & Data Science Research

Microsoft’s Fully Pipelined Distributed Transformer Processes 16x Sequence Length with Extreme Hardware Efficiency

A Microsoft research team introduces the Fully Pipelined Distributed Transformer, which leverages the multiple memory hierarchies available in modern GPU clusters, enhancing hardware efficiency and cost-effectiveness while achieving exceptionally high Model FLOPs Utilization (MFU).

by Synced 2024-09-06 25

AI Machine Learning & Data Science Research

Google’s GameNGen: Bringing Real-Time Game Simulation to Life with Neural Models

In a new paper Diffusion Models Are Real-Time Game Engines, a Google research team presents GameNGen, the first game engine powered entirely by a neural model that enables real-time interaction with complex environments over extended sequences, maintaining high-quality output.

by Synced 2024-09-04 7

AI Machine Learning & Data Science Research

Samsung’s MobileQuant: Bringing High-Performance Language Models to Your Pocket

A research team from Samsung makes a first attempt to facilitate LLM deployment on edge devices using integer-only quantization. The proposed MobileQuant, is a post-training quantization technique that reduces both inference latency and energy consumption while preserving accuracy comparable to those achieved with 16-bit activations.

by Synced 2024-08-30 5

AI Machine Learning & Data Science Research

NYU & Stanford’s GPUDrive: Achieving Over 1 Million Steps per Second in Multi-Agent Driving Simulations

A research team presents GPUDrive, a GPU-accelerated multi-agent simulator built on the Madrona Game Engine, which is capable of generating over a million experience steps per second, making it a game-changer for applying sample-inefficient yet powerful reinforcement learning algorithms to multi-agent planner design.

by Synced 2024-08-29 4

AI Machine Learning & Data Science Research

NVIDIA’s Minitron: Compressing Llama 3.1 and Mistral NeMo for Superior Performance in 4B and 8B Models

In a new paper LLM Pruning and Distillation in Practice: The Minitron Approach, an NVIDIA research team presents the Minitron compression strategy, which effectively produces a robust 4B model from Llama 3.1 8B and a cutting-edge Mistral-NeMo-Minitron-8B model derived from Mistral NeMo 12B.

by Synced 2024-08-27 4

AI Machine Learning & Data Science Research

Meta’s Sapiens: Revolutionizing Human Pose, Segmentation, and Depth Estimation with Vision Transformers

In a new paper Sapiens: Foundation for Human Vision Models, a Meta research team introduces Sapiens, a suite of models designed to address four core human-centric vision tasks: 2D pose estimation, body-part segmentation, depth estimation, and surface normal prediction.

by Synced 2024-08-26 5

AI Machine Learning & Data Science Research

Open Sparse Autoencoders Everywhere: The Ambitious Vision of DeepMind’s Gemma Scope

In a new paper Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2, a Google DeepMind research team introduces Gemma Scope, a comprehensive suite of JumpReLU SAEs.

by Synced 2024-08-22 4

AI Machine Learning & Data Science Research

Snowflake’s Arctic-TILT: Matching the Power of Models 1,000x Larger in Document Understanding

A Snowflake research team presents Arctic-TILT, a model that is specifically engineered for large-scale, cost-effective deployment while also being adaptable to various domains. It achieves state-of-the-art performance on benchmarks for both business and long documents.

by Synced 2024-08-17 3

AI Machine Learning & Data Science Research

Apple Intelligence: Unveiling Foundation Models Powering the Future of iOS, iPadOS, and macOS

An Apple research team introduces the foundation language models developed to power Apple Intelligence features. These models include a ∼3 billion parameter model optimized for efficient on-device performance and a larger server-based model designed for Private Cloud Compute.

by Synced 2024-08-15 9

AI Machine Learning & Data Science Research

Google DeepMind’s Robot Mastering Human-Level Table Tennis

In a new paper Achieving Human Level Competitive Robot Table Tennis, a Google DeepMind research team introduces the first robot agent that attains amateur human-level performance in competitive table tennis.

by Synced 2024-08-13 2

AI Machine Learning & Data Science Research

NVIDIA’s Wolf: World Summarization Framework Beats GPT-4V on Video Captioning by 55.6%

In a new paper Wolf: Captioning Everything with a World Summarization Framework, a research team introduces a novel approach known as the WOrLd summarization Framework (Wolf). This automated captioning framework significantly advances video captioning—both in terms of quality (improved by 55.6%) and similarity (improved by 77.4%)—compared to GPT-4V.

by Synced 2024-08-09 4

AI Machine Learning & Data Science Research

From 500 Tokens to One: The Breakthrough Power of Cambridge U’s 500xCompressor

In a new paper 500xCompressor: Generalized Prompt Compression for Large Language Models, a Cambridge U team proposes the 500xCompressor, a method designed to condense extensive natural language contexts into a minimum of just one special token, achieving compression ratios ranging from 6x to 480x.

by Synced 2024-08-06 9

AI Machine Learning & Data Science Research

Llama 3: Meta AI’s Multilingual and Multimodal Marvel

In a new paper The Llama 3 Herd of Models, a Meta AI research team presents Llama 3, a new set of foundation models for language, delivering competitive performance comparing to state-of-the-art language models such as GPT-4 on a plethora of tasks.

by Synced 2024-07-31 2

AI Machine Learning & Data Science Research

From YouTube to Keys: Transforming Internet Data into Robotic Musical Talent

In a new paper PianoMime: Learning a Generalist, Dexterous Piano Player from Internet Demonstrations, a research team introduces PianoMime, a framework for training a robot to play the piano using internet-sourced demonstrations.

by Synced 2024-07-30 7

AI Machine Learning & Data Science Research

Unlocking Generalist AI Potential in Software Development with OpenDevin

In a new paper OpenDevin: An Open Platform for AI Software Developers as Generalist Agents, a research team introduces OpenDevin, an Open Platform for AI Software Developers as Generalist Agents. This community-driven platform supports the development of AI agents that interact with software systems.

by Synced 2024-07-26 4

AI Machine Learning & Data Science Research

From Images to Insights: DeepMind’s Versatile Vision-Language Model PaliGemma Achieves SOTA Results

A DeepMind research team release PaliGemma, a robust and versatile vision language model with 3 billion parameters. PaliGemma excels in transfer learning across various vision and language tasks, achieving state-of-the-art performance in a multitude of open-world applications.

by Synced 2024-07-25 4

AI Computer Vision & Graphics Machine Learning & Data Science Research

Automating Video Highlights: Breakthrough Unsupervised Method Leverages Audio and Visual Cues

A research team from Saskatchewan University and Google introduces an innovative unsupervised method for automatic video highlight detection, eliminating the requirements for manual annotations while achieving superior performance compared to previous methods.

by Synced 2024-07-21 4

AI Machine Learning & Data Science Research

Stanford’s Hypothetical Minds: Revolutionizing Multi-Agent AI with Theory of Mind and Large Language Models

A Stanford University research team proposes Hypothetical Minds, builds on recent advancements in LLM-based agents designed for multi-agent environments, aiming to enhance adaptability in competitive, cooperative, and mixed-motive scenarios with concealed information.

by Synced 2024-07-18 8

AI Machine Learning & Data Science Nature Language Tech Research

Revolutionizing Transformers: DeepMind’s PEER Layer and the Power of a Million Experts

A DeepMind research team introduces PEER, a innovative layer design leverages the product key technique for sparse retrieval from an extensive pool of tiny experts (over a million), which unlocks the potential for further scaling transformer models while maintaining computational efficiency.

by Synced 2024-07-16 2

AI Machine Learning & Data Science Research

Overcoming Computational Challenges in Large Language Model Inference with MInference 1.0

A research team from Microsoft and University of Surrey introduces MInference (Milliontokens Inference), which employs a sparse calculation approach designed to expedite the pre-filling of long-sequence processing. It can reduce inference latency by up to 10 times on an A100 GPU while preserving accuracy.

by Synced 2024-07-12 4

AI Machine Learning & Data Science Research

Mastering Enterprise Chatbots: NVIDIA’s Guide to Building Secure RAG-Based Chatbots with Generative AI

In a new paper FACTS About Building Retrieval Augmented Generation-based Chatbots, an NVIDIA research team introduces the FACTS framework, designed to create robust, secure, and enterprise-grade RAG-based chatbots.

by Synced 2024-07-08 7

AI Machine Learning & Data Science Research

Meta AI Unveils LLM Compiler for Advanced Code and Compiler Optimization

A Meta AI research team introduces Meta Large Language Model Compiler, a suite of robust, openly available, pre-trained models is specifically designed for code optimization tasks, aiming to provide a scalable, cost-effective foundation for further research and development in compiler optimization.

by Synced 2024-07-03 4

AI Machine Learning & Data Science Research

Google’s SecBoost: Boosting Any Loss Function Beyond Zeroth-Order Limits

In a new paper How to Boost Any Loss Function, a Google research team provides a constructive, formal answer, demonstrating that any loss function can be optimized with boosting.

by Synced 2024-07-01 10

AI Machine Learning & Data Science Research

Achieving 8× Performance Gains with Reinforcement Learning on Synthetic Data in Large Language Models

In a new paper RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold, a research team provides insights into how synthetic data affects performance, suggesting that a specific schema can achieve consistent gains over using only positive data, achieving performance by 8× in synthetic data volume.

by Synced 2024-06-28 7

AI Machine Learning & Data Science Research

4.5x Performance Boost: University of Illinois’ Muti-Agent AI System Takes on Cyber Threats

A research team from University of Illinois Urbana-Champaign introduces HPTSA, a multi-agent system that significantly advances cybersecurity exploits, achieving up to 4.5 times better performance on a benchmark of 15 real-world vulnerabilities compared to previous efforts.

by Synced 2024-06-25 6

AI Machine Learning & Data Science Research

Oxford U & DeepMind Harness Cultural Accumulation in Reinforcement Learning

In a new paper Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning, a research team from the University of Oxford and Google DeepMind introduces methods to achieve cultural accumulation in Reinforcement Learning (RL) agents. This research opens new pathways for modeling human culture through artificial systems.

by Synced 2024-06-21 3

AI Machine Learning & Data Science Research

Contrastive Learning Advances Sleep Science: Superior Multi-Modal Model Enhances Disorder Detection

In a new paper SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals, a research team introduces SleepFM, the first attempt at developing a multi-modal contrastive learning (CL) approach for PSG analysis, outperforming baselines in tasks like demographic attribute prediction and sleep stage classification.

by Synced 2024-06-19 6

AI Machine Learning & Data Science Research

Google’s Proofread: AI-Driven Typing Accuracy in One Tap

In a new paper Proofread: Fixes All Errors with One Tap, a Google research team introduces Proofread, an innovative Gboard feature powered by a server-side LLM. This feature allows for seamless sentence and paragraph corrections with a single tap. Launched on Pixel 8 devices, it benefits thousands of users daily.

by Synced 2024-06-17 4

AI Machine Learning & Data Science Research

AI Pioneers Gather at BAAI 2024: Unveiling Innovations in Large-Scaled AI Models for Language, Multimodal, Embodied, Bio-Computing, and FlagOpen 2.0

“Global Vision, Ideas in Collision, Leading Cutting-Edge Innovations” – The 6th annual BAAI Conference successfully concluded on June 15. Over 200 AI scholars and industry leaders gathered to discuss the trajectories and applications of advanced AI technologies.