Synced - Part 4

by Synced 2024-02-16 2

AI Machine Learning & Data Science Research

DeepMind & Stanford U’s UNFs: Advancing Weight-Space Modeling with Universal Neural Functionals

A research team from Google DeepMind and Stanford University introduces a groundbreaking algorithm known as universal neural functionals (UNFs), which autonomously constructs permutation-equivariant models for any weight space, offering a versatile solution to the architectural constraints encountered in prior works.

by Synced 2024-02-11 2

AI Machine Learning & Data Science Research

Introducing NVIDIA’s Audio Flamingo, the Next Frontier in Audio Language Models

An NVIDIA research team introduces Audio Flamingo, a groundbreaking audio language model that incorporates in-context learning (ICL), retrieval augmented generation (RAG), and multi-turn dialogue capabilities, achieving SOTA performance across various audio understanding tasks.

by Synced 2024-02-07 8

AI Machine Learning & Data Science Nature Language Tech Research

Nomic Embed: The Inaugural Open-Source Long Text Embedding Model Outshining OpenAI’s Finest

In a new paper Nomic Embed: Training a Reproducible Long Context Text Embedder, a Nomic AI research team introduces nomic-embed-text-v1, which marks the inception of the first fully reproducible, open-source, open-weights, open-data text embedding model, capable of handling an extensive context length of 8192 in English.

by Synced 2024-02-05 4

AI Machine Learning & Data Science Research

PokéLLMon Triumph: Georgia Tech Unleashes the First LLM Agent Mastering Human-Level Skills in Pokemon Battles

In a new paper PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models, a Georgia Institute of Technology research team introduces PokéLLMon, a pioneering LLM-embodied agent demonstrating human-competent performance in tactical battle games.

by Synced 2024-01-31 9

AI Machine Learning & Data Science Research

Neural Networks on the Brink of Universal Prediction with DeepMind’s Cutting-Edge Approach

In a new paper Learning Universal Predictors, a Google DeepMind research team proposes the utilization of Universal Turing Machines (UTMs) for generating training data, thereby enhancing meta-learning and enabling trained neural networks capable of mastering universal prediction strategies.

by Synced 2024-01-29 6

AI Machine Learning & Data Science Research

Google and UT Austin’s Game-Changing Approach Distills Vision-Language Models on Millions of Videos

In a new paper Distilling Vision-Language Models on Millions of Videos, a research team introduces a straightforward yet highly effective method to adapt image-based vision-language models to video. The approach involves generating high-quality pseudo-captions for millions of videos, outperforming state-of-the-art methods across various video-language benchmarks.

by Synced 2024-01-27 12

AI Machine Learning & Data Science Research

Stanford U & Open AI’s Meta-Prompting Elevates Language Model Performance, Surpassing Standard Prompting by 17%

In a new paper Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding, the team introduces meta-prompting. This innovative scaffolding approach proves to be highly effective, surpassing standard prompting by 17.1%, expert (dynamic) prompting by 17.3%, and multi-persona prompting by 15.2%.

by Synced 2024-01-23 3

AI Machine Learning & Data Science Research

NVIDIA’s ChatQA Reaches GPT-4 Performance Without Using Data From OpenAI GPT

In a new paper ChatQA: Building GPT-4 Level Conversational QA Models, an NVIDIA research team introduces ChatQA, a suite of conversational question-answering models that achieve GPT-4 level accuracies without relying on synthetic data from OpenAI GPT models.

by Synced 2024-01-22 5

AI Machine Learning & Data Science Research

DeepMind’s GATS: A Novel Module for Seamless Integration of Multimodal Foundation Models

In a new paper GATS: Gather-Attend-Scatter, a Google DeepMind research team introduces Gather-Attend-Scatter (GATS), a pioneering module designed to seamlessly combine pretrained foundation models—whether trainable or frozen—into larger multimodal networks.

by Synced 2024-01-20 20

AI Machine Learning & Data Science Research

Nature’s New Breakthrough: Control Human Language Network via Large Language Model

In a new breakthrough paper Driving and suppressing the human language network using large language models, a research team from Massachusetts Institute of Technology, MIT-IBM Watson AI Lab, University of Minnesota and Harvard University leverages a GPT-based encoding model to identify sentences predicted to elicit specific responses within the human language network.

by Synced 2024-01-17 6

AI Machine Learning & Data Science Research

Google’s AMIE Marks A Significant Milestone Toward Conversational Diagnostic AI

In a new paper Towards Conversational Diagnostic AI, a research team from Google Research and Google DeepMind introduces AMIE (Articulate Medical Intelligence Explorer), an LLM-based AI system meticulously optimized for clinical history-taking and diagnostic dialogues, showcasing superior diagnostic accuracy and outperforming primary care physicians (PCPs).

by Synced 2024-01-09 4

AI Machine Learning & Data Science Nature Language Tech Research

Beyond Behemoths: How Blended Chat AIs Outshine Trillion-Parameters ChatGPT with Elegance

Can a collective of moderately-sized LLMs collaboratively constitute a chat AI with equivalent or superior abilities? Motivated by this query, a new paper “Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM” confirms this idea and introduces the Blended approach.

by Synced 2024-01-07 3

AI Machine Learning & Data Science Nature Language Tech Research

LangSplat: Turbocharging 3D Language Fields with a Mind-Blowing 199x Speed Boost

In a new paper LangSplat: 3D Language Gaussian Splattin, a research team from Tsinghua University and Harvard University introduces LangSplat, a groundbreaking 3D Gaussian Splatting-based method designed for 3D language fields, which surpasses the state-of-the-art LERF method while boasting a remarkable speed improvement of 199 times.

by Synced 2024-01-02 4

AI Machine Learning & Data Science Research

Gemini: Bridging Tomorrow’s Deep Neural Network Frontiers with Unrivaled Chiplet Accelerator Mastery

A research team introduces Gemini, an innovative framework, focusing on both architecture and mapping co-exploration, aims to propel large-scale DNN chiplet accelerators to new heights, achieving an impressive average performance improvement of 1.98× and an energy efficiency boost of 1.41× compared to the state-of-the-art Simba architecture.

by Synced 2023-12-31 4

AI Machine Learning & Data Science Research

Breaking LLMs’ Limits: Upstage AI’s SOLAR 10.7B Shines Bright with Simple Scaling Magic

In a new paper SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling, a Upstage AI research team introduces depth up-scaling (DUS), which emerges as an efficient and uncomplicated technique for amplifying LLMs, surpassing existing open-source state-of-the-art LLMs, such as Llama 2 and Mistral 7B.

by Synced 2023-12-29 2

AI Machine Learning & Data Science Research

Precision Coding Redefined: Microsoft WaveCoder’s Pioneering Approach to Fine-Tuned LLM Model Performance

In a new paper WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation, a Microsoft research team introduces CodeOcean, which harnesses source code to explicitly control data quality, significantly improving the generalization ability of fine-tuned LLM models.

by Synced 2023-12-28 3

AI Machine Learning & Data Science Research

DreamWire: A Generative AI Enabling Everyone to Be Multi-View Wire Artist

In a new paper Wired Perspectives: Multi-View Wire Art Embraces Generative AI, a research team from University of Surrey and Beijing University of Posts and Telecommunications introduces DreamWire, an innovative AI system poised to democratize the creation of MVWA.

by Synced 2023-12-26 2

AI Computer Vision & Graphics Machine Learning & Data Science Research

Reconstructing Videos In Just 14 Seconds: Meta AI’s Fairy Accelerates Video Synthesis by 44×

A Meta GenAI research team introduces Fairy, a versatile and efficient video-to-video synthesis framework. Fairy stands out for its ability to generate high-quality videos at remarkable speed, producing 120-frame 512×384 videos in just 14 seconds, surpassing previous works by a factor of at least 44×.

by Synced 2023-12-22 8

AI Machine Learning & Data Science Nature Language Tech Research

A Robot Chemist Driven by GPT-4 Made Its Debut in Nature: Autonomously Designs Reactions and Performs Complex Experiments

In a new paper Autonomous chemical research with large language models, a research team from Carnegie Mellon University and Emerald Cloud Lab introduces an innovative LLMs-Powered system named Coscientist, which autonomously designs, plans, and executes complex scientific experiments, marking a significant leap forward in the integration of laboratory automation technologies with powerful language models.

by Synced 2023-12-20 5

AI Machine Learning & Data Science Popular Research

DeepMind’s Highly Capable Multimodal Model Gemin Reaches Human-Expert Level

A Google DeepMind research team introduces a groundbreaking family of multimodal models Gemini, which showcase exceptional proficiency across image, audio, video, and text comprehension, pushing the boundaries of large-scale language modeling, image interpretation, audio processing, and video understanding.

by Synced 2023-12-15 12

AI Machine Learning & Data Science Research

New Language Model Breakthrough on Nature: FunSearch Addresses A Longstanding Mathematics Challenge

In a recent paper titled “Mathematical Discoveries from Program Search with Large Language Models, a research team introduces FunSearch—a novel approach that elevates LLM-guided evolutionary procedures. FunSearch not only achieves breakthroughs in established open problems but also leads to the discovery of new algorithms.

by Synced 2023-12-12 1

AI Machine Learning & Data Science Research

Tencent’s FaceStudio Redefines Image Generation with Identity-Preserving Efficiency in Seconds

A recent paper from Tencent’s research team introduces a novel identity-preserving synthesis approach, with a specific focus on human images. The proposed model adopts a direct feed-forward mechanism, eliminating the need for intensive fine-tuning and streamlining the image generation process.

by Synced 2023-12-09 5

AI Machine Learning & Data Science Research

Microsoft’s TaskWeaver: Empowering Intelligent Conversational Agents for Handling Domain-Specific Complex Tasks

A Microsoft research team introduces TaskWeaver, a cutting-edge, code-first framework designed to empower LLM-powered autonomous agents. TaskWeaver offers a potent and flexible platform for constructing intelligent conversational agents capable of handling complex tasks and seamlessly adapting to domain-specific scenarios.

by Synced 2023-12-06 4

AI Machine Learning & Data Science Research

Tencent & Sydney U’s GPT4Video: A Unified Multimodal Large Language Significantly Elevates LMs’ Video Generative Capabilities

A collaborative effort between Tencent AI Lab and The University of Sydney introduces GPT4Video, which stands as a unified multi-model framework that endows Large Language Models (LLMs) with the unique ability for both video understanding and generation.

by Synced 2023-11-30 5

AI Machine Learning & Data Science Research

Spatial-Temporal Innovation: STLVQE Redefines Real-Time Video Enhancement for an Unmatched Viewing Experience

A paper titled “Online Video Quality Enhancement with Spatial-Temporal Look-up Tables” introduces a novel method, STLVQE. This research, conducted by a team from Tongji University and Microsoft Research Asia, pioneers the exploration of the online video quality enhancement problem and presents the first method achieving real-time processing speed.

by Synced 2023-11-28 3

AI Machine Learning & Data Science Research

Adobe’s DMV3D Achieves SOTA Performance for High-Fidelity 3D Objects Generation Within Seconds

A research team innovative single-stage category-agnostic diffusion model. This model can generate 3D Neural Radiance Fields (NeRFs) from either text or a single-image input condition through direct model inference, enabling the creation of diverse high-fidelity 3D objects in just 30s/asset.

by Synced 2023-11-27 1

AI Machine Learning & Data Science Research

DeepMind’s DiLoCo Revolutionizes Language Model Training with 500× Less Communication

In a new paper DiLoCo: Distributed Low-Communication Training of Language Models, a Google DeepMind research team presents Distributed Low-Communication (DiLoCo). DiLoCo employs a distributed optimization algorithm that facilitates the training of language models on islands of poorly connected devices, surpassing the performance of fully synchronous models while reducing communication by 500 times.

by Synced 2023-11-26 3

AI Machine Learning & Data Science Research

Meet LEO: An Embodied Generalist Agent Excelling in 3D World Tasks

In a new paper An Embodied Generalist Agent in 3D World, a research team introduces LEO, which stands as an embodied multi-modal and multi-task generalist agent that excels in essential capabilities such as perception, grounding, reasoning, planning, and action within the intricate 3D world.

by Synced 2023-11-24 10

AI Machine Learning & Data Science Research

ETH Zurich’s UltraFastBERT Realizes 78x Speedup for Language Models

In a new paper Exponentially Faster Language Modelling, an ETH Zurich research team introduces UltraFastBERT, a variant of the BERT architecture. UltraFastBERT takes a revolutionary approach by replacing feedforward layers with fast feedforward networks, resulting in an impressive 78x speedup over the optimized baseline feedforward implementation.

by Synced 2023-11-22 1

AI Machine Learning & Data Science Research

Microsoft Orca 2’s Triumph: Comparable or Superior Performance to Models 5-10x Its Size in Mastering Reasoning Tasks

Microsoft has recently unveiled Orca 2 in a new paper titled “Orca 2: Teaching Small Language Models How to Reason.” to explore how enhanced training signals can augment the reasoning abilities of smaller language models. Notably, Orca 2 surpasses models of similar size, achieving performance levels comparable to or better than models 5-10 times larger.

by Synced 2023-11-18 2

AI Machine Learning & Data Science Research

Democratizing Data: How Apple and UW’s Data Filtering Networks Redefine Large-Scale Training Sets

In a new paper Data Filtering Networks, a research team from Apple and University of Washington introduces the concept of data filtering networks (DFNs). These neural networks, specifically designed for data filtration, demonstrate the capacity to generate extensive, high-quality pre-training datasets efficiently.

by Synced 2023-11-13 6

AI Computer Vision & Graphics Machine Learning & Data Science Research

Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s

In a new paper LRM: Large Reconstruction Model for Single Image to 3D, a research team from Adobe Research and Australian National Univerisity introduces an innovative Large Reconstruction Model (LRM). This groundbreaking model has the remarkable ability to predict a 3D model of an object from a single input image in a mere 5 seconds.

by Synced 2023-11-06 2

AI Machine Learning & Data Science Research

Google’s E3 TTS Provides Effortless Approach to High-Quality Audio Synthesis Through Diffusion Models

In a new paper E3 TTS: Easy End-to-End Diffusion-based Text to Speech, a Google research team proposes Easy End-to-End Diffusion-based Text to Speech. This streamlined and efficient text-to-speech model hinges solely on diffusion to preserve temporal structure, allowing it to accept plain text as input and generate audio waveforms directly.

by Synced 2023-11-01 4

AI Machine Learning & Data Science Research

Apple Repurposes Large Language Models for Reinforcement Learning challenges in Embodied AI

An Apple research team presents Large LAnguage model Reinforcement Learning Policy (LLaRP). LLaRP effectively repurposes LLMs for Reinforcement Learning (RL) challenges within the realm of Embodied Artificial Intelligence (AI), achieving a remarkable 1.7 times higher success rate compared to other established baselines and zero-shot LLM applications.

by Synced 2023-10-31 6

AI Machine Learning & Data Science Research

Supercharging Large Language Models: DEJAVU’s Inference Time Surpasses FasterTransformer by 2×

In a new paper Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time, a research team presents DEJAVU, a system that employs a cost-effective algorithm to predict contextual sparsity dynamically for each layer, combined with an asynchronous and hardware-aware implementation to accelerate LLM inference.

by Synced 2023-10-30 1

AI Machine Learning & Data Science Research

MoE: Revolutionizing Memory-Efficient Execution of Massive-Scale MoE Models

A research team from Institute of Science and Technology Austria (ISTA) and Neural Magic Inc. introduces the QMoE framework. This innovative framework offers an effective solution for accurately compressing massive MoEs and conducting swift compressed inference, reducing model sizes by 10–20×, achieving less than 1 bit per parameter.

by Synced 2023-10-26 4

AI Machine Learning & Data Science Research

DeepMind Verifies ConvNets Can Match Vision Transformers at Scale

In a new paper ConvNets Match Vision Transformers at Scale, a Google DeepMind research team challenges the prevailing belief that Vision Transformers possess superior scaling capabilities compared to ConvNets and provides empirical results revealing that ConvNets can indeed hold their own against Vision Transformers at scale.

by Synced 2023-10-25 5

AI Machine Learning & Data Science Research

Elevating Sample Quality: OpenAI’s Consistency Models Training Techniques Redefine the Game

In a new paper Techniques for Training Consistency Models, an OpenAI research team introduces innovative methods that enable consistency models to learn directly from data, surpassing the performance of consistency distillation (CD) in producing high-quality samples, all while breaking free from the clutches of LPIPS.

by Synced 2023-10-24 2

AI Machine Learning & Data Science Research

Redefining Search Stack: Microsoft Unleashes the Potential of Large Language Models

In a new paper Large Search Model: Redefining Search Stack in the Era of LLMs, a Microsoft research team presents a novel conceptual framework, large search model, which reimagines the conventional search stack by consolidating various search tasks under a single Large Language Model (LLM).

by Synced 2023-10-23 2

AI Machine Learning & Data Science Research

OpenAI & Microsoft’s DALL-E 3 Masters Image Creation Through Enhanced Captions

In a new paper Improving Image Generation with Better Captions, a research team from OpenAI and Microsoft introduces DALL-E 3, a cutting-edge text-to-image generation system that is benchmarked for its prowess in prompt following, coherence, and aesthetics, demonstrating its competitive edge against existing counterparts.

PopularSee all posts

Latest Posts