Tag: ML

AI Machine Learning & Data Science Research

Breaking LLMs’ Limits: Upstage AI’s SOLAR 10.7B Shines Bright with Simple Scaling Magic

In a new paper SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling, an Upstage AI research team introduces depth up-scaling (DUS), a simple and efficient technique for scaling up LLMs. The resulting model surpasses existing open-source state-of-the-art LLMs such as Llama 2 and Mistral 7B.
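As a rough illustration of the depth up-scaling recipe: two copies of a base model's layer stack are joined, with a few layers dropped at the seam, before continued pretraining. The sketch below is a toy, assuming the configuration reported for SOLAR 10.7B (32 base layers, 8 dropped per copy, 48 total); the "layers" are stand-in labels, not real transformer blocks.

```python
# Toy sketch of depth up-scaling (DUS): stack two copies of a base model's
# layers, removing `drop` layers at the seam (the end of the first copy and
# the start of the second), then continue pretraining the deeper model.

def depth_up_scale(layers, drop=8):
    """Return a deeper stack built from two overlapping copies of `layers`."""
    top = layers[: len(layers) - drop]   # first copy minus its final layers
    bottom = layers[drop:]               # second copy minus its initial layers
    return top + bottom

base = [f"layer_{i}" for i in range(32)]  # a 32-layer base model
scaled = depth_up_scale(base, drop=8)
print(len(scaled))  # 48 layers, matching the reported SOLAR 10.7B depth
```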

AI Machine Learning & Data Science Natural Language Tech Research

A Robot Chemist Driven by GPT-4 Made Its Debut in Nature: Autonomously Designs Reactions and Performs Complex Experiments

In a new paper Autonomous chemical research with large language models, a research team from Carnegie Mellon University and Emerald Cloud Lab introduces an innovative LLM-powered system named Coscientist, which autonomously designs, plans, and executes complex scientific experiments, marking a significant leap forward in the integration of laboratory automation technologies with powerful language models.

AI Machine Learning & Data Science Research

Microsoft’s TaskWeaver: Empowering Intelligent Conversational Agents for Handling Domain-Specific Complex Tasks

A Microsoft research team introduces TaskWeaver, a cutting-edge, code-first framework designed to empower LLM-powered autonomous agents. TaskWeaver offers a potent and flexible platform for constructing intelligent conversational agents capable of handling complex tasks and seamlessly adapting to domain-specific scenarios.

AI Machine Learning & Data Science Research

Spatial-Temporal Innovation: STLVQE Redefines Real-Time Video Enhancement for an Unmatched Viewing Experience

A paper titled “Online Video Quality Enhancement with Spatial-Temporal Look-up Tables” introduces a novel method, STLVQE. This research, conducted by a team from Tongji University and Microsoft Research Asia, pioneers the exploration of the online video quality enhancement problem and presents the first method achieving real-time processing speed.

AI Machine Learning & Data Science Research

DeepMind’s DiLoCo Revolutionizes Language Model Training with 500× Less Communication

In a new paper DiLoCo: Distributed Low-Communication Training of Language Models, a Google DeepMind research team presents Distributed Low-Communication (DiLoCo). DiLoCo employs a distributed optimization algorithm that facilitates the training of language models on islands of poorly connected devices, surpassing the performance of fully synchronous models while reducing communication by 500 times.
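The two-level scheme can be caricatured in a few lines: each worker runs many local optimizer steps with no communication, then a single averaged "pseudo-gradient" drives one outer update. This is a hedged toy on a scalar objective, assuming plain SGD for both levels (the paper's actual setup uses AdamW inner steps and Nesterov momentum outer steps on real models), with illustrative step sizes.

```python
# Toy sketch of a DiLoCo-style round: local training on every worker,
# then one cross-worker exchange of averaged parameter deltas.

def diloco_round(theta, worker_grads, inner_steps=500, lr_inner=0.1, lr_outer=0.7):
    """One communication round over poorly connected workers ("islands")."""
    deltas = []
    for grad in worker_grads:            # one gradient oracle per worker
        local = theta                    # each worker starts from shared params
        for _ in range(inner_steps):     # many cheap local steps, no communication
            local = local - lr_inner * grad(local)
        deltas.append(theta - local)     # pseudo-gradient: how far this worker moved
    outer_grad = sum(deltas) / len(deltas)   # the only cross-worker exchange
    return theta - lr_outer * outer_grad     # outer step toward worker consensus

# Usage: two "islands" minimizing (x - 3)^2, communicating once per round.
theta = 0.0
grad = lambda x: 2.0 * (x - 3.0)
for _ in range(10):
    theta = diloco_round(theta, [grad, grad])
print(round(theta, 3))  # converges toward the optimum x = 3
```

The point of the sketch is the communication pattern: hundreds of inner steps happen between exchanges, so each round costs one synchronization instead of one per step.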

AI Machine Learning & Data Science Research

Microsoft Orca 2’s Triumph: Comparable or Superior Performance to Models 5-10x Its Size in Mastering Reasoning Tasks

In a new paper Orca 2: Teaching Small Language Models How to Reason, Microsoft unveils Orca 2, exploring how improved training signals can enhance the reasoning abilities of smaller language models. Notably, Orca 2 surpasses models of similar size and achieves performance comparable to or better than models 5-10 times larger.

AI Computer Vision & Graphics Machine Learning & Data Science Research

Adobe & ANU’s LRM Reconstructs a 3D Model From a Single Image in 5s

In a new paper LRM: Large Reconstruction Model for Single Image to 3D, a research team from Adobe Research and Australian National University introduces an innovative Large Reconstruction Model (LRM). This groundbreaking model has the remarkable ability to predict a 3D model of an object from a single input image in a mere 5 seconds.

AI Machine Learning & Data Science Research

Google’s E3 TTS Provides Effortless Approach to High-Quality Audio Synthesis Through Diffusion Models

In a new paper E3 TTS: Easy End-to-End Diffusion-based Text to Speech, a Google research team proposes Easy End-to-End Diffusion-based Text to Speech. This streamlined and efficient text-to-speech model hinges solely on diffusion to preserve temporal structure, allowing it to accept plain text as input and generate audio waveforms directly.

AI Machine Learning & Data Science Research

Apple Repurposes Large Language Models for Reinforcement Learning Challenges in Embodied AI

An Apple research team presents Large LAnguage model Reinforcement Learning Policy (LLaRP). LLaRP effectively repurposes LLMs for Reinforcement Learning (RL) challenges within the realm of Embodied Artificial Intelligence (AI), achieving a remarkable 1.7 times higher success rate compared to other established baselines and zero-shot LLM applications.

AI Machine Learning & Data Science Research

Microsoft Azure’s Idea2Img: Enabling Automatic Image Design and Generation with Enhanced Image Quality

In a new paper Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, a Microsoft Azure AI research team introduces Idea2Img, a framework that leverages the capabilities of GPT-4V(ision) to revolutionize automatic image design and generation with enhanced image quality.

AI Machine Learning & Data Science Research

Microsoft’s DeepSpeed-VisualChat: Breaking Boundaries in Multi-Modal Language Models

In a new paper DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention, Microsoft’s DeepSpeed team presents the DeepSpeed-VisualChat framework, designed to extend LLMs with multi-modal capabilities while demonstrating superior scalability, even up to a 70-billion-parameter model size.

AI Machine Learning & Data Science Research

NNAISENSE’s New Class of Generative Model: Bayesian Flow Networks Break Barriers in Handling Discrete Data

A NNAISENSE research team introduces a novel class of generative models known as Bayesian Flow Networks (BFNs). These BFNs combine the power of Bayesian inference with neural networks in an iterative modeling process, enabling successful application to continuous, discretized, and discrete data while maintaining competitive performance.

AI Machine Learning & Data Science Research

Stanford U’s MAPTree: Redefining Decision Trees – Precision, Speed, and Efficiency Unleashed

In a new paper MAPTree: Beating “Optimal” Decision Trees with Bayesian Decision Trees, a Stanford University research team introduces MAPTree, an algorithm that confidently uncovers the maximum a posteriori tree within Bayesian Classification and Regression Trees (BCART) posterior, achieving strong performance with significantly leaner and faster trees.