Research | Synced

by Synced 2023-10-17 1

Google Unveils the Enigma of Memorization and Generalization in Neural Models

In a new paper What do larger image classifiers memorise?, a Google Research team delivers a comprehensive empirical analysis addressing the question of whether larger neural models exhibit greater memorization tendencies

by Synced 2023-10-12 3

AI Machine Learning & Data Science Research

Microsoft’s DeepSpeed-VisualChat: Breaking Boundaries in Multi-Modal Language Models

In a new paper DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention, a research team from DeepSpeed of Microsoft presents the DeepSpeed-VisualChat framework, which is designed to optimize LLMs by incorporating multi-modal capabilities, demonstrating superior scalability, even up to a 70 billion parameter model size.

by Synced 2023-10-11 1

AI Machine Learning & Data Science Research

Yale U & Google’s HyperAttention: Long-Context Attention with the Best Possible Near-Linear Time Guarantee

In a new paper HyperAttention: Long-context Attention in Near-Linear Time, a research team from Yale University and Google Research presents HyperAttention, an approximate attention mechanism not only offers practical efficiency but also delivers the best near-linear time guarantee for long contexts processing.

by Synced 2023-10-10 2

AI Machine Learning & Data Science Research

Efficiency Meets Quality: Google & JHU Pioneers Conditional Diffusion Distillation in Just 1-4 Sampling Steps

In a new paper Conditional Diffusion Distillation, a research team from Google Research and Johns Hopkins University introduces an innovative framework that distills an unconditional diffusion model into a conditional one, enabling image generation with significantly fewer steps.

by Synced 2023-10-09 2

AI Machine Learning & Data Science Research

Mind-to-Speech: The New Frontier in Neuro Communication Through Perception From Non-Invasive Brain Signals

In a new paper Decoding speech perception from non-invasive brain recordings, a research team from Meta AI, Inria Saclay and PSL University exhibits the remarkable capability to decode speech from brain signals recorded non-invasively through magneto-encephalography (MEG) or electro-encephalography (EEG).

by Synced 2023-10-06 1

AI Machine Learning & Data Science Research

General-Purpose Robot RT-X: A Collaboration between DeepMind and 33 Academic Labs

DeepMind, in collaboration with 33 academic laboratories heralds the arrival of RT-1-X, a novel robotics transformer (RT) model that evolves from RT-1. RT-1-X is meticulously trained on the novel Open X-Embodiment dataset constructed by the researchers and showcases a remarkable 50% improvement in success rates compared to task-specified models.

by Synced 2023-10-04 4

AI Machine Learning & Data Science Research

Microsoft Unveils the Potential of Large Multimodal Models with GPT-4V(ision)

A Microsoft research team conducts an in-depth analysis of the latest model, GPT-4V(ision). Their report delves into the emerging application scenarios and outlines future research directions for GPT-4V-based systems, with the goal of inspiring research on next-generation multimodal task formulation and the development of more robust LLMs.

by Synced 2023-10-03 2

AI Machine Learning & Data Science Research

NNAISENSE’s New Class of Generative Model: Bayesian Flow Networks Break Barriers in Handing Discrete Data

A NNAISENSE research team introduces a novel class of generative models known as Bayesian Flow Networks (BFNs). These BFNs combine the power of Bayesian inference with neural networks in an iterative modeling process, enabling successful application to continuous, discretized, and discrete data while maintaining competitive performance.

by Synced 2023-10-02 2

AI Machine Learning & Data Science Research

Standford U’s MAPTree: Redefining Decision Trees – Precision, Speed, and Efficiency Unleashed

In a new paper MAPTree: Beating “Optimal” Decision Trees with Bayesian Decision Trees, a Stanford University research team introduces MAPTree, an algorithm that confidently uncovers the maximum a posteriori tree within Bayesian Classification and Regression Trees (BCART) posterior, achieving strong performance with significantly leaner and faster trees.

by Synced 2023-09-29 2

AI Machine Learning & Data Science Research

Microsoft’s CodePlan: Unleashing the Power of Language Models for Repository-Level Coding Tasks

In a recent paper, “CodePlan: Repository-level Coding using LLMs and Planning,” a team from Microsoft Research introduces CodePlan—a versatile framework designed to address the complexities of repository-level coding tasks, encompassing extensive code changes across large, interconnected codebases.

by Synced 2023-09-27 2

AI Machine Learning & Data Science Research

Meta AI’s Long-Context LLMs: Redefining the Landscape of Natural Language Processing

In a new paper Effective Long-Context Scaling of Foundation Models, a Meta AI research team presents a series of long-context LLMs, built through the pretraining from LLAMA 2. These models support effective context windows of up to 32,768 tokens and outperform all existing open-sourced models in terms of performance.

by Synced 2023-09-26 1

AI Machine Learning & Data Science Nature Language Tech Research

The Reversal Curse: Uncovering the Intriguing Limits of Language Models

In a new paper titled “The Reversal Curse: LLMs trained on ‘A is B’ fail to learn ‘B is A'” authored by a collaborative research team from Vanderbilt University, the UK Frontier AI Taskforce, Apollo Research, New York University, the University of Sussex, and the University of Oxford, has unveiled a remarkable shortcoming in auto-regressive large language models (LLMs).

by Synced 2023-09-25 2

AI Machine Learning & Data Science Nature Language Tech Research

One half-day of training using a few hundred dollars yields similar results to mainstream large models, open-source and commercial-free domain-specific LLM solution

Being at the forefront of cost reduction and efficiency enhancement for large models, the Colossal-AI team maximizes the core capabilities of LLaMA-2. Through innovative training techniques, Colossal-AI has achieved remarkable results by utilizing only approximately 0.0085 trillion tokens of data, investing 15 hours, and incurring training costs in the range of a few hundred dollars.

by Synced 2023-09-24 10

AI Machine Learning & Data Science Research

Language Models Redefined: Transforming Textual Mastery into Compression Brilliance

In a new paper Language Modeling Is Compression, a collaborative team from Google DeepMind, Meta AI, and Inria delves into the lossless compression capabilities of foundation models, unveiling their achievement of state-of-the-art compression rates across various data types.

by Synced 2023-09-20 1

AI Machine Learning & Data Science Research

From Stagnant to Stunning: Google Transforms Still Images into Photo-Realistic Animations

In a paper titled “Generative Image Dynamics,” a Google research team introduces an innovative approach to model natural oscillation dynamics using a single static image. This approach yields photo-realistic animations derived from a lone image, surpassing the performance of previous methods by a substantial margin.

by Synced 2023-09-20 2

AI Machine Learning & Data Science Nature Language Tech Research

Unveiling the Enigma: Meta AI & UPC Decodes the Inner Workings of Large Scale Language Models

In a new paper Neurons in Large Language Models: Dead, N-gram, Positional, a research team from Meta AI and Universitat Politècnica de Catalunya conducts comprehensive analysis of a family of Open Pre-trained Transformer Language Models (OPT) up to 66b parameters to provide insights of how feed-forward network (FFN) layers act.

by Synced 2023-09-18 9

AI Machine Learning & Data Science Research

Revolutionizing Autonomous Agents: AGENTS Framework Puts Power in Your Hands

In a new paper Agents: An Open-source Framework for Autonomous Language Agents, a research team from AIWaves Inc., Zhejiang University and ETH Zürich releases AGENTS, an open-source framework that enables non-specialists for developing and deploying state-of-the-art autonomous language agents with minimal coding work.

by Synced 2023-09-15 3

AI Machine Learning & Data Science Research

DeepMind Decodes the Puzzle of ‘ Grokking ’ In Neural Network Generalization Through Circuit Efficiency

In a new paper Explaining grokking through circuit efficiency, a DeepMind research team solves the puzzle of the grokking through circuit efficiency theory, revealing that the generalizing solution is slower to learn then memorizing.

by Synced 2023-09-14 2

AI Machine Learning & Data Science Research

Microsoft’s phi-1.5 Challenges LLMs Scaling Law, Showcases the Crucial Rule for ‘Textbook Quality’ Dataset

A Microsoft research team introduce phi-1.5, a 1.3 billion parameter model trained on a vast dataset of 30 billion tokens, remarkably delivering performance that rivals models five times its size. Moreover, it outperforms most non-frontier LLMs in tackling intricate reasoning tasks.

by Synced 2023-09-12 1

AI Machine Learning & Data Science Research

Unlocking the Power of Visual Modeling: Microsoft’s Sparse MoEs Redefine Efficiency and Excellence

An Apple research team introduces the concept of sparse Mobile Vision MoEs (V-MoEs), which represents a streamlined and mobile-friendly Mixture-of-Experts architecture that efficiently downscales Vision Transformers (ViTs) while preserving impressive model performance.

by Synced 2023-09-12 20

AI Machine Learning & Data Science Research

Revolutionizing Optimization: DeepMind Leverages Large Language Models as Intelligent Optimizers

In a new paper Large Language Models as Optimizers, a Google DeepMind research team introduces Optimization by PROmpting (OPRO), an effective method that leverages large language models (LLMs) as optimizers, which can generate optimization solutions conditioned on the natural language that describes the optimization task.

by Synced 2023-09-08 2

AI Machine Learning & Data Science Research

Equall & Apple’s Revolutionizing Transformers: One Wide Feedforward for Unprecedented Efficiency and Accuracy

A collaborative research effort from Equall and Apple delves into the role of the FFN and uncovers a surprising revelation: despite consuming a significant portion of the model’s parameters, the FFN exhibits high redundancy. As a result, the researchers propose sharing a single FFN across both the encoder and decoder, thereby reducing the parameter count while causing only a modest drop in accuracy.

by Synced 2023-09-06 3

AI Machine Learning & Data Science Research

Unlocking Limitless Retrieval Power: Google’s MEMORY-VQ Revolutionizes LLMs with Remarkable Compression

In a new paper MEMORY-VQ: Compression for Tractable Internet-Scale Memory, a Google research team introduces MEMORY-VQ, a novel method that significantly reduce storage requirements for memory-based methods while maintaining high performance, achieving 16x compression rate on the KILT benchmark.

by Synced 2023-09-05 5

AI Machine Learning & Data Science Research

MIT’s AskIt Provides A Unified Programming Interface for Code Generation with LLMs

In a new paper AskIt: Unified Programming Interface for Programming with Large Language Models, a MIT CSAIL research team presents AskIt, a domain-specific language (DSL) tailored for LLMs to accommodate a wide variety of tasks, which substantially reducing practitioners’ developmental overhead and effort for software.

by Synced 2023-09-04 3

AI Machine Learning & Data Science Research

70 billion parameter LLaMA2 model training accelerated by 195% with best foundation model practice upgraded

Colossal-AI provides revolutionary LLaMA2 training efficiency for 8 to 512 GPUs, fine-tuning, and inference solutions. The 70 billion parameter training can be accelerated by 195%, and provides a fully-managed ML cloud platform solution, greatly reducing the cost of large model development and applications.

by Synced 2023-08-31 5

AI Machine Learning & Data Science Research

Meta AI’s Nougat Enables Conversion of Mathematic Expressions from PDF Files to Machine Readable Texts

A Meta AI research team presents Neural Optical Understanding for Academic Documents (Nougat), a Visual Transformer model that can effectively convert scientific documents stored in PDF format to a lightweight markup language, even intensive mathematical equations are involved.

by Synced 2023-08-31 5

AI Machine Learning & Data Science Research

CMU & Tsinghua U’s Prompt2Model Generates Deployable Models Following Natural Language Instructions

In a new paper Prompt2Model: Generating Deployable Models from Natural Language Instructions, a research team from Carnegie Mellon University and Tsinghua University introduces Prompt2Model, a general-purpose approach that is able to use prompting technique to specify system behavior while resulting in a deployable special purpose model that enjoys all the advantages thereof.

by Synced 2023-08-29 4

AI Machine Learning & Data Science Research

Published In Nature: New Breakthrough of Speech-to-Text BCI Achieves Speed of 62 Words/Minute

Speech brain–computer interfaces (BCIs) is a innovate technology that establishes a communication channel between a user and certain devices viaContinue Reading

by Synced 2023-08-29 7

AI Machine Learning & Data Science Nature Language Tech Research

Meta AI Open Sources Code Llama: A SOTA Code-Specialized Llama 2

In a new paper Code Llama: Open Foundation Models for Code, a Meta AI research team releases Code Llama, a family of code-specialized Llama 2 models for code generation and infilling, which achieves state-of-the-art performance against open models on code benchmarks.

by Synced 2023-08-26 12

AI Machine Learning & Data Science Research

Diversifying AI: DeepMind Pushes AI Toward Creative Game Players

In a new paper Diversifying AI: Towards Creative Chess with AlphaZero, a Google DeepMind research team explores whether artificial intelligence can benefit from creative problem-solving mechanisms identified in human intelligence while pushing to the limits of its computational rationality.

by Synced 2023-08-25 4

AI Machine Learning & Data Science Research

DeepMind & Toulouse U Contribute Composable Function Preserving Transformations to Boost Transformer Training

In a new paper Composable Function-preserving Expansions for Transformer Architectures, a research team from Google DeepMind and University of Toulouse introduces parameter expansion transformations for transformer-based neural networks while preserving functionality, enabling the expansion of the capability of the model as needed.

by Synced 2023-08-22 9

AI Machine Learning & Data Science Research

Microsoft’s SpeechX: A Leap in Versatile Generative Speech Synthesis

In a new paper SpeechX: Neural Codec Language Model as a Versatile Speech Transformer, a Microsoft research team presents SpeechX, a versatile, robust, and extensible speech generation model that is capable to address zero-shot TTS and various speech transformation tasks, handling both clean and noisy signals.

by Synced 2023-08-21 5

AI Machine Learning & Data Science Research

Boston U’s Platpus Provides Quick, Cheap, and Powerful Refinement of LLMs, Achieving Top 1 in Open LLM Leaderboard

In a new paper Platypus: Quick, Cheap, and Powerful Refinement of LLMs, a Boston University research team presents Platpus, a family of fine-tuned and merged Large Language Models (LLMs) that achieves the first place in HuggingFace’s Open LLM Leaderboard by performing quick, cheap and powerful refinement of conventional LLMs.

by Synced 2023-08-18 10

AI Machine Learning & Data Science Research

HPC-AI Tech Raises 22 Million USD in Series A Funding to Fuel Team Expansion and Business Growth

Singapore – HPC-AI Tech, a pioneering company specializing in efficient large AI model training, is delighted to announce the successful completion of its Series A funding round, securing a total of 22 Million USD.

by Synced 2023-08-18 7

AI Machine Learning & Data Science Research

Alex Graves’s Team Latest Work, Bayesian Flow Networks Address Discrete Data Generation Issues

In a new paper Bayesian Flow Networks, the NNAISENSE research team presents Bayesian Flow Networks (BFNs), a novel family of generative model manipulates parameters of the data distribution rather than operating on noisy data, which provides an effective solution to deal with discrete data.

by Synced 2023-08-16 5

AI Computer Vision & Graphics Machine Learning & Data Science Research

MIT & Harvard’s Open-Source FAn System Enables Real-Time Any Objects Detection, Tracking, and Following

In a new paper Follow Anything: Open-set detection, tracking, and following in real-time, a research team from MIT and Harvard University presents the follow anything system (FAn), an open-set real-time any object following framework that can detect, segment, track, and follow any object, and is able to adapt to new objects using text, images, or click queries.

by Synced 2023-08-15 14

AI Machine Learning & Data Science Research

Meta AI’s Shepherd Criticize Language Model Outputs to Crash Hallucinations

In a new paper Shepherd: A Critic for Language Model Generation, a Meta AI research team presents Shepherd, a language model that are explicitly tuned to critique model generated outputs as well as to generate feedbacks to suggest improvements on solving the factuality, logical errors, coherence, and alignment issues.

by Synced 2023-08-14 2

AI Machine Learning & Data Science Research

Futureverse’ Universal High-Quality Text-to-Music Generator JEN-1 Makes Significant Advancements

In a new paper JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models, a Futureverse research team presents JEN-1, a universal framework that combines bidirectional and unidirectional modes to generate high-quality music conditioned on either text or music representations.

by Synced 2023-08-13 1

AI Machine Learning & Data Science Research

DeepMind’s AlphaStar Benchmark Improves RL Offline Agent With 90% Win Rate Against SOTA AlphaStar Supervised Agent

In a new paper AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning, a DeepMind research team presents AlphaStar Unplugged, an unprecedented challenging large-scale offline reinforcement learning benchmark that leverages a offline dataset from StarCraft II for RL agents training.

by Synced 2023-08-10 1

AI Machine Learning & Data Science Research

Open-Source Large Autoregressive Vision-Language Models: Institutions Join Forces to Replicate DeepMind’s Flamingo Models

In a new paper OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models, a research team releases OpenFlamingo, an open-source replication of DeepMind’s Flamingo models for training autoregressive vision-language models.