Optimizing Transformers: Microsoft & RUC’s ResiDual Solves Gradient Vanishing and Representation Collapse Issues
In the new paper ResiDual: Transformer With Dual Residual Connections, a team from Microsoft Research, Microsoft Azure Translation, and Renmin University of China proposes ResiDual, a novel transformer architecture that fuses the residual connections of post-layer-normalization (Post-LN) and pre-layer-normalization (Pre-LN) transformers, aiming to retain the benefits of both variants while avoiding their respective weaknesses: the vanishing gradients that afflict Post-LN and the representation collapse that afflicts Pre-LN.
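To make the "dual residual" idea concrete, the sketch below shows one way such a block could be wired up in PyTorch: each sublayer feeds two residual streams, a Post-LN stream that is normalized after every addition and a Pre-LN-style stream that accumulates raw sublayer outputs and is normalized only once before the two streams are fused. This is a minimal illustrative reading of the architecture described above, not the authors' reference implementation; the module names, the feed-forward sublayer used for brevity, and the final fusion step are assumptions for illustration.

```python
import torch
import torch.nn as nn


class DualResidualBlock(nn.Module):
    """One sublayer wrapped with two residual streams (illustrative sketch):
    - a Post-LN stream, normalized after each residual addition;
    - a Pre-LN-style stream that accumulates raw sublayer outputs, unnormalized.
    """

    def __init__(self, d_model: int, sublayer: nn.Module):
        super().__init__()
        self.sublayer = sublayer
        self.post_norm = nn.LayerNorm(d_model)

    def forward(self, x_post: torch.Tensor, x_dual: torch.Tensor):
        out = self.sublayer(x_post)             # sublayer reads the Post-LN stream
        x_post = self.post_norm(x_post + out)   # Post-LN style: add, then normalize
        x_dual = x_dual + out                   # Pre-LN style: accumulate, normalize later
        return x_post, x_dual


class DualResidualEncoder(nn.Module):
    """Stack of dual-residual blocks; the two streams are fused at the end."""

    def __init__(self, d_model: int, num_layers: int):
        super().__init__()
        # For brevity each "sublayer" here is a feed-forward network; a full
        # transformer block would interleave self-attention and FFN sublayers.
        self.blocks = nn.ModuleList([
            DualResidualBlock(
                d_model,
                nn.Sequential(
                    nn.Linear(d_model, 4 * d_model),
                    nn.ReLU(),
                    nn.Linear(4 * d_model, d_model),
                ),
            )
            for _ in range(num_layers)
        ])
        self.final_norm = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x_post, x_dual = x, x
        for block in self.blocks:
            x_post, x_dual = block(x_post, x_dual)
        # Fuse the streams: the Post-LN output plus the once-normalized
        # Pre-LN-style accumulator.
        return x_post + self.final_norm(x_dual)


# Quick shape check on random input (batch, sequence, hidden).
encoder = DualResidualEncoder(d_model=64, num_layers=4)
print(encoder(torch.randn(2, 10, 64)).shape)  # torch.Size([2, 10, 64])
```

The intuition behind this wiring is that the Post-LN stream keeps activations well-scaled at every layer, while the unnormalized accumulator preserves a direct, unattenuated path for gradients from the output back to every sublayer.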