Tag: Deep Neural Networks

AI Machine Learning & Data Science Research

Only Train Once: SOTA One-Shot DNN Training and Pruning Framework

A research team from Microsoft, Zhejiang University, Johns Hopkins University, Georgia Institute of Technology and University of Denver proposes Only Train Once (OTO), a one-shot DNN training and pruning framework that compresses a full, heavy model into a slim architecture in a single pass, without fine-tuning, while maintaining high performance.
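OTO itself is built on zero-invariant parameter groups and a specialized optimizer; the sketch below is not that algorithm, only a minimal, hypothetical illustration of the general idea of one-shot structured pruning, where whole convolutional filters are kept or dropped by their L2 norm and a slimmer layer is rebuilt directly. The function name and the keep_ratio parameter are illustrative choices, not from the paper.

```python
import torch
import torch.nn as nn

def prune_filters_by_norm(conv: nn.Conv2d, keep_ratio: float = 0.5) -> nn.Conv2d:
    """Illustrative one-shot structured pruning (NOT OTO's actual method):
    keep the filters with the largest L2 norm and rebuild a slimmer Conv2d,
    so no fine-tuning pass over the pruned weights is performed here."""
    weight = conv.weight.data                    # shape: (out_ch, in_ch, kH, kW)
    norms = weight.flatten(1).norm(dim=1)        # one L2 norm per output filter
    n_keep = max(1, int(keep_ratio * weight.size(0)))
    keep = norms.topk(n_keep).indices.sort().values

    slim = nn.Conv2d(conv.in_channels, n_keep, conv.kernel_size,
                     stride=conv.stride, padding=conv.padding,
                     bias=conv.bias is not None)
    slim.weight.data = weight[keep].clone()
    if conv.bias is not None:
        slim.bias.data = conv.bias.data[keep].clone()
    return slim

conv = nn.Conv2d(3, 16, 3, padding=1)
slim = prune_filters_by_norm(conv, keep_ratio=0.25)
print(slim)  # a Conv2d with 4 output channels instead of 16
```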

AI Machine Learning & Data Science Research

Microsoft & OneFlow Leverage the Efficient Coding Principle to Design Unsupervised DNN Structure-Learning That Outperforms Human-Designed Structures

A research team from OneFlow and Microsoft takes a step toward automatic deep neural network structure design, exploring unsupervised structure learning that draws on the efficient coding principle, rooted in information theory and computational neuroscience, to learn network structures without label information.

AI Research

BatchNorm + Dropout = DNN Success!

A group of researchers from Tencent Technology, the Chinese University of Hong Kong, and Nankai University recently combined two commonly used techniques, Batch Normalization (BatchNorm) and Dropout, into a single Independent Component (IC) layer inserted before each weight layer to make its inputs more independent.
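Concretely, an IC layer is just BatchNorm followed by Dropout, applied to the inputs of a weight layer rather than to its outputs. A minimal PyTorch sketch of that ordering follows; the layer widths and dropout rate here are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn

class IC(nn.Module):
    """Independent Component layer: BatchNorm followed by Dropout,
    placed *before* the weight layer it feeds."""
    def __init__(self, num_features: int, p: float = 0.1):
        super().__init__()
        self.norm = nn.BatchNorm1d(num_features)
        self.drop = nn.Dropout(p)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.drop(self.norm(x))

# A small MLP in the IC -> weight -> activation ordering described above;
# the widths (64, 128) and p=0.1 are arbitrary illustrative choices.
net = nn.Sequential(
    IC(64), nn.Linear(64, 128), nn.ReLU(),
    IC(128), nn.Linear(128, 10),
)
x = torch.randn(32, 64)
print(net(x).shape)  # torch.Size([32, 10])
```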

AI Research

Global Minima Solution for Neural Networks?

New research from Carnegie Mellon University, Peking University and the Massachusetts Institute of Technology shows that global minima of deep neural networks can be achieved via gradient descent under certain conditions. The paper Gradient Descent Finds Global Minima of Deep Neural Networks was published November 12 on arXiv.
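Schematically, guarantees of this kind say that if the network is sufficiently over-parameterized relative to the training set, gradient descent drives the training loss to zero at a geometric rate. The LaTeX fragment below is a rough, generic statement of that shape, not the paper's exact theorem; the precise width requirements and constants are in the paper.

```latex
% Schematic convergence guarantee for an over-parameterized network:
% with width m = \mathrm{poly}(n, \text{depth}) and a suitable step size \eta,
L(\theta_t) \le \left(1 - \tfrac{\eta \lambda_0}{2}\right)^{t} L(\theta_0),
% where \lambda_0 > 0 is the least eigenvalue of a Gram matrix induced
% by the n training samples, so the training loss L(\theta_t) \to 0.
```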