Deep Learning | Synced

by Synced 2023-08-18 10

HPC-AI Tech Raises 22 Million USD in Series A Funding to Fuel Team Expansion and Business Growth

Singapore – HPC-AI Tech, a pioneering company specializing in efficient large AI model training, is delighted to announce the successful completion of its Series A funding round, securing a total of 22 Million USD.

by Synced 2022-12-19 2

AI Machine Learning & Data Science Research

Will AGI Systems Undermine Human Control? OpenAI, UC Berkeley & Oxford U Explore the Alignment Problem

In the new paper The Alignment Problem From a Deep Learning Perspective, a research team from OpenAI, UC Berkeley and the University of Oxford examines the alignment problem with regard to deep learning, identifying potential issues and how we might mitigate them.

by Synced 2022-07-13 2

AI Machine Learning & Data Science Research

Colossal-AI Seamlessly Accelerates Large Models at Low Costs with Hugging Face

HPC-AI Tech’s flagship open-source and large-scale AI system, Colossal-AI, now allows Hugging Face users to seamlessly develop their ML models in a distributed and easy manner.

by Synced 2022-04-18 1

AI Machine Learning & Data Science Research

Meet DeepDPM: No Predefined Number of Clusters Needed for Deep Clustering Tasks

In the new paper DeepDPM: Deep Clustering With an Unknown Number of Clusters, a research team from the Ben-Gurion University of the Negev presents DeepDPM, an effective deep nonparametric approach that removes the need to predefine the number of clusters in clustering tasks and can infer it instead.

by Synced 2022-02-28 0

AI Machine Learning & Data Science Research

Princeton U’s DataMUX Enables DNNs to Simultaneously and Accurately Process up to 40 Input Instances With Limited Computational Overhead

In the new paper DataMUX: Data Multiplexing for Neural Networks, a Princeton University research team proposes Data Multiplexing (DataMUX). The novel technique enables neural networks to process multiple inputs simultaneously and generate accurate predictions, increasing model throughput with minimal additional memory requirements.

by Synced 2022-02-11 0

AI Machine Learning & Data Science Research

Google Brain’s EvoJAX Hardware-Accelerated Toolkit Significantly Improves Neuroevolutionary Computation

A Google Brain research team introduces EvoJAX, a JAX-based, scalable, general-purpose, hardware-accelerated neuroevolution toolkit that enables neuroevolution algorithms to work with neural networks running in parallel across multiple TPU/GPUs and achieves significant training speedups.

by Synced 2022-02-08 0

AI Machine Learning & Data Science Research

Introducing Alpa: A Compiler Architecture for Automated Model-Parallel Distributed Training That Outperforms Hand-Tuned Strategies

A research team from UC Berkeley, Amazon Web Services, Google, Shanghai Jiao Tong University and Duke University proposes Alpa, a compiler system for distributed deep learning on GPU clusters that automatically generates parallelization plans that match or outperform hand-tuned model-parallel training systems even on the models they were designed for.

by Synced 2022-01-19 1

AI Machine Learning & Data Science Research

Less is More: Understanding Neural Network Decisions via Simplified Yet Informative Inputs

A research team from University Medical Center Freiburg, ML Collective, and Google Brain introduces SimpleBits — an information-reduction method that learns to synthesize simplified inputs that contain less information yet remain informative for the task, providing a new approach for exploring the basis of network decisions.

by Synced 2022-01-18 0

AI Machine Learning & Data Science Research

Microsoft’s DeepSpeed-MoE Makes Massive MoE Model Inference up to 4.5x Faster and 9x Cheaper

A Microsoft research team proposes DeepSpeed-MoE, comprising a novel MoE architecture design and model compression technique that reduces MoE model size by up to 3.7x and a highly optimized inference system that provides 7.3x better latency and cost compared to existing MoE inference solutions.

by Synced 2021-12-31 4

AI Machine Learning & Data Science Research

Microsoft’s Self-Supervised Bug Detection and Repair Approach Betters Baselines By Up to 30%

In the NeurIPS 2021-accepted paper Self-Supervised Bug Detection and Repair, a Microsoft Research team proposes BUGLAB, a self-supervised approach that significantly improves on baseline methods for detecting bugs in real-life code.

by Synced 2021-12-23 1

AI Machine Learning & Data Science Research

Advancing Deep Learning With Collective Intelligence: Google Brain Surveys Recent Developments

A Google Brain research team surveys historical and recent neural network research on complex systems and the incorporation of collective intelligence principles to advance the capabilities of deep neural networks.

by Synced 2021-12-10 1

AI Machine Learning & Data Science Research

MIT Open-Sources a Toolkit for Editing Classifiers by Directly Rewriting Their Prediction Rules

An MIT research team develops a method for directly modifying a classifier’s prediction rules with essentially no additional data collection, enabling users to change a classifier’s behaviour on occurrences of concepts beyond the examples used in the editing process.

by Synced 2021-12-09 13

AI Machine Learning & Data Science Nature Language Tech Research

Peng Cheng Laboratory & Baidu Release PCL-BAIDU Wenxin: The World’s First Knowledge-Enhanced 100-Billion-Scale Pretrained Language Model

Peng Cheng Laboratory (PCL) and Baidu release PCL-BAIDU Wenxin, the world’s first knowledge-enhanced 100-billion-scale pretrained language model and the largest Chinese-language monolithic model with 260 billion parameters. PCL-BAIDU Wenxin achieves state-of-the-art results on more than 60 tasks and significantly advances more than 30 benchmarks for zero-shot and few-shot learning.

by Synced 2021-11-24 0

AI Machine Learning & Data Science Research

DeepMind, Google Brain & World Chess Champion Explore How AlphaZero Learns Chess Knowledge

DeepMind and Google Brain researchers and former World Chess Champion Vladimir Kramnik explore how human knowledge is acquired and how chess concepts are represented in the AlphaZero neural network via concept probing, behavioural analysis, and an examination of its activations.

by Synced 2021-11-23 7

AI Company Global News Research US & Canada

NVIDIA Launch Web App for GauGAN2 to Generate Pictures Through Simple Phrases

On November 22, the NVIDIA blog introduced the interactive demo website app to generate photorealistic landscape images in real-time via text description.

by Synced 2021-11-15 2

AI Computer Vision & Graphics Machine Learning & Data Science Popular Research

A Leap Forward in Computer Vision: Facebook AI Says Masked Autoencoders Are Scalable Vision Learners

In a new paper, a Facebook AI team advances autoencoding methods to the computer vision field and shows that masked autoencoders (MAE) are scalable self-supervised learners.

by Synced 2021-11-12 1

AI Machine Learning & Data Science Research

DeepMind’s One Pass ImageNet: A New Benchmark for Resource Efficiency in Deep Learning

A DeepMind research team presents the One Pass ImageNet (OPIN) problem, designed to study the space and compute efficiency of deep learning in a streaming setting with constrained data storage and to develop model training systems where each example is passed to the system only once.

by Synced 2021-11-10 3

AI Machine Learning & Data Science Research

Microsoft India Proposes Varuna: Scalable, Low-Cost Training of Massive Deep Learning Models

A Microsoft Research India team presents Varuna, a system for training massive deep learning models on commodity networking that eliminates the need for specialized hyperclusters and alleviates the cost, scale, and resource utilization challenges of deep learning model training.

by Synced 2021-11-04 2

AI Machine Learning & Data Science Research

Washington U & Google Study Reveals How Attention Matrices Are Formed in Encoder-Decoder Architectures

In the new paper Understanding How Encoder-Decoder Architectures Attend, researchers from the University of Washington, Google Blueshift Team and Google Brain Team propose a method for decomposing hidden states over a sequence into temporal- and input-driven components, revealing how attention matrices are formed in encoder-decoder networks.

by Synced 2021-11-01 0

AI Machine Learning & Data Science Research

Warsaw U, OpenAI and Google’s Hourglass Hierarchical Transformer Model Outperforms Transformer Baselines

A team from the University of Warsaw, OpenAI and Google Research proposes Hourglass, a hierarchical transformer language model that operates on shortened sequences to alleviate transformers’ huge computation burdens.

by Synced 2021-10-25 1

AI Machine Learning & Data Science Research

Facebook AI Releases SaLinA: A Flexible and Simple Library for Learning Sequential Agents

A Facebook AI research team releases SaLinA, a reinforcement learning (RL) library for model-based RL, differentiable environments and multi-agent RL that simplifies the implementation of complex sequential learning models.

by Synced 2021-10-22 0

AI Machine Learning & Data Science Research

Deeper Is Not Necessarily Better: Princeton U & Intel’s 12-Layer Parallel Networks Achieve Performance Competitive With SOTA Deep Networks

In the new paper Non-deep Networks, a research team from Princeton University and Intel Labs argues it is possible to achieve high performance with “non-deep” neural networks, presenting ParNet (Parallel Networks), a novel 12-layer architecture that achieves performance competitive with its state-of-the-art deep counterparts.

by Synced 2021-09-24 1

AI Machine Learning & Data Science Nature Language Tech Research

Google’s Zero-Label Language Learning Achieves Results Competitive With Supervised Learning

A Google AI research team explores zero-label learning (training with synthetic data only) in natural language processing, and introduces Unsupervised Data Generation (UDG), a training data creation procedure designed to synthesize high-quality training data without human annotations.

by Synced 2021-08-19 3

AI Machine Learning & Data Science Research

100+ Stanford Researchers Publish 200+ Page Paper on the AI Paradigm Shift Introduced by Large-Scale Models

In a 200+ page paper, Percy Liang, Fei-Fei Li, and over 100 other researchers from the Stanford University Center for Research on Foundation Models (CRFM) systematically describe the opportunities and risks of large-scale pretrained “foundation” models. The unique study aims to provide a clearer understanding of how these models work, when and how they fail, and the various capabilities provided by their emergent properties.

by Synced 2021-08-18 5

AI Machine Learning & Data Science Research

Logic Explained Deep Neural Networks: A General Approach to Explainable AI

A research team from Università di Firenze, Università di Siena, University of Cambridge and Universitè Côte d’Azur proposes a general approach to explainable artificial intelligence (XAI) in neural architectures, designing interpretable deep learning models called Logic Explained Networks (LENs). The novel approach yields better performance than established white-box models while providing more compact and meaningful explanations.

by Synced 2021-08-12 2

AI Machine Learning & Data Science Nature Language Tech Research

WMT21 | Detailing WeChat AI & Beijing Jiaotong University’s NMT System Architecture

On August 5, WeChat AI and Beijing Jiaotong University system developers released the paper WeChat Neural Machine Translation Systems for WMT21, revealing the architecture of their novel neural machine translation (NMT) system and the strategies they adopted to achieve impressive performance in the WMT21 competition.

by Synced 2021-08-11 2

AI Machine Learning & Data Science Research

Tokyo U & Preferred Networks Propose a Fast Estimation Method for the Stability of Ensemble Feature Selectors

A research team from Tokyo University and Preferred Networks proposes a fast simulation-based method for estimating the stability of ensemble selectors.

by Synced 2021-08-10 3

AI Machine Learning & Data Science Research

Novel Feature Importance-Aware Transferable Adversarial Attacks Dramatically Improve Transferability

A research team from Zhejiang University, Wuhan University and Adobe Research proposes Feature Importance-Aware Attacks (FIA) that drastically improve the transferability of adversarial examples, achieving superior performance compared to state-of-the-art transferable attacks.

by Synced 2021-08-09 4

AI Machine Learning & Data Science Research

DeepMind’s Perceiver IO: A General Architecture for a Wide Variety of Inputs & Outputs

A DeepMind research team proposes Perceiver IO, a single network that can easily integrate and transform arbitrary information for arbitrary tasks while scaling linearly with both input and output sizes. The general architecture achieves outstanding results on tasks with highly structured output spaces, such as natural language and visual understanding.

by Synced 2021-08-03 7

AI Machine Learning & Data Science Research

DeepMind & Google Use Neural Networks to Solve Mixed Integer Programs

A team from DeepMind and Google Research leverages neural networks to automatically construct effective heuristics from a dataset for mixed integer programming (MIP) problems. The approach significantly outperforms classical MIP solver techniques.

by Synced 2021-07-29 2

AI Machine Learning & Data Science Research

Google & Northwestern U Present Provably Efficient Learning Algorithms for Neural Networks

A research team from Google Research and Northwestern University presents polynomial time and sample-efficient algorithms for learning an unknown depth-2 feedforward neural network with general ReLU activations, aiming to provide insights into whether efficient algorithms exist for learning ReLU networks.

by Synced 2021-07-26 1

AI Machine Learning & Data Science Research

DeepMind’s Epistemic Neural Networks Open New Avenues for Uncertainty Modelling in Large and Complex DL Systems

A research team from DeepMind presents epistemic neural networks (ENNs) as an interface for uncertainty modelling in deep learning, and proposes the KL divergence from a target distribution as a precise metric to evaluate ENNs.

by Synced 2021-07-20 6

AI Machine Learning & Data Science Popular Research

DeepMind’s AlphaFold2 Predicts Protein Structures with Atomic-Level Accuracy

In a new paper published in the prestigious scientific journal Nature, DeepMind presents AlphaFold2, a redesigned neural-network system based on last year’s AlphaFold that can predict protein structures with atomic-level accuracy.

by Synced 2021-07-02 32

AI Machine Learning & Data Science Research

Proposed ‘New Hope’ Blockchain Platforms Enable Large-Scale DNN Training on Smart Contracts

A recent paper proposes A New Hope (ANH), a set of novel blockchain platforms designed to enable the integration of large-scale deep neural networks (DNNs) into smart contracts.

by Synced 2021-06-29 2

AI Machine Learning & Data Science Research

DeepMind & Amii Extend Emphatic Algorithms for Deep RL, Improving Performance on Atari Games

A research team from DeepMind and Amii extends the emphatic method to multi-step deep reinforcement learning (RL) targets, and demonstrates that combining emphatic trace with deep neural networks can improve performance on classic Atari video games.

by Synced 2021-06-25 5

AI Machine Learning & Data Science Research

Google Research’s Prediction Depth: Understanding the Laws that Govern DL Data Processing

A team from Google Research proposes prediction depth, a new measure of example difficulty determined from hidden embeddings. Their study reveals the surprising fact that the prediction depth of a given input has strong connections to a model’s uncertainty, confidence, accuracy and speed of learning for that data point.

by Synced 2021-06-24 2

AI Machine Learning & Data Science Research

Google Survey Explores Methods for Making DL Models ‘Smaller, Faster, and Better’

Researchers from Google conduct a survey on how to make Deep Learning models smaller, faster, and better. The team focuses on core areas of model efficiency, from modelling techniques to hardware support, and open-sources an experiment-based guide and code to help practitioners optimize their model training and deployment.

by Synced 2021-05-28 2

AI Machine Learning & Data Science Research

New IEEE Research Equips Gradient Descent with Angular Information to Boost DNN Training

An IEEE team proposes AngularGrad — a novel optimization algorithm that takes both gradient direction and angular information into consideration. The method successfully reduces the zig-zag effect in the optimization trajectory and speeds up convergence.

by Synced 2021-05-20 2

AI Machine Learning & Data Science Popular Research

ETH Zürich Identifies Priors That Boost Bayesian Deep Learning Models

A research team from ETH Zürich presents an overview of priors for (deep) Gaussian processes, variational autoencoders and Bayesian neural networks. The researchers propose that well-chosen priors can achieve theoretical and empirical properties such as uncertainty estimation, model selection and optimal decision support; and provide guidance on how to choose them.

by Synced 2021-05-13 3

AI Machine Learning & Data Science Research

DeepMind Presents Neural Algorithmic Reasoning: The Art of Fusing Neural Networks With Algorithmic Computation

A research team from DeepMind explores how neural networks can be fused with algorithmic computation and demonstrates an elegant neural end-to-end pipeline that goes straight from raw inputs to general outputs while emulating an algorithm internally.