Neural Network

by Synced 2022-07-13 2

Colossal-AI Seamlessly Accelerates Large Models at Low Costs with Hugging Face

HPC-AI Tech’s flagship open-source and large-scale AI system, Colossal-AI, now allows Hugging Face users to seamlessly develop their ML models in a distributed and easy manner.

by Synced 2022-04-22 5

AI Machine Learning & Data Science Research

DeepMind, Mila & Google Brain Enable Generalization Capabilities for Causal Graph Structure Induction

A research team from DeepMind, Mila – University of Montreal and Google Brain proposes a neural network architecture that learns the graph structure of observational and/or interventional data via supervised training on synthetic graphs, making causal induction a black-box problem that generalizes well to new synthetic and naturalistic graphs.

by Synced 2022-04-19 4

AI Machine Learning & Data Science Popular Research

Toward Self-Improving Neural Networks: Schmidhuber Team’s Scalable Self-Referential Weight Matrix Learns to Modify Itself

In the new paper A Modern Self-Referential Weight Matrix That Learns to Modify Itself, a research team from The Swiss AI Lab, IDSIA, University of Lugano (USI) & SUPSI, and King Abdullah University of Science and Technology (KAUST) presents a scalable self-referential weight matrix (SRWM) that leverages outer products and the delta update rule to update and improve itself.

by Synced 2022-04-06 3

AI Machine Learning & Data Science Research

Google Trains a 540B Parameter Language Model With Pathways, Achieving ‘Breakthrough Performance’

A Google Research team further explores the scaling approach for improving language modelling, leveraging the new Pathways distributed ML system to train a 540 billion parameter autoregressive transformer, Pathways Language Model (PaLM), that achieves state-of-the-art few-shot performance.

by Synced 2022-01-31 1

AI Machine Learning & Data Science Nature Language Tech Research

Sapienza U & OpenAI Propose Explanatory Learning to Enable Machines to Understand and Create Explanations

A research team from Sapienza University and OpenAI introduces an explanatory learning procedure that enables machines to understand existing explanations from symbolic sequences and create new explanations for unexplained phenomena, and further proposes Critical Rationalist Network (CRN) models for discovering explanations for novel phenomena.

by Synced 2022-01-19 1

AI Machine Learning & Data Science Research

Less is More: Understanding Neural Network Decisions via Simplified Yet Informative Inputs

A research team from University Medical Center Freiburg, ML Collective, and Google Brain introduces SimpleBits — an information-reduction method that learns to synthesize simplified inputs that contain less information yet remain informative for the task, providing a new approach for exploring the basis of network decisions.

by Synced 2022-01-10 0

AI Machine Learning & Data Science Research

Counterfactual Memorization in Language Models: Distinguishing Rare from Common Memorization

A team from Google Research, University of Pennsylvania and Cornell University proposes a principled perspective to filter out common memorization for LMs, introducing “counterfactual memorization” to measure the expected change in a model’s prediction and distinguish “rare” (episodic) memorization from “common” (semantic) memorization in neural LMs.

by Synced 2021-06-01 1

AI Machine Learning & Data Science Research

Georgia Tech & Microsoft Reveal ‘Super Tickets’ in Pretrained Language Models: Improving Model Compression and Generalization

A research team from Georgia Tech, Microsoft Research and Microsoft Azure AI studies the collections of “lottery tickets” in extremely over-parametrized models, revealing the generalization performance pattern of winning tickets and proving the existence of “super tickets.”

by Synced 2021-05-26 2

AI Machine Learning & Data Science Nature Language Tech Research

Study Shows Transformers Possess the Compositionality Power for Mathematical Reasoning

A research team from UC Davis, Microsoft Research and Johns Hopkins University extends work on training massive amounts of linguistic data to reveal the grammatical structures in their representations to the domain of mathematical reasoning, showing that both the standard transformer and the TP-Transformer can compose the meanings of mathematical symbols based on their structured relationships.

by Synced 2021-05-20 2

AI Machine Learning & Data Science Popular Research

ETH Zürich Identifies Priors That Boost Bayesian Deep Learning Models

A research team from ETH Zürich presents an overview of priors for (deep) Gaussian processes, variational autoencoders and Bayesian neural networks. The researchers propose that well-chosen priors can achieve theoretical and empirical properties such as uncertainty estimation, model selection and optimal decision support; and provide guidance on how to choose them.

by Synced 2021-05-13 3

AI Machine Learning & Data Science Research

DeepMind Presents Neural Algorithmic Reasoning: The Art of Fusing Neural Networks With Algorithmic Computation

A research team from DeepMind explores how neural networks can be fused with algorithmic computation and demonstrates an elegant neural end-to-end pipeline that goes straight from raw inputs to general outputs while emulating an algorithm internally.

by Synced 2021-05-05 3

AI Machine Learning & Data Science Popular Research

Bronstein, Bruna, Cohen and Velickovic Leverage the Erlangen Programme to Establish the Geometric Foundations of Deep Learning

Twitter Chief Scientist Michael Bronstein, Joan Bruna from New York University, Taco Cohen from Qualcomm AI and Petar Veličković from DeepMind publish a paper that aims to geometrically unify the typical architectures of CNNs, GNNs, LSTMs, Transformers, etc. from the perspective of symmetry and invariance to build an “Erlangen Programme” for deep neural networks.

by Synced 2021-05-03 4

AI Machine Learning & Data Science Research

CMU, UT Austin & Facebook’s CNN Layer Width Optimization Strategies Achieve 320x Overhead Reduction

Researchers from Carnegie Mellon University, the University of Texas at Austin and Facebook AI propose a novel paradigm to optimize widths for each CNN layer. The method is compatible across various width optimization algorithms and networks and achieves up to a 320x reduction in width optimization overhead without compromising top-1 accuracy on ImageNet.

by Synced 2021-04-27 2

AI Machine Learning & Data Science Nature Language Tech Research

Microsoft & Peking U Researchers Identify ‘Knowledge Neurons’ in Pretrained Transformers, Enabling Fact Editing

A research team from Microsoft Research and Peking University peeps into pretrained transformers and investigates how factual knowledge is stored, proposing a method to identify “knowledge neurons,” which can be utilized to explicitly update and erase facts.

by Synced 2021-04-26 2

AI Machine Learning & Data Science Research

Google and UC Berkeley Propose Green Strategies for Large Neural Network Training

A research team from Google and the University of California, Berkeley calculates the energy use and carbon footprint of large-scale models T5, Meena, GShard, Switch Transformer and GPT-3, and identifies methods and publication guidelines that could help reduce their CO2e footprint.

by Synced 2021-03-10 8

AI Emerging Company Machine Learning & Data Science Others Research

Qualcomm AI Maps DL to Quantum Computer via Quantum Field Theory

A team from Qualcomm AI proposes the direct mapping of a deep neural network onto an optical quantum computer through the language of quantum field theory, paving the way for the future development of novel quantum neural network architectures.

by Synced 2021-01-21 0

AI Computer Vision & Graphics Machine Learning & Data Science Nature Language Tech Research

CHI 2021 | ‘AI as Play’ Challenges the Productivity-Based Human-AI Interaction Paradigm

Researchers from Drexel University, Northeastern University and IT University Copenhagen explore how humans interact with AI in such contexts, with a focus on computer games.

by Synced 2021-01-18 2

Global News Machine Learning & Data Science Research US & Canada

ICLR 2021 | UT Austin Training-Free Framework Performs High-Quality NAS on ImageNet in Four GPU Hours

Researchers from the University of Texas, Austin have proposed a novel framework called Training-Free Neural Architecture Search (TE-NAS) for “training-free” neural architecture search.

by Synced 2020-12-08 4

Computer Vision & Graphics Global News Machine Learning & Data Science Research US & Canada

NeurIPS 2020: NVIDIA Achieves AI Training Breakthrough Using Limited Datasets of 1,500 Images

NVIDIA blog introduced company’s latest NeurIPS presentation: applying a novel neural network training technique, adaptive discriminator augmentation, to the popular NVIDIA StyleGAN2 model.

by Synced 2020-07-07 12

AI Computer Vision & Graphics Machine Learning & Data Science Research

‘Beyond the ConvNet’ -Stanford & MIT Neural Network Learns Physical Graph Representations from Video

Researchers introduce a novel Physical Scene Graphs (PSG) approach designed to obtain a better structured understanding of visual scenes.

by Synced 2020-05-30 8

AI Industrial AI Industry Research

How About Letting AI Take Care of Weather Forecasting?

With AI models gaining power and momentum across a number of industries in recent years, meteorological researchers are now applying the tech in satellite data processing, nowcasting, typhoon and extreme weather forecasting and other business and environmental analytics areas.

by Synced 2020-04-16 2

AI Computer Vision & Graphics Machine Learning & Data Science Research

OpenAI Puts CV Models Under Their Microscope

Just as biologists gain insights into organisms by putting model specimens under their microscopes, AI Microscope was designed to help researchers analyze the features that form inside leading CV models.

by Synced 2020-02-06 1

AI Machine Learning & Data Science Research

Radioactive Data: Facebook AI Knows Where You Got Your Training Dataset

Researchers proposed a “radioactive data” technique for subtly marking images in a dataset to help researchers later determine whether they were used to train a particular model.

by Synced 2020-01-20 2

AI Computer Vision & Graphics Machine Learning & Data Science Research

Give Your Apps a New Interface With Neural Style Transfer!

To enable both content creators and end users to seriously restyle their apps’ interfaces while maintaining content detail clarity essential to their usability, researchers from Stanford have proposed ImagineNet, a novel and powerful new tool for interface customisation.

by Synced 2020-01-07 1

AI Machine Learning & Data Science Research

Microsoft Releases NNI V1.3 for AutoML Algorithms and Training

To help users design and tune machine learning models, neural network architectures or complex system parameters in an efficient and automatic way, in 2017 Microsoft Research began developing its Neural Network Intelligence (NNI) AutoML toolkit, open-sourcing v1.0 version in 2018.

by Synced 2019-12-26 1

AI Research

Facebook PointRend: Rendering Image Segmentation

Facebook AI Research team has introduced a new “point-based rendering” neural network module with an iterative subdivision algorithm that can integrate SOTA image segmentation models.

by Synced 2019-12-20 0

AI Conference Research

ICLR 2020 Accepted Papers Announced

The ICLR 2020 conference programme chairs finally put the selection process behind them, announcing 687 out of 2594 papers had made it to ICLR 2020 — a 26.5 percent acceptance rate.

by Synced 2019-12-01 7

AI AI Weekly Research

CAH Black Friday AI Challenge; AI-Generated Thanksgiving Dinner Recipes; Go Master Lee Sedol Retires Because AI

Synced Global AI Weekly December 1st

by Synced 2019-11-27 1

AI Research

Google & Johns Hopkins University | Can Adversarial Examples Improve Image Recognition?

Rather than attempting to defend convolutional networks against them, the researchers introduce a novel enhanced adversarial training scheme, AdvProp, which treats adversarial examples as additional training examples to improve the accuracy of image classification models.

by Synced 2019-10-09 3

AI Research

Watch Out, MIT’s New AI Model Knows What You’re Doing Behind That Wall

For better or worse, AI can now figure out what you’re doing even without “seeing” you. The MIT Computer Science & AI Lab (CSAIL) has unveiled a neural network model that can detect human actions through walls or in extremely dark places.

by Synced 2019-07-01 0

AI Research

Searching for Code? Let a Neural Network Do That for You!

Facebook published a blog article demonstrating their newly developed code search tool, Neural Code Search (NCS).

by Synced 2019-06-14 0

AI Research

Peeking Inside DNNs With Information Theory

Deep learning model performance has taken huge strides, allowing researchers to tackle tasks which were simply not possible for machines less than a decade ago.

by Synced 2019-06-09 1

AI AI Weekly Research

Chatbots, AI Generated Fake Faces & Carbon Footprints — What’s Hot on Reddit?

Synced Global AI Weekly June 9th

by Synced 2019-05-11 0

AI Interview Research

Microsoft Research Asia: Past, Present, and Future of NLP

Microsoft Research Asia (MSRA) has been dubbed the “Whampoa Academy for AI” in reference the elite Chinese military school. MSRA is a bootcamp for NLP research and has trained more than 500 interns, 20 PhDs and 20 postdocs over the past two decades.

by Synced 2019-04-29 3

AI Research

OpenAI Sparse Transformer Improves Predictable Sequence Length by 30x

San Francisco research company OpenAI has developed Sparse Transformer, a deep neural network which outperforms current state-of-the-art techniques for predicting long-sequence data in text, image and sound.

by Synced 2019-04-28 1

AI AI Weekly Industry

Earth Week: AI Helps Build A Sustainable Future

Synced Global AI Weekly April 28th

by Synced 2019-04-19 0

AI Research

Facebook Randomly Wired Neural Networks Outperform Human-designed for Image Recognition

Neural networks for image recognition have matured from simple chain-like models to structures with multiple wiring paths. The emergence of Neural Architecture Search (NAS) can optimize models with more elaborate wiring and operation types.

by Synced 2019-04-05 1

AI Research

DeepMind AI Flunks High School Math Test

DeepMind trained and tested its neural model by first collecting a dataset consisting of different types of mathematics problems. Rather than crowd-sourcing, they synthesized the dataset to generate a larger number of training examples, control the difficulty level and reduce training time.

by Synced 2019-03-28 1

AI Research

BigGAN Trained With Only 4 GPUs!

Andrew Brock, first author of the high-profile research paper Large Scale GAN Training for High Fidelity Natural Image Synthesis (aka “BigGAN”), has posted a GitHub repository of an unofficial PyTorch BigGAN implementation that requires only 4-8 GPUs to train the model.

by Synced 2019-03-27 1

AI Research

New Method Applies Monte Carlo Neural Fictitious Self-Play to Texas Hold’em

Facing the incomplete information environment, the asynchronous neural virtual self-play (ANFSP) method allows AI to learn to generate optimal decisions in multiple virtual environments. The approach has performed well in Texas Hold’em and multiplayer FPS video games.