October, 2021 | Synced

by Synced 2021-10-29 1

DeepMind Study Resolves Delusions in Sequence Models for Interaction and Control

In the new paper Shaking the Foundations: Delusions in Sequence Models for Interaction and Control, a DeepMind research team explores the origins of mismatch problems in sequence models that lack understanding of the cause and effect of their actions, and addresses the problem by treating actions as causal interventions.

by Synced 2021-10-28 1

AI Computer Vision & Graphics Machine Learning & Data Science Research

Softmax-free Vision Transformer With Linear Complexity: Achieving a Superior Accuracy/Complexity Trade-off

Researchers from Fudan University, University of Surrey and Huawei Noah’s Ark Lab identify the limitations of quadratic complexity for vision transformers (ViTs) as rooted in keeping the softmax self-attention during approximations. The team proposes the first softmax-free transformer (SOFT), which reduces the self-attention computation to linear complexity, achieving a superior trade-off between accuracy and complexity.

by Synced 2021-10-27 1

AI Computer Vision & Graphics Machine Learning & Data Science Research

Google Open-Sources SCENIC: A JAX Library for Rapid Computer Vision Model Prototyping and Cutting-Edge Research

A research team from Google Brain and Google Research introduces SCENIC, an open-source JAX library for fast and extensible computer vision research and beyond. JAX currently supports implementations of state-of-the-art vision models such as ViT, DETR and MLP Mixer, and more open-sourced cutting-edge projects will be added in the near future.

by Synced 2021-10-26 3

AI Machine Learning & Data Science Nature Language Tech Research

Facebook AI’s NormFormer Employs Extra Normalization to Significantly Improve Transformer Pretraining

Facebook AI Research proposes NormFormer, an approach that improves pretraining perplexity and downstream task performance for both causal and masked language models, achieving GPT3-Large (1.3B) zero-shot performance 60 percent faster and improving fine-tuned GLUE performance by 1.9 percent.

by Synced 2021-10-25 1

AI Machine Learning & Data Science Research

Facebook AI Releases SaLinA: A Flexible and Simple Library for Learning Sequential Agents

A Facebook AI research team releases SaLinA, a reinforcement learning (RL) library for model-based RL, differentiable environments and multi-agent RL that simplifies the implementation of complex sequential learning models.

by Synced 2021-10-22 0

AI Machine Learning & Data Science Research

Deeper Is Not Necessarily Better: Princeton U & Intel’s 12-Layer Parallel Networks Achieve Performance Competitive With SOTA Deep Networks

In the new paper Non-deep Networks, a research team from Princeton University and Intel Labs argues it is possible to achieve high performance with “non-deep” neural networks, presenting ParNet (Parallel Networks), a novel 12-layer architecture that achieves performance competitive with its state-of-the-art deep counterparts.

by Synced 2021-10-21 0

AI Machine Learning & Data Science Research

DeepMind’s Fictitious Co-Play Trains RL Agents to Collaborate with Novel Humans Without Using Human Data

A DeepMind research team explores the problem of how to train agents to collaborate well with novel human partners without using human data and presents Fictitious Co-Play (FCP), a surprisingly simple approach designed to address this challenge.

by Synced 2021-10-20 0

AI Machine Learning & Data Science Research

Yann LeCun Team Challenges Current Beliefs on Interpolation and Extrapolation Regarding DL Model Generalization Performance

Facebook AI and NYU researchers challenge the conventional wisdom regarding interpolation in machine learning, arguing that interpolation almost never happens on high-dimensional datasets.

by Synced 2021-10-19 1

AI Computer Vision & Graphics Machine Learning & Data Science Research

StyleNeRF: A 3D-Aware Generator for High-Resolution Image Synthesis with Explicit Style Control

In a paper currently under double-blind review for ICLR 2022, researchers propose StyleNeRF, a 3D-aware generative model that can synthesize high-resolution images at interactive rates while preserving high-quality 3D consistency, and can even generalize to unseen views with control on styles and poses.

by Synced 2021-10-18 3

AI Machine Learning & Data Science Nature Language Tech Popular Research

Mention Memory: Incorporating Factual Knowledge From Various Sources Into Transformers Without Supervision

A research team from the University of Southern California and Google proposes TOME, a “mention memory” approach to factual knowledge extraction for NLU tasks. A transformer model with attention over a semi-parametric representation of the entire Wikipedia text corpus, TOME can extract information without supervision and achieves strong performance on multiple open-domain question answering benchmarks.

by Synced 2021-10-15 4

AI Machine Learning & Data Science Research

Google Proposes ARDMs: Efficient Autoregressive Models That Learn to Generate in any Order

A Google Research team introduces Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models and discrete diffusion models that can generate variables in an arbitrary order and upscale variables.

by Synced 2021-10-14 1

AI Machine Learning & Data Science Research

Google Researchers Explore the Limits of Large-Scale Model Pretraining

A Google Research team conducts a systematic exploration comprising more than 4800 experiments on Vision Transformers, MLP-Mixers and ResNets with parameters ranging from 10 million to 10 billion, evaluated on more than 20 downstream image recognition tasks, aiming to capture the nonlinear relationships between performance on upstream and downstream tasks.

by Synced 2021-10-13 0

AI Community Computer Vision & Graphics Global Global News Research

ICCV 2021 Best Papers Announced

On October 13, ICCV 2021 announced its Best Paper Awards, honourable mentions, and Best Student Paper.

by Synced 2021-10-13 5

AI Machine Learning & Data Science Research

NVIDIA’s StyleGAN3 Is Fully Equivariant to Translation and Rotation, Improving GAN-Based Animation Generation

A NVIDIA and Aalto University research team presents StyleGAN3, a novel generative adversarial network (GAN) architecture where the exact sub-pixel position of each feature is exclusively inherited from the underlying coarse features, enabling a more natural transformation hierarchy and advancing GAN-based animation generation.

by Synced 2021-10-12 1

AI Computer Vision & Graphics Machine Learning & Data Science Research

Are Patches All You Need? New Study Proposes Patches Are Behind Vision Transformers’ Strong Performance

A research team proposes ConvMixer, an extremely simple model designed to support the argument that the impressive performance of vision transformers (ViTs) is mainly attributable to their use of patches as the input representation. The study shows that ConvMixer can outperform ViTs, MLP-Mixers and classical vision models.

by Synced 2021-10-11 7

AI Global News Hot Industry Nature Language Tech Research US & Canada

530 Billion Parameters! Microsoft and NVIDIA Trained the Largest Generative Language Model

On October 11, Microsoft introduced the largest and “the most powerful monolithic transformer language model” trained to date, a 530 billion parameter GPT-3-style generative language model.

by Synced 2021-10-11 6

AI Asia China Global News US & Canada

China Has Won AI Battle With U.S., Says Ex-Pentagon Software Chief

According to Reuters, the Pentagon’s former software chief Nicolas Chaillan told the Financial Times on Sunday that “We have no competing fighting chance against China in 15 to 20 years. Right now, it’s already a done deal; it is already over, in my opinion.”

by Synced 2021-10-08 0

AI Computer Vision & Graphics Machine Learning & Data Science Research

Apple Study Reveals the Learned Visual Representation Similarities and Dissimilarities Between Self-Supervised and Supervised Methods

An Apple research team performs a comparative analysis on a contrastive self-supervised learning (SSL) algorithm (SimCLR) and a supervised learning (SL) approach for simple image data in a common architecture, shedding light on the similarities and dissimilarities in their learned visual representation patterns.

by Synced 2021-10-07 1

AI Machine Learning & Data Science Research

Facebook & CMU’s Zero-Shot VideoCLIP Outperforms Fully-Supervised SOTA Methods for Video-Text Understanding

A research team from Facebook AI and Carnegie Mellon University presents VideoCLIP, a contrastive approach for pretraining a unified model for zero-shot video and text understanding without requiring annotated data for downstream tasks.

by Synced 2021-10-06 0

AI Asia Company Global News

Sony Will Launch Edge AI Platform Service for AI Developers

On October 6, Sony Group announced that the company will launch its edge AI platform service Aitrios AI sensing platform in Japan, the U.S. and Europe starting late this year.

by Synced 2021-10-06 1

AI Computer Vision & Graphics Machine Learning & Data Science Research

Google Significantly Improves Visual Representations by Adding Explicit Information Compression

A Google Research team presents compressive variants of SimCLR and BYOL that yield better and more robust visual representations.

by Synced 2021-10-05 4

AI Company EU & UK Global News Research

DeepMind Introduce AlphaFold-Multimer to Predict Multi-Chain Protein Complexes with Better Accuracy

The London-based AI research firm DeepMind has introduced AlphaFold-Multimer, a model that can predict the structure of multi-chain protein complexes with increased accuracy.

by Synced 2021-10-05 1

AI Machine Learning & Data Science Research

DeepMind’s FIRE PBT: Automated Hyperparameter Tuning With Faster Model Training and Better Final Performance

A DeepMind research team proposes Faster Improvement Rate PBT (FIRE PBT) for Population Based Training (PBT), an automated hyperparameter tuning method for neural network training. The novel approach achieves faster improvement rates and better long-term performance.

by Synced 2021-10-04 3

AI Computer Vision & Graphics Machine Learning & Data Science Research

Debiasing Image Datasets: Oxford University Presents PASS, an ImageNet Replacement for Self-Supervised Pretraining

An Oxford University research team presents PASS, a large (1.28M) image collection excluding humans, created as an ImageNet replacement for self-supervised pretraining without technical, ethical or legal issues.

by Synced 2021-10-01 1

AI Machine Learning & Data Science Research

ETH Zurich and NVIDIA’s Massively Parallel Deep RL Enables Robots to Learn to Walk in Minutes

A research team from ETH Zurich and NVIDIA proposes a training framework that enables fast policy generation for real-world robotic tasks — training time can be reduced by multiple orders of magnitude using massive parallelism on a single workstation GPU.