Columbia University Model Learns Predictability From Unlabelled Video
Researchers propose a novel framework and hierarchical predictive model that learns to identify what is predictable from unlabelled video.
AI Technology & Industry Review
Researchers propose a novel framework and hierarchical predictive model that learns to identify what is predictable from unlabelled video.
A new model surpassed human baseline performance on the challenging natural language understanding benchmark.
OpenAI has trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language.
The novel approach tackles dynamic 3D human-body synthesis from a sparse set of camera views, bettering existing methods on key metrics by significant margins.
Researchers proposed PGDrive, a driving simulator designed to evaluate and improve end-to-end driving agents’ generalization abilities.
A new machine translation method enables global manga fans to enjoy immediate translations of their favourite Japanese comics.
Researchers from Vector Institute, University of Toronto and Google Brain recently studied three common types of intrinsic motivation across seven agents, three Atari games, and the 3D game Minecraft.
Researchers from Tsinghua University have developed an AutoML framework and toolkit specifically designed for graph datasets and tasks.
Researchers combine the effectiveness of the inductive bias in CNNs with the expressivity of transformers to model and synthesize high resolution images.
Rapid and accurate identification of mosquitoes that transmit human pathogens such as malaria is an essential part of mosquito-borne disease surveillance.
As part of our year-end series, Synced highlights 10 artificial intelligence papers that garnered extraordinary attention and accolades in 2020.
WILDS is an ambitious benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications.
This year, 22 Transformer-related research papers were accepted by NeurIPS, the world’s most prestigious machine learning conference. Synced has selected ten of these works to showcase the latest Transformer trends.
Yoshua Bengio and Anirudh Goyal from Mila - Quebec AI Institute delve into human and non-human animal intelligence and how it can inform deep learning.
SlimGANs can easily change model sizes during runtime to implement quality-efficiency trade-offs based on practical needs.
CSAIL researchers propose a framework for image reconstruction tasks using the state-of-the-art generative model StyleGAN2.
The new AI-powered Multi-Ingredient Pizza Generator (MPG) can deliver all these mouth-watering pies and many more.
What if, instead of hard-coding road rules into self-driving algorithms, AI agents were free to come up with their own ways of safely and efficiently sharing the road?
The approach dramatically reduces bandwidth requirements by sending only a keypoint representation [of faces] and reconstructing the source video on the receiver side with the help of generative adversarial networks (GANs) to synthesize the talking heads.
Airbus has developed a new Smart Librarian (SL) FCOM QA system comprising a dialogue engine, retriever (search engine), and QA module.
Google’s UK-based lab and research company DeepMind says its AlphaFold AI system has solved the protein folding problem, a grand challenge that has vexed the biology research community for half a century.
Facebook’s new model enables free-viewpoint rendering of dynamic scenes in a single video.
The SeLf-trAining framework for Distance mEtric learning (SLADE) framework combines self-supervised learning and distance metric learning methods to improve information retrieval performance.
University of Alberta recently proposed U^2-Net, a novel deep network architecture that achieves very competitive performance in salient object detection.
In EMNLP 2020, a Montreal-based research team introduced a large medical text dataset designed to boost abbreviation disambiguation in the medical domain.
A new study by South China University of Technology and Tencent WeChat AI researchers is the latest fruitful attempt to utilize transformer architectures in object detection.
A new DeepMind scalable environment simulator takes a digital approach to the question, enabling the examination of environmental factors on AI agents.
New Spoken Language Understanding (SLU) research from MIT CSAIL and Amazon AI introduces step-skipping semi-supervised frameworks that take speech as input and achieve performance competitive to systems leveraging oracle text.
“Our research provides enriched AR user experiences by enabling a more fine-grained visual recognition feature in AR, which is desirable in a wide range of application scenarios including technical support,” IBM researchers say.
Google Research and DeepMind debut Long-Range Arena (LRA) benchmark for Transformer research on tasks with long sequence lengths.
Google Brain ICLR 2021 submission analyzes learned optimizers’ performance advantage over well-tuned baseline optimizers.
Researchers introduced a modular primitive that uses existing, highly optimized hardware graphics pipelines to deliver performance superior to previous differentiable rendering systems.
Amazon Alexa AI paper asks whether NLU problems could be mapped to question-answering (QA) problems using transfer learning.
Facebook AI says DNNs can perform well without class specific neurons and overreliance on intuition-based methods for understanding DNNs can be misleading.
Probability trees may have been around for decades, but they have received little attention from the AI and ML community.
Amazon extracts an optimal subset of architectural parameters for BERT architecture by applying recent breakthroughs in algorithms for neural architecture search.
Google recently introduced mT5, a multilingual variant of its “Text-to-Text Transfer Transformer” (T5), pretrained on a new Common Crawl-based dataset covering 101 languages.
ICLR 2021 submission proposes LambdaNetworks, a transformer-specific method that reduces costs of modeling long-range interactions for CV and other applications.
Facebook AI open-sourced a multilingual machine translation (MMT) model that translates between any pair of 100 languages without relying on English data.
The UK researchers identify just how much AI research might benefit from the field of animal cognition.







































