DeepMind Introduces ‘EATS’ – An Adversarial, End-to-End Approach to TTS
DeepMind researchers have developed EATS, a generative model trained adversarially in an end-to-end manner that achieves performance comparable to SOTA models.
AI Technology & Industry Review
DeepMind researchers have developed EATS, a generative model trained adversarially in an end-to-end manner that achieves performance comparable to SOTA models.
ACM SIGGRAPH has honoured MIT CSAIL postdoctoral researcher Li Tzu-Mao with its 2020 Doctoral Dissertation Award for his PhD thesis Differentiable Visual Computing.
Researchers from Katholieke Universiteit Leuven in Belgium and ETH Zürich in a recent paper propose a two-step approach for unsupervised classification.
DeepMind researchers introduce a framework that aims to solve the problem by enabling simple RL agent implementations to be run at different scales of execution.
Rather than simply treating AI as a tool to be leveraged, this approach reimagines AI as a collaborator that learns from a developer’s needs and subsequently proposes multiple design approaches with different trade-offs in order to enable a rapid and iterative approach to model building.
Today, organizers of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020) announced extending the paper submission deadline by 48 hours.
A total of 1,088 papers out of 4,990 submissions made it to the prestigious machine learning conference.
23 respected machine learning researchers propose that personalized peer-to-peer contact tracing through mobile apps has the potential to shift the paradigm of Covid-19 community spread.
OpenAI announced the upgraded GPT-3 with a whopping 175 billion parameters.
Canadian education technology startup Korbit Technologies has introduced a personalized AI-powered learning experience that it says can help all students learn faster and better in a cost-effective way.
Google Research team proposes the automatic metric BLEURT which is based on the highly successful Google language model BERT.
Former Microsoft executive vice president of Artificial Intelligence and Research Harry Shum (Shen Xiangyang) has been appointed chairman of the board of the Silicon Valley startup News Break.
Zurich-based student and aspiring full-stack software engineer Vincent Dörig has taken LaTeX to the website level with his GitHub project LaTeX.css.
GameGAN, a generative model that learns to visually imitate video game environments by ingesting screenplay and keyboard actions during training.
The research team proposes that colourization performance can be improved dramatically at the instance level for a few reasons.
A new simple baseline for few-shot learning that achieves state-of-the-art performance.
The team introduces photon sources fabricated in silicon that meet a variety of requirements for scalable quantum photonics: high purity, high heralding efficiency, and high indistinguishability.
Google AI researchers introduce Meta-Dataset, a large-scale and diverse benchmark for measuring the ability of few-shot classification models.
To deliver human-level voices to its platform’s billions of users while maintaining strict compute efficiency, Facebook AI researchers have deployed a new neural TTS system that works on CPU servers.
Enter Plan2Explore — a self-supervised RL agent designed to quickly generalize to unseen tasks in a zero or few-shot manner.
A “data echoing” technique that enables these time-consuming upstream training stages to also benefit from accelerators.
The delightful program can animate a 2D avatar in real-time from a webcam video stream input and has garnered 3,700 GitHub stars since its release.
A new paper explores the potentially richer optimizations that could result from a spirit of human-machine teamwork built on complementarity.
This paper proposes a novel graph-constrained generative adversarial network, whose generator and discriminator are built upon relational architecture.
A team from the Allen Institute for Artificial Intelligence and the University of Washington this week introduced TLDR generation, a new automatic summarization task for scientific papers.
Researchers from the University of Bristol, the University of Toronto and the University of Catania explain how they created Epic-Kitchens and introduce new baselines that emphasize the multimodal nature of the largest such egocentric video benchmark.
A team of researchers from Russian AI startup OSAI recently introduced the real-time neural network TTNet, designed for processing high-resolution table tennis videos with both temporal and spatial data.
ICLR 2020 accepted 687 out of 2,594 papers and drew over 5,600 participants from nearly 90 countries.
Researchers from the University of Washington, Virginia Tech and Facebook have introduced an algorithm that can reconstruct dense, geometrically consistent depth for all pixels in monocular videos.
The booming global pet care market is projected to grow to US$280 billion by 2025.
This is the first chatbot to blend a diverse set of conversational skills — including empathy, knowledge, and personality — together in one system.
VidPress is an AI-powered video synthesis tool Baidu Research recently developed in an effort to churn out sleek, professional video content in one click.
In a recent paper, Chen and his colleagues with the Samsung AI Center utilized the WiFi signals to establish a submeter-level localization system that employs WiFi propagation characteristics as users’ location fingerprints.
Yesterday in the r/MachineLearning subreddit, a lighthearted announcement appeared from the “Animal Crossing Artificial Intelligence Workshop (ACAI)”calling for abstracts.
South Korea’s Naver Clova AI Research is one of the institutions behind the unsupervised generative network U-GAT-IT. The tech hasContinue Reading
Researchers “posit that the universes of knowledge and experience available to NLP models can be defined by successively larger world scopes: from a single corpus to a fully embodied and social context.”
YOLOv4 is twice as fast as EfficientDet with comparable performance.
In this article, we take a look at Edwin Catmull’s doctoral dissertation published in 1974, which laid the groundwork for 3D computer graphics.
The latest ResNet improvement comes courtesy researchers from Amazon and UC Davis, who unveiled their Split-Attention Networks, ResNeSt.
Hinton, University Professor Emeritus at the University of Toronto, responded on Reddit, “I have never claimed that I invented backpropagation.







































