MonoLayout | Bird’s-Eye Layout Estimation from A Single Image
MonoLayout, a practical deep neural architecture that takes just a single image of a road scene as input and outputs an amodal scene layout in bird’s-eye view.
AI Technology & Industry Review
Global machine intelligence updates.
MonoLayout, a practical deep neural architecture that takes just a single image of a road scene as input and outputs an amodal scene layout in bird’s-eye view.
In a bid to raise awareness of the threats posed by climate change, the Mila team recently published a paper that uses GANs to generate images of how climate events may impact our environments — with a particular focus on floods.
Joseph Redmon, creator of the popular object detection algorithm YOLO, tweeted last week that he had ceased his computer vision research to avoid enabling potential misuse of the tech.
Synced Global AI Weekly February 23rd
Researchers have proposed a novel self-adversarial learning (SAL) paradigm for improving GANs’ performance in text generation.
Total disinfection coverage of DJI agricultural drones exceeds 600 million square meters across more than 1,000 villages.
Bayesian inference meanwhile leverages Bayes’ theorem to update the probability of a hypothesis as additional data becomes available. How can Bayesian inference benefit deep learning models?
DeepMind announced yesterday the release of Haiku and RLax — new JAX libraries designed for neural networks and reinforcement learning respectively.
Researchers from Italy’s University of Pisa present a clear and engaging tutorial on the main concepts and building blocks involved in neural architectures for graphs.
Researchers have proposed a novel generator network specialized on the illustrations in children’s books.
Researchers have proposed a simple but powerful “SimCLR” framework for contrastive learning of visual representations.
A recent Google Brain paper looks into Google’s hugely successful transformer network — BERT — and how it represents linguistic information internally.
The tool enables researchers to try, compare, and evaluate models to decide which work best on their datasets or for their research purposes.
Synced Global AI Weekly February 16th
Thanks to AI technologies such as image recognition and machine learning, people can now save time, food and money in the kitchen while discovering creative and tasty recipes and even generating their own new and personalized flavours.
The study introduces an Event Recognition in Aerial video (ERA) dataset comprising 2,866 aerial videos collected from YouTube and annotated with labels from 25 different classes corresponding to an event that can be seen unfolding over a period of five seconds.
Researchers from Google Brain and Carnegie Mellon University have released models trained with a semi-supervised learning method called “Noisy Student” that achieve 88.4 percent top-1 accuracy on ImageNet.
Researchers have introduced the first unsupervised learning approach for identifying interpretable semantic directions in the latent space of generative adversarial network (GAN) models.
Deep learning models are getting larger and larger to meet the demand for better and better performance. Meanwhile, the timeContinue Reading
Researchers introduced semantic region-adaptive normalization (SEAN), a simple but effective building block for conditional Generative Adversarial Networks (cGAN).
The Godfathers of AI and 2018 ACM Turing Award winners Geoffrey Hinton, Yann LeCun, and Yoshua Bengio shared a stage in New York on Sunday night at an event organized by the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020).
In a bid to simplify 3D deep learning and improve processing performance and efficiency, Facebook recently introduced an open-source framework for 3D computer vision.
Synced Global AI Weekly February 9th
The crucial step now is to develop matching vaccines and drugs to uproot its existence, and China’s big tech companies have stepped up to help.
Batchboost is a simple technique to accelerate ML model training by adaptively feeding mini-batches with artificial samples which are created by mixing two examples from the previous step – in favor of pairing those that produce the difficult one.
In an effort to enrich resources for multispeaker singing-voice synthesis, a team of researchers from the University of Tokyo has developed a Japanese multispeaker singing-voice corpus.
Researchers proposed a “radioactive data” technique for subtly marking images in a dataset to help researchers later determine whether they were used to train a particular model.
In a new paper, researchers from the University of Toronto, Vector Institute, and University of Wisconsin-Madison propose SISA training, a new framework that helps models “unlearn” information by reducing number of updates that need to be computed when data points are removed.
In a new paper, researchers from the New York University and Modl.ai, a company applying machine learning to game developing, suggest that simple spacial processing methods such as rotation, translation and cropping could help increase model generality.
The tool can significantly accelerate the prediction time of a virus’s RNA secondary structure, affording frontline researchers an opportunity to better understand the virus and develop targeting vaccines in a time of crisis.
Facebook’s new HiPlot is a lightweight interactive visualization tool that takes this further, using parallel plots to discover correlations and patterns in such high-dimensional data.
A new paper from the University of Washington Seattle and the University of California, Berkeley looks at saddle points on Riemannian Manifolds. In this article Synced takes a deep dive into this important research.
Synced Global AI Weekly February 2nd
Advancements in machine learning in recent years have enabled a number of novel offerings in the pet retail market, pushing smart pet products sales to US$565 million in 2018.
Now, DeepMind and University College London (UCL) have introduced a new deep network called MEMO which matches SOTA results on Facebook’s bAbI dataset for testing text understanding and reasoning, and is the first and only architecture capable of solving long sequence novel reasoning tasks.
A new study suggests human-to-human transmission of the 2019 Novel Coronavirus (2019-nCoV) may have started as early as mid December, 2019.
One of a new breed of open-domain chatbots designed to engage in conversations across any topic, Meena’s free and natural conversational abilities are closing the gap on human performance.
Facebook AI researchers have further developed the BART model with the introduction of mBART.
A team of researchers from the Natural Language Processing Lab at the University of British Columbia in Canada have proposed AraNet, a deep learning toolkit designed for Arabic social media processing.
Inspired by the performance of attention mechanisms in NLP, researchers have explored the possibility of applying them to vision tasks.