A group of Google researchers led by Quoc Le — the AI expert behind Google Neural Machine Translation and AutoML — have published a paper proposing attention augmentation. In experiment results, the novel two-dimensional relative self-attention mechanismfor image classification delivers “consistent improvements in image classification.”
With its improved productivity and accuracy and more personalized experience, AI is revolutionizing medical imaging. According to Signify Research, the world market for AI in medical imaging — comprising software for automated detection, quantification, decision support, and diagnosis — will reach US$2 billion by 2023.
If we ask one of today’s AI-powered voice assistants like Alexa and Siri to tell a joke, it might very well come up with something that puts a smile on our face. If however we then asked “Why do you think that joke is funny?” the bot would be stuck for a response. AI researchers want to change that.
Advanced machine learning techniques and the widespread deployment of surveillance cameras have dramatically improved the efficiency and accuracy of human detection systems in airports, train stations, and other sensitive public places. Is this the end of anonymity?
Thanks to the CUDA architecture  developed by NVIDIA, developers can exploit GPUs’ parallel computing power to perform general computation without extra efforts. Our objective is to evaluate the performance achieved by TensorFlow, PyTorch, and MXNet on Titan RTX.
Researchers from Facebook, the National University of Singapore, and the Qihoo 360 AI Institute have jointly proposed OctConv (Octave Convolution), a promising new alternative to traditional convolution operations. Akin to a “compressor” for Convolutional Neural Networks (CNN), the OctConv method saves computational resources while boosting effectiveness.
Interactive movies are redefining cinema and storytelling and opening up a world of possibilities in the entertainment industry. There are no “spoilers” for films with no predetermined endings, whose characters and plots develop based on viewers’ real-time direction. Now, what if these viewers became characters?
ccording to a Fuji Research Laboratory report, the Japanese smart home market is expected to top JP¥4.2 trillion (US$38 billion) in 2025, up 36.3 percent from 2017. The market is being driven by smart devices including smartphones, which already account for more than half the market and are continuing to grow due their ability to conveniently connect IoT devices.
Chinese technology giant Tencent has open-sourced its face detection algorithm DSFD (Dual Shot Face Detector). The related paper DSFD: Dual Shot Face Detector achieves state-of-the-art performance on WIDER FACE and FDDB dataset benchmarks, and has been accepted by top computer vision conference CVPR 2019.
A collaboration between researchers from China’s Beihang University and Microsoft Research Asia has produced TableBank, a new image-based dataset for table detection and recognition built with novel weak supervision from Word and Latex documents on the Internet.
As robots take over industrial manufacturing, specific and accurate robot control is becoming more important. Conventional feedback control methods can effectively solve various types of robot control problems by capturing structures with explicit models such as motion equations.
Facebook AI Research has announced it is open-sourcing PyTorch-BigGraph (PBG), a tool that can easily process and produce graph embeddings for extremely large graphs. PBG can also process multi-relation graph embeddings where a model is too large to fit in memory.
A year ago, Shenzhen-based self-driving start-up Roadstar.ai was cruising along promisingly. In May 2018 the company announced a US$128 million funding round led by Wu Capital and state-backed Shenzhen Capital Group — one of the largest autonomous driving investments ever in China.