A research team from NVIDIA, Stanford University and Microsoft Research proposes a novel pipeline parallelism approach that improves throughput by more than 10 percent with a comparable memory footprint, showing that such strategies can achieve high aggregate throughput while training models with up to a trillion parameters.
Information retrieval (IR) is the activity of retrieving information from a collection of sources stored on computers, based on user queries. IR has a century-long history and sits at the heart of many ubiquitous applications such as web search, product recommendation, and personal feeds on social networks.
Apple has unveiled the latest iteration of its smartphone chip: the A12 Bionic SoC (system-on-a-chip). The company made the announcement yesterday at its annual product showcase event in Cupertino, California, hailing the A12 as the industry’s first-ever 7nm chip (the smallest current transistor scale). It will be embedded in Apple’s new XR, XS, and XS Max iPhones.
At the prestigious SIGGRAPH (Special Interest Group on Computer GRAPHics and Interactive Techniques) conference in Vancouver yesterday, NVIDIA CEO Jensen Huang announced Turing, an eighth-generation GPU architecture that brings ray tracing and AI capability to real-time graphics.
NVIDIA founder and CEO Jensen Huang caused a stir at yesterday’s GPU Technology Conference in Santa Clara, USA, when he appeared to dismiss FPGAs as a fit for autonomous vehicle system development: “FPGA is not the right answer,” he said.
Embedded AI can transform a tabletop speaker into a personal assistant; give a robot brains and dexterity; and turn a smartphone into a smart camera, music player, or game console. Traditional processors, however, lack the computational power to support many of these intelligent features.