Softmax loss has become a standard build-in loss function for a lot of mathematical tools like TensorFlow, Torch and Caffe. It is mainly used for classification and has its advantages and disadvantages, the latter of which is the focus of this paper.
Google’s DeepMind introduced its Relation Network (RN) module and a Visual Interaction Network (VIN) that achieved superhuman performance in specific tests by employing a different type of reasoning concerning objects and physical interactions.
“We want to be able to train a machine learning model, like Google Translate, in a few hours,” said Prof. Hwu. Best known for enabling the parallelism capability of the GPU to solve scientific problems like AI.