information retrieval

by Synced 2023-06-02 7

Google & Waterloo U Scales Generative Retrieval to Handle 8.8M Passages

In a new paper How Does Generative Retrieval Scale to Millions of Passages? a research team from Google Research and University of Waterloo performs the first empirical study of generative retrieval across various corpus scales, even scaling up to the entire MS MARCO passage ranking task that contains 8.8M passages, aiming to provide insights on scaling generative retrieval to millions of passages.

by Synced 2022-09-29 0

AI Machine Learning & Data Science Research

Microsoft & Xiamen U’s Progressive Distillation Method Sets New SOTA for Dense Retrieval

In the new paper Progressive Distillation for Dense Retrieval, a research team from Xiamen U and Microsoft Research presents PROD, a progressive distillation method for dense retrieval that achieves state-of-the-art performance on five widely used benchmarks.

by Synced 2022-05-31 1

AI Machine Learning & Data Science Nature Language Tech Research

Fact Tracing in LMs: MIT & Google Dataset and Benchmark Track Learned Knowledge Back to the Training Data

In the new paper Tracing Knowledge in Language Models Back to the Training Data, a team from MIT CSAIL and Google Research proposes a benchmark for tracing language models’ assertions to the associated training data, aiming to establish a principled ground truth and mitigate high compute demands for large neural language model training.

by Synced 2021-12-20 4

AI Machine Learning & Data Science Nature Language Tech Research

OpenAI’s WebGPT Crawls a Text-Based Web Environment to Achieve Human-Level Performance on Long-Form QA

An OpenAI research team fine-tunes the GPT-3 pretrained language model to enable it to answer long-form questions by searching and navigating a text-based web browsing environment, achieving retrieval and synthesis improvements and reaching human-level long-form question-answering performance.

by Synced 2018-10-27 0

AI Research

Recurrent Binary Embedding for GPU-Enabled Exhaustive Retrieval from Billion-Scale Semantic Vectors

Information retrieval (IR) is the activity of retrieving information from a collection of sources stored on computers, based on user queries. IR enjoys a history of one century [1], and serves as the heart of many ubiquitous applications such as web search, product recommendation, and personal feeds on social networks.

by Synced 2017-06-26 1

AI Research

A Survey of Question Answering for Math and Science Problem

Overview of the ways to make a machine smart enough to pass the standardized math and science test.