Pretrained large language models (LLMs) have emerged as a driving force in the evolution of AI systems, and a global race is on to make such models even more powerful. Promising research directions for improving LLMs include model-specific fine-tuning and task-specific prompt engineering. Both approaches, however, have downsides: the former can be computationally costly, while the latter lacks generalization capabilities.
In the new paper UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation, a Microsoft research team introduces a novel approach that tunes a lightweight and versatile retriever to retrieve prompts for any given task input to improve the zero-shot performance of LLMs.

The team summarizes their main contributions as follows:
- We introduce UPRISE, a lightweight and versatile approach for improving the zero-shot performance of LLMs in cross-task and cross-model scenarios.
- UPRISE is tuned with GPT-Neo-2.7B but can also benefit LLMs of much larger scale, such as BLOOM-7.1B, OPT-66B, and GPT3-175B.
- Our exploration with ChatGPT demonstrates UPRISE's potential to improve the performance of even the strongest LLMs.
The UPRISE prompting process comprises two straightforward steps: retrieve, then predict. Given a task input, UPRISE first retrieves a set of relevant prompts from a pre-constructed pool, then concatenates them with the input to form an input sequence. This sequence is fed to a frozen LLM (i.e., one whose weights remain fixed), which generates the predicted output.
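To make the pipeline concrete, here is a minimal sketch of the retrieve-then-predict loop, assuming a small off-the-shelf embedding model as the retriever and a toy prompt pool; these stand-ins are illustrative, not the paper's actual retriever or pool:

```python
# Minimal sketch of UPRISE-style "retrieve, then predict" inference.
# The prompt pool, retriever, and decoding settings are illustrative
# stand-ins, not the paper's actual components.
from sentence_transformers import SentenceTransformer, util
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical pre-constructed prompt pool (UPRISE builds its pool from
# demonstrations of the training tasks).
PROMPT_POOL = [
    "Review: 'A gripping, well-acted film.' Sentiment: positive",
    "Question: Is the Earth flat? Answer: no",
    "Premise: 'A dog sleeps.' Hypothesis: 'An animal rests.' Label: entailment",
]

retriever = SentenceTransformer("all-MiniLM-L6-v2")  # stand-in retriever
pool_emb = retriever.encode(PROMPT_POOL, convert_to_tensor=True)

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-2.7B")
llm = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-2.7B")
llm.eval()  # frozen LLM: its weights are never updated

def uprise_predict(task_input: str, k: int = 2, max_new_tokens: int = 8) -> str:
    # Step 1: retrieve the k most relevant prompts from the pool.
    query_emb = retriever.encode(task_input, convert_to_tensor=True)
    top_k = util.cos_sim(query_emb, pool_emb)[0].topk(k).indices.tolist()
    # Step 2: concatenate the retrieved prompts with the input and predict.
    context = "\n".join(PROMPT_POOL[i] for i in top_k)
    ids = tokenizer(context + "\n" + task_input, return_tensors="pt").input_ids
    out = llm.generate(ids, max_new_tokens=max_new_tokens, do_sample=False,
                       pad_token_id=tokenizer.eos_token_id)
    return tokenizer.decode(out[0, ids.shape[1]:], skip_special_tokens=True)

print(uprise_predict("Review: 'Dull and overlong.' Sentiment:"))
```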

Central to the proposed approach is the prompt retriever. In the training stage, the frozen LLM supervises the retriever's fine-tuning across a diverse set of tasks. In the inference stage, the trained retriever fetches appropriate prompts for different task types and different LLMs. This cross-task and cross-model paradigm equips UPRISE with universality: the ability to generalize from task types seen during training to unseen ones, without further tuning.
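The paper's full training recipe is more involved, but the core idea of the frozen LLM acting as supervisor can be sketched as follows: each candidate prompt is scored by the probability the LLM assigns to the gold answer when that prompt is prepended, and high and low scorers can then serve as positives and hard negatives for contrastively fine-tuning the retriever. The scoring rule and names below are assumptions for illustration:

```python
# Sketch of LLM-supervised prompt scoring for retriever training.
# The scoring rule and candidate prompts are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-2.7B")
llm = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-2.7B")
llm.eval()  # the supervising LLM stays frozen throughout

@torch.no_grad()
def prompt_score(prompt: str, task_input: str, gold: str) -> float:
    """Mean log-probability of the gold answer given prompt + input."""
    prefix_ids = tokenizer(prompt + "\n" + task_input,
                           return_tensors="pt").input_ids
    gold_ids = tokenizer(gold, return_tensors="pt").input_ids
    logits = llm(torch.cat([prefix_ids, gold_ids], dim=1)).logits
    # Each gold token is predicted from the position just before it.
    logp = torch.log_softmax(logits[0, prefix_ids.shape[1] - 1:-1], dim=-1)
    return logp.gather(1, gold_ids[0].unsqueeze(1)).mean().item()

# Rank candidate prompts for one training example; the top scorers would
# act as positives (and the bottom as hard negatives) in a contrastive
# objective for the retriever. Candidates here are placeholders.
candidates = ["Sentiment example ...", "Entailment example ...", "QA example ..."]
x, y = "Review: 'Dull and overlong.' Sentiment:", " negative"
ranked = sorted(candidates, key=lambda c: prompt_score(c, x, y), reverse=True)
```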


In their empirical study, the team evaluated UPRISE on a variety of natural language understanding tasks. UPRISE outperformed vanilla zero-shot prompting across the experiments and demonstrated strong universality in cross-task and cross-model scenarios. The researchers also note that UPRISE mitigated the hallucination problem that has impaired ChatGPT's performance, suggesting the approach's potential to improve even the strongest LLMs.
The paper UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation is on arXiv.
Author: Hecate He | Editor: Michael Sarazen

Comment (John): Great post! It's fascinating to see how UPRISE improves the zero-shot performance of LLMs, especially in cross-task and cross-model scenarios. I'm curious: how does UPRISE's prompt retrieval process compare to other prompt engineering methods in terms of efficiency and effectiveness?
Author reply: Hi, thanks for your interest. While our research focused on comparisons with vanilla zero-shot prompting, we believe different prompt engineering methods are complementary rather than competing. For instance, incorporating methods such as zero-shot CoT into our prompt pool or instruction templates could enhance UPRISE's performance. We therefore leave such comparisons to future work.