Abstract
GPT-3 has attracted lots of attention due to its superior performance across a wide range of NLP tasks, especially with its in-context learning abilities. Despite its success, we found that the empirical results of GPT-3 depend heavily on the choice of in-context examples. In this work, we investigate whether there are more effective strategies for judiciously selecting in-context examples (relative to random sampling) that better leverage GPT-3's in-context learning capabilities. Inspired by the recent success of leveraging a retrieval module to augment neural networks, we propose to retrieve examples that are semantically-similar to a test query sample to formulate its corresponding prompt. Intuitively, the examples selected with such a strategy may serve as more informative inputs to unleash GPT-3's power of text generation. We evaluate the proposed approach on several natural language understanding and generation benchmarks, where the retrieval-based prompt selection approach consistently outperforms the random selection baseline. Moreover, it is observed that the sentence encoders fine-tuned on task-related datasets yield even more helpful retrieval results. Notably, significant gains are observed on tasks such as table-to-text generation (44.3% on the ToTTo dataset) and open-domain question answering (45.5% on the NQ dataset).
Original language | English (US) |
---|---|
Title of host publication | DeeLIO 2022 - Deep Learning Inside Out |
Subtitle of host publication | 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, Proceedings of the Workshop |
Editors | Eneko Agirre, Marianna Apidianaki, Ivan Vulic |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 100-114 |
Number of pages | 15 |
ISBN (Electronic) | 9781955917322 |
State | Published - 2022 |
Event | Deep Learning Inside Out: 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, DeeLIO 2022 - Virtual, Dublin, Ireland Duration: May 27 2022 → … |
Publication series
Name | DeeLIO 2022 - Deep Learning Inside Out: 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, Proceedings of the Workshop |
---|
Conference
Conference | Deep Learning Inside Out: 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, DeeLIO 2022 |
---|---|
Country/Territory | Ireland |
City | Virtual, Dublin |
Period | 05/27/22 → … |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.
ASJC Scopus subject areas
- Computer Networks and Communications
- Information Systems
- Software
- Hardware and Architecture