Abstract
Open-domain question answering (ODQA) has emerged as a pivotal research spotlight in information systems. Existing methods follow two main paradigms to collect evidence: (1) Theretrieve-then-read paradigm retrieves pertinent documents from an external corpus; and (2) thegenerate-then-read paradigm employs large language models (LLMs) to generate relevant documents. However, neither can fully address multifaceted requirements for evidence. To this end, we propose LLMQA, a generalized framework that formulates the ODQA process into three basic steps: query expansion, document selection, and answer generation, combining the superiority of both retrieval-based and generation-based evidence. Since LLMs exhibit their excellent capabilities to accomplish various tasks, we instruct LLMs to play multiple roles as generators, rerankers, and evaluators within our framework, integrating them to collaborate in the ODQA process. Furthermore, we introduce a novel prompt optimization algorithm to refine role-playing prompts and steer LLMs to produce higher-quality evidence and answers. Extensive experimental results on widely used benchmarks (NQ, WebQ, and TriviaQA) demonstrate that LLMQA achieves the best performance in terms of both answer accuracy and evidence quality, showcasing its potential for advancing ODQA research and applications.
Original language | English (US) |
---|---|
Title of host publication | WWW 2024 - Proceedings of the ACM Web Conference |
Publisher | Association for Computing Machinery, Inc |
Pages | 4372-4382 |
Number of pages | 11 |
ISBN (Electronic) | 9798400701719 |
DOIs | |
State | Published - May 13 2024 |
Event | 33rd ACM Web Conference, WWW 2024 - Singapore, Singapore Duration: May 13 2024 → May 17 2024 |
Publication series
Name | WWW 2024 - Proceedings of the ACM Web Conference |
---|
Conference
Conference | 33rd ACM Web Conference, WWW 2024 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 05/13/24 → 05/17/24 |
Bibliographical note
Publisher Copyright:© 2024 ACM.
Keywords
- prompt optimization
- question answering
- role-playing llms
ASJC Scopus subject areas
- Computer Networks and Communications
- Software