Loading…

Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning

A common problem with adopting Text-to-SQL translation in database systems is poor generalization. Specifically, when there is limited training data on new datasets, existing few-shot Text-to-SQL techniques, even with carefully designed textual prompts on pre-trained language models (PLMs), tend to...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the ACM on management of data 2023-06, Vol.1 (2), p.1-28, Article 147
Main Authors: Gu, Zihui, Fan, Ju, Tang, Nan, Cao, Lei, Jia, Bowen, Madden, Sam, Du, Xiaoyong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A common problem with adopting Text-to-SQL translation in database systems is poor generalization. Specifically, when there is limited training data on new datasets, existing few-shot Text-to-SQL techniques, even with carefully designed textual prompts on pre-trained language models (PLMs), tend to be ineffective. In this paper, we present a divide-and-conquer framework to better support few-shot Text-to-SQL translation, which divides Text-to-SQL translation into two stages (or sub-tasks), such that each sub-task is simpler to be tackled. The first stage, called the structure stage, steers a PLM to generate an SQL structure (including SQL commands such as SELECT, FROM, WHERE and SQL operators such as
ISSN:2836-6573
2836-6573
DOI:10.1145/3589292