2025-11-20T20:49:21.880729

LitE-SQL: A Lightweight and Efficient Text-to-SQL Framework with Vector-based Schema Linking and Execution-Guided Self-Correction

Piao, Lee, Park
The Text-to-SQL task translates natural language questions into SQL queries, enabling intuitive database interaction for non-experts. While recent methods leveraging Large Language Models (LLMs) achieve strong performance, their reliance on proprietary models raise concerns about deployment feasibility and data privacy. In this work, we introduce LitE-SQL, a Lightweight and Efficient framework with two components: (i) a Schema Retriever that performs efficient schema linking using a vector database of pre-computed schema embeddings, and (ii) a SQL Generator fine-tuned in two stages-supervised fine-tuning followed by execution-guided reinforcement-enabling self-correction without costly multi-candidate generation. On BIRD, LitE-SQL achieves 72.10% execution accuracy, and on Spider 1.0 it reaches 88.45%, demonstrating comparable or superior performance to LLM-based methods despite using 2x to 30x fewer parameters. Our findings demonstrate that high-quality Text-to-SQL generation is feasible with lightweight models, offering a practical solution for privacy-sensitive and resource-constrained settings.
academic

LitE-SQL: ๋ฒกํ„ฐ ๊ธฐ๋ฐ˜ ์Šคํ‚ค๋งˆ ๋งํ‚น ๋ฐ ์‹คํ–‰ ์œ ๋„ ์ž์ฒด ์ˆ˜์ •์„ ๊ฐ–์ถ˜ ๊ฒฝ๋Ÿ‰ ํšจ์œจ์  ํ…์ŠคํŠธ-SQL ํ”„๋ ˆ์ž„์›Œํฌ

๊ธฐ๋ณธ ์ •๋ณด

  • ๋…ผ๋ฌธ ID: 2510.09014
  • ์ œ๋ชฉ: LitE-SQL: A Lightweight and Efficient Text-to-SQL Framework with Vector-based Schema Linking and Execution-Guided Self-Correction
  • ์ €์ž: Shengmin Piao, Jieun Lee, Sanghyun Park (์—ฐ์„ธ๋Œ€ํ•™๊ต)
  • ๋ถ„๋ฅ˜: cs.CL (๊ณ„์‚ฐ ์–ธ์–ดํ•™)
  • ๋ฐœํ‘œ ์‹œ๊ฐ„: 2024๋…„ 10์›”
  • ๋…ผ๋ฌธ ๋งํฌ: https://arxiv.org/abs/2510.09014

์ดˆ๋ก

ํ…์ŠคํŠธ-SQL ์ž‘์—…์€ ์ž์—ฐ์–ด ์งˆ๋ฌธ์„ SQL ์ฟผ๋ฆฌ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ๋น„์ „๋ฌธ๊ฐ€ ์‚ฌ์šฉ์ž์—๊ฒŒ ์ง๊ด€์ ์ธ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ƒํ˜ธ์ž‘์šฉ ๋ฐฉ์‹์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM) ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์ด ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด์ง€๋งŒ, ๋…์  ๋ชจ๋ธ์— ๋Œ€ํ•œ ์˜์กด์„ฑ์€ ๋ฐฐํฌ ๊ฐ€๋Šฅ์„ฑ ๋ฐ ๋ฐ์ดํ„ฐ ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ์— ๋Œ€ํ•œ ์šฐ๋ ค๋ฅผ ์•ผ๊ธฐํ•ฉ๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ๋‘ ๊ฐ€์ง€ ํ•ต์‹ฌ ๊ตฌ์„ฑ ์š”์†Œ๋ฅผ ํฌํ•จํ•˜๋Š” ๊ฒฝ๋Ÿ‰ ํšจ์œจ์  ํ”„๋ ˆ์ž„์›Œํฌ์ธ LitE-SQL์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค: (i) ์Šคํ‚ค๋งˆ ๊ฒ€์ƒ‰๊ธฐ(Schema Retriever)๋Š” ์‚ฌ์ „ ๊ณ„์‚ฐ๋œ ์Šคํ‚ค๋งˆ ์ž„๋ฒ ๋”ฉ์˜ ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํšจ์œจ์ ์ธ ์Šคํ‚ค๋งˆ ๋งํ‚น์„ ์ˆ˜ํ–‰ํ•˜๊ณ , (ii) SQL ์ƒ์„ฑ๊ธฐ(SQL Generator)๋Š” ๋น„์šฉ์ด ๋งŽ์ด ๋“œ๋Š” ๋‹ค์ค‘ ํ›„๋ณด ์ƒ์„ฑ ์—†์ด ๋‘ ๋‹จ๊ณ„ ๋ฏธ์„ธ ์กฐ์ •(์ง€๋„ ํ•™์Šต ๋ฏธ์„ธ ์กฐ์ • + ์‹คํ–‰ ์œ ๋„ ๊ฐ•ํ™” ํ•™์Šต)์„ ํ†ตํ•ด ์ž์ฒด ์ˆ˜์ •์„ ๊ตฌํ˜„ํ•ฉ๋‹ˆ๋‹ค. BIRD ๋ฐ์ดํ„ฐ์…‹์—์„œ LitE-SQL์€ 72.10%์˜ ์‹คํ–‰ ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•˜๊ณ , Spider 1.0์—์„œ๋Š” 88.45%๋ฅผ ๋‹ฌ์„ฑํ•˜๋ฉฐ, LLM ๋ฐฉ๋ฒ•์˜ 1/2์—์„œ 1/30์˜ ๋งค๊ฐœ๋ณ€์ˆ˜๋งŒ ์‚ฌ์šฉํ•˜๋ฉด์„œ๋„ ๋™๋“ฑํ•˜๊ฑฐ๋‚˜ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ž…๋‹ˆ๋‹ค.

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋™๊ธฐ

๋ฌธ์ œ ์ •์˜

ํ…์ŠคํŠธ-SQL ์ž‘์—…์€ ์ž์—ฐ์–ด ์งˆ๋ฌธ์„ ํ•ด๋‹นํ•˜๋Š” SQL ์ฟผ๋ฆฌ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ๋น„์ „๋ฌธ๊ฐ€ ์‚ฌ์šฉ์ž๊ฐ€ ๊ตฌ์กฐํ™”๋œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์— ์ ‘๊ทผํ•˜๋Š” ๋ฌธํ„ฑ์„ ๋‚ฎ์ถ”๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค. ์ด ์ž‘์—…์€ ์‹ค์ œ ์‘์šฉ์—์„œ ์ค‘์š”ํ•œ ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง€์ง€๋งŒ, ๋„๋ฉ”์ธ ๊ฐ„ ์ผ๋ฐ˜ํ™” ๋ฐ ๋ณต์žกํ•œ ์ฟผ๋ฆฌ ์ƒ์„ฑ์˜ ๊ณผ์ œ์— ์ง๋ฉดํ•ด ์žˆ์Šต๋‹ˆ๋‹ค.

๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„

  1. LLM ์˜์กด์„ฑ ๋ฌธ์ œ: ํ˜„์žฌ ์ฃผ๋ฅ˜ ๋ฐฉ๋ฒ•์€ GPT-4, Gemini ๋“ฑ์˜ ๋…์  ๋Œ€๊ทœ๋ชจ ๋ชจ๋ธ์— ์˜์กดํ•˜๋ฉฐ, ๋ฐ์ดํ„ฐ ๊ฐœ์ธ์ •๋ณด ์œ ์ถœ ์œ„ํ—˜์ด ์žˆ๊ณ  ๋ฐฐํฌ ๋น„์šฉ์ด ๋†’์Šต๋‹ˆ๋‹ค.
  2. ๊ณ„์‚ฐ ๋ฆฌ์†Œ์Šค ์†Œ๋น„: ์™„์ „ํ•œ ์Šคํ‚ค๋งˆ ์ •๋ณด ์ž…๋ ฅ์œผ๋กœ ์ธํ•œ ์ปจํ…์ŠคํŠธ ๊ธธ์ด ์ฆ๊ฐ€, ์ž์ฒด ์ฃผ์˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ์ด์ฐจ ๋ณต์žก๋„๋กœ ์ธํ•œ ๊ฑฐ๋Œ€ํ•œ ๋ฉ”๋ชจ๋ฆฌ ์†Œ๋น„
  3. ๋‹ค์ค‘ ํ›„๋ณด ์ƒ์„ฑ ์˜ค๋ฒ„ํ—ค๋“œ: ๊ธฐ์กด ๋ฐฉ๋ฒ•์€ ์—ฌ๋Ÿฌ ํ›„๋ณด ์ฟผ๋ฆฌ๋ฅผ ์ƒ์„ฑํ•˜๊ณ  ์ตœ์  ์†”๋ฃจ์…˜์„ ์„ ํƒํ•˜์—ฌ ๊ณ„์‚ฐ ๋น„์šฉ์ด ์ƒ๋‹นํ•ฉ๋‹ˆ๋‹ค.

์—ฐ๊ตฌ ๋™๊ธฐ

์œ„์˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ๋ณธ ๋…ผ๋ฌธ์€ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ๋งค๊ฐœ๋ณ€์ˆ˜ ์ˆ˜์™€ ๊ณ„์‚ฐ ๋น„์šฉ์„ ํฌ๊ฒŒ ์ค„์ด๋Š” ๊ฒฝ๋Ÿ‰ ํšจ์œจ์  ํ…์ŠคํŠธ-SQL ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ๊ฐœ๋ฐœํ•˜์—ฌ ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ์— ๋ฏผ๊ฐํ•˜๊ณ  ๋ฆฌ์†Œ์Šค๊ฐ€ ์ œํ•œ๋œ ์‹œ๋‚˜๋ฆฌ์˜ค์— ์ ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ

  1. LitE-SQL ํ”„๋ ˆ์ž„์›Œํฌ ์ œ์•ˆ: ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ธฐ๋ฐ˜ ์Šคํ‚ค๋งˆ ๋งํ‚น ๋ฐฉ๋ฒ•์„ ์™„์ „ํžˆ ํ™œ์šฉํ•˜๋Š” ์ฒซ ๋ฒˆ์งธ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ๊ฒฝ๋Ÿ‰ SQL ์ƒ์„ฑ๊ธฐ์™€ ๊ฒฐํ•ฉ
  2. ํ˜์‹ ์ ์ธ HN-SupCon ์†์‹ค ํ•จ์ˆ˜: ํ•˜๋“œ ๋„ค๊ฑฐํ‹ฐ๋ธŒ ์ƒ˜ํ”Œ ํ•„ํ„ฐ๋ง์„ ํ†ตํ•œ ์ง€๋„ ๋Œ€์กฐ ํ•™์Šต์œผ๋กœ ์ž„๋ฒ ๋”ฉ ๊ณต๊ฐ„ ์ตœ์ ํ™”
  3. ๋‘ ๋‹จ๊ณ„ ํ›ˆ๋ จ ์ „๋žต: ์ง€๋„ ํ•™์Šต ๋ฏธ์„ธ ์กฐ์ • + ์‹คํ–‰ ์œ ๋„ ๊ฐ•ํ™” ํ•™์Šต์œผ๋กœ ํšจ์œจ์ ์ธ ์ž์ฒด ์˜ค๋ฅ˜ ์ˆ˜์ • ๊ตฌํ˜„
  4. ํ˜„์ €ํ•œ ํšจ์œจ์„ฑ ํ–ฅ์ƒ: BIRD ๋ฐ Spider 1.0 ๋ฐ์ดํ„ฐ์…‹์—์„œ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ ๋‹ฌ์„ฑ, ๋งค๊ฐœ๋ณ€์ˆ˜๋Š” ๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ 1/2์—์„œ 1/30

๋ฐฉ๋ฒ• ์ƒ์„ธ ์„ค๋ช…

์ž‘์—… ์ •์˜

์ž์—ฐ์–ด ์งˆ๋ฌธ Q์™€ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์Šคํ‚ค๋งˆ S๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ, ํ…์ŠคํŠธ-SQL ์ž‘์—…์€ ๋ชฉํ‘œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์—์„œ ์‹คํ–‰ ๊ฒฐ๊ณผ๊ฐ€ ๊ธˆํ‘œ์ค€ ์ฟผ๋ฆฌ์™€ ์ผ์น˜ํ•˜๋Š” SQL ์ฟผ๋ฆฌ๋ฅผ ์ƒ์„ฑํ•˜๋„๋ก ์š”๊ตฌํ•ฉ๋‹ˆ๋‹ค.

๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜

1. ์Šคํ‚ค๋งˆ ๊ฒ€์ƒ‰๊ธฐ(Schema Retriever)

ํ•ต์‹ฌ ์„ค๊ณ„:

  • ๊ฐ ์—ด์„ ์—ด ์ด๋ฆ„, ์„ค๋ช…, ํ…Œ์ด๋ธ” ์ด๋ฆ„ ๋ฐ ๊ฐ’ ์„ค๋ช…์„ ํฌํ•จํ•˜๋Š” ๋ฐ€์ง‘ ์ž„๋ฒ ๋”ฉ์œผ๋กœ ์ธ์ฝ”๋”ฉ
  • ์Šคํ‚ค๋งˆ ์ž„๋ฒ ๋”ฉ์„ ์‚ฌ์ „ ๊ณ„์‚ฐํ•˜๊ณ  ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์— ์ €์žฅ
  • ์ถ”๋ก  ์‹œ ์งˆ๋ฌธ๋งŒ ์ธ์ฝ”๋”ฉํ•˜๊ณ  ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„๋ฅผ ํ†ตํ•ด ์ƒ์œ„ k๊ฐœ ๊ด€๋ จ ์—ด ๊ฒ€์ƒ‰

HN-SupCon ์†์‹ค ํ•จ์ˆ˜:

L_HN-SupCon = -1/B โˆ‘(i=1 to B) log(e^(s(qi,pi)/ฯ„) / Zi)

Zi = e^(s(qi,pi)/ฯ„) + โˆ‘(j=1 to Ni) mij * e^(s(qi,nij)/ฯ„)

mij = {1 if qiโŠ™nij โ‰ฅ qiโŠ™pi - 0.1, 0 otherwise}

์—ฌ๊ธฐ์„œ s(ยท,ยท)๋Š” ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„๋ฅผ ๋‚˜ํƒ€๋‚ด๊ณ , ฯ„๋Š” ์˜จ๋„ ๋งค๊ฐœ๋ณ€์ˆ˜์ด๋ฉฐ, mij๋Š” ๋‹จ์ˆœ ๋„ค๊ฑฐํ‹ฐ๋ธŒ ์ƒ˜ํ”Œ์„ ํ•„ํ„ฐ๋งํ•˜๊ณ  ์˜๋ฏธ๋ก ์ ์œผ๋กœ ์œ ์‚ฌํ•˜์ง€๋งŒ ๊ธฐ๋Šฅ์ ์œผ๋กœ ๋ฌด๊ด€ํ•œ ํ•˜๋“œ ๋„ค๊ฑฐํ‹ฐ๋ธŒ ์ƒ˜ํ”Œ์— ์ดˆ์ ์„ ๋งž์ถ”๋Š” ๋งˆ์Šคํฌ ํ•จ์ˆ˜์ž…๋‹ˆ๋‹ค.

2. SQL ์ƒ์„ฑ๊ธฐ(SQL Generator)

๋‘ ๋‹จ๊ณ„ ํ›ˆ๋ จ ์ „๋žต:

๋‹จ๊ณ„ 1: ์ง€๋„ ํ•™์Šต ๋ฏธ์„ธ ์กฐ์ •(SFT)

L_SFT(ฮธ) = -log P(SQL | Q, S; ฮธ)
  • ์ž์—ฐ์–ด ์งˆ๋ฌธ ๋ฐ ์Šคํ‚ค๋งˆ ์ •๋ณด์—์„œ SQL ์ฟผ๋ฆฌ๋กœ์˜ ์กฐ๊ฑด๋ถ€ ๋งคํ•‘ ํ•™์Šต
  • ๋ฌด๊ด€ํ•œ ์Šคํ‚ค๋งˆ ์ •๋ณด๋ฅผ ๋ฌด์ž‘์œ„๋กœ ์ƒ˜ํ”Œ๋งํ•˜์—ฌ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•์„ ์ˆ˜ํ–‰ํ•˜์—ฌ ํ›ˆ๋ จ๊ณผ ์ถ”๋ก ์˜ ์ผ๊ด€์„ฑ ๋ณด์žฅ

๋‹จ๊ณ„ 2: ๊ฐ•ํ™” ๋ฏธ์„ธ ์กฐ์ •(RFT) ์ง์ ‘ ์„ ํ˜ธ๋„ ์ตœ์ ํ™”(DPO) ์‚ฌ์šฉ:

L_RFT(ฯ€ฮธ;ฯ€0) = L_DPO(y^w_i, y^l_i|xi) + ฮฑL_NLL(y^w_i|xi)
  • ์‹คํ–‰ ๊ฒฐ๊ณผ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์„ ํ˜ธ๋„ ์Œ ๊ตฌ์„ฑ: ์„ฑ๊ณต์ ์œผ๋กœ ์‹คํ–‰๋œ ์ฟผ๋ฆฌ๊ฐ€ ์‹คํŒจํ•œ ์ฟผ๋ฆฌ๋ณด๋‹ค ์šฐ์ˆ˜
  • ์˜ค๋ฅ˜ ๋ฉ”์‹œ์ง€์™€ ๊ฒฐํ•ฉํ•˜์—ฌ ์ž์ฒด ์ˆ˜์ • ํ›ˆ๋ จ ์ˆ˜ํ–‰

๊ธฐ์ˆ  ํ˜์‹  ํฌ์ธํŠธ

  1. ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ธฐ๋ฐ˜ ์Šคํ‚ค๋งˆ ๋งํ‚น: ๊ธฐ์กด ๋ฐฉ๋ฒ•์ด ๋งค๋ฒˆ ์Šคํ‚ค๋งˆ๋ฅผ ๋‹ค์‹œ ์ธ์ฝ”๋”ฉํ•˜๋Š” ๊ฒƒ๊ณผ ๋‹ฌ๋ฆฌ, ๋ณธ ๋ฐฉ๋ฒ•์€ ์งˆ๋ฌธ๋งŒ ์ธ์ฝ”๋”ฉํ•˜์—ฌ ํšจ์œจ์„ฑ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ
  2. ํ•˜๋“œ ๋„ค๊ฑฐํ‹ฐ๋ธŒ ์ƒ˜ํ”Œ ํ•„ํ„ฐ๋ง ๋ฉ”์ปค๋‹ˆ์ฆ˜: HN-SupCon ์†์‹ค์€ ์˜๋ฏธ๋ก ์ ์œผ๋กœ ์œ ์‚ฌํ•˜์ง€๋งŒ ๊ธฐ๋Šฅ์ ์œผ๋กœ ๋ฌด๊ด€ํ•œ ์—ด์„ ๊ตฌ๋ถ„ํ•˜๋Š” ๋ฐ ์ดˆ์ ์„ ๋งž์ถฐ ๊ฒ€์ƒ‰ ํ’ˆ์งˆ ํ–ฅ์ƒ
  3. ์‹คํ–‰ ์œ ๋„ ์ž์ฒด ์ˆ˜์ •: SQL ์‹คํ–‰ ํ”ผ๋“œ๋ฐฑ์„ ํ™œ์šฉํ•œ ๊ฐ•ํ™” ํ•™์Šต์œผ๋กœ ๋‹ค์ค‘ ํ›„๋ณด ์ƒ์„ฑ์˜ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ ํšŒํ”ผ

์‹คํ—˜ ์„ค์ •

๋ฐ์ดํ„ฐ์…‹

  • BIRD: 95๊ฐœ์˜ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค, 37๊ฐœ์˜ ์ „๋ฌธ ๋ถ„์•ผ, 9,376๊ฐœ ํ›ˆ๋ จ ์ƒ˜ํ”Œ, 1,534๊ฐœ ๊ฒ€์ฆ ์ƒ˜ํ”Œ
  • Spider 1.0: 200๊ฐœ์˜ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค, 138๊ฐœ์˜ ๋„๋ฉ”์ธ, 8,659๊ฐœ ํ›ˆ๋ จ ์ƒ˜ํ”Œ, 1,034๊ฐœ ๊ฒ€์ฆ ์ƒ˜ํ”Œ, 2,147๊ฐœ ํ…Œ์ŠคํŠธ ์ƒ˜ํ”Œ

ํ‰๊ฐ€ ์ง€ํ‘œ

  1. ์‹คํ–‰ ์ •ํ™•๋„(EX): ์˜ˆ์ธก SQL๊ณผ ๊ธˆํ‘œ์ค€ SQL์˜ ์‹คํ–‰ ๊ฒฐ๊ณผ ์ผ์น˜์„ฑ
  2. ์ฐธ ์–‘์„ฑ์œจ(TPR): ๊ฒ€์ƒ‰๋œ ๊ด€๋ จ ์—ด์ด ๊ธˆํ‘œ์ค€ ๊ด€๋ จ ์—ด์—์„œ ์ฐจ์ง€ํ•˜๋Š” ๋น„์œจ
  3. ๊ฑฐ์ง“ ์–‘์„ฑ์œจ(FPR): ๊ฒ€์ƒ‰๋œ ๋ฌด๊ด€ ์—ด์ด ์ด ๊ฒ€์ƒ‰ ์—ด์—์„œ ์ฐจ์ง€ํ•˜๋Š” ๋น„์œจ
  4. ์Šคํ‚ค๋งˆ ๋งํ‚น ์žฌํ˜„์œจ(SLR): ๋ชจ๋“  ๊ด€๋ จ ์—ด์„ ์™„์ „ํžˆ ๊ฒ€์ƒ‰ํ•œ ์ฟผ๋ฆฌ์˜ ๋น„์œจ

๋น„๊ต ๋ฐฉ๋ฒ•

  • ๋ฌธ๋งฅ ํ•™์Šต ๋ฐฉ๋ฒ•: ChatGPT+CoT, DIN-SQL, DAIL-SQL, CHESS, CHASE-SQL ๋“ฑ
  • ๋ฏธ์„ธ ์กฐ์ • ๋ฐฉ๋ฒ•: CodeS, OmniSQL, DTS-SQL, Reasoning-SQL ๋“ฑ

๊ตฌํ˜„ ์„ธ๋ถ€์‚ฌํ•ญ

  • ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ: Qwen3-0.6B-Embedding
  • SQL ์ƒ์„ฑ๊ธฐ: Qwen2.5-Coder (1.5B, 3B, 7B)
  • ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค: ChromaDB
  • ํ›ˆ๋ จ ์„ค์ •: 4๊ฐœ A100 GPU, AdamW ์ตœ์ ํ™”๊ธฐ, LoRA ์–ด๋Œ‘ํ„ฐ

์‹คํ—˜ ๊ฒฐ๊ณผ

์ฃผ์š” ๊ฒฐ๊ณผ

๋ฐฉ๋ฒ• ๋ฒ”์ฃผ๋ชจ๋ธ๋งค๊ฐœ๋ณ€์ˆ˜BIRD(Dev) EXSpider 1.0(Test) EX
๋ฌธ๋งฅ ํ•™์Šต
CHASE-SQLGemini 1.5200B73.0187.60
MCS-SQLGPT-4175B63.3689.60
๋ฏธ์„ธ ์กฐ์ • ๋ฐฉ๋ฒ•
Reasoning-SQLQwen2.5-Coder-14B14B72.2981.43
LitE-SQLQwen2.5-Coder-7B7B72.1088.45

์ฃผ์š” ๋ฐœ๊ฒฌ

  1. ๋งค๊ฐœ๋ณ€์ˆ˜ ํšจ์œจ์„ฑ: 7B ๋ชจ๋ธ์ด ๋Œ€๋ถ€๋ถ„์˜ 175B-200B ๋งค๊ฐœ๋ณ€์ˆ˜ LLM ๋ฐฉ๋ฒ•์„ ์ดˆ๊ณผ
  2. ๋„๋ฉ”์ธ ๊ฐ„ ์ผ๋ฐ˜ํ™”: BIRD์—์„œ MCS-SQL์„ 8.74% ์ดˆ๊ณผ, Spider์—์„œ 1.15%๋งŒ ๋’ค์ง
  3. ์ผ๊ด€๋œ ์„ฑ๋Šฅ: ๋™์ผ ๊ทœ๋ชจ ๋ฏธ์„ธ ์กฐ์ • ๋ฐฉ๋ฒ• ๋Œ€๋น„ ํ‰๊ท  10.87%(BIRD) ๋ฐ 7.21%(Spider) ํ–ฅ์ƒ

์†Œ๊ฑฐ ์‹คํ—˜

๊ตฌ์„ฑ ์š”์†Œ ์„ค์ •BIRD EXSpider EXํ–ฅ์ƒ๋„
๊ธฐ์ค€์„ (๊ฒ€์ƒ‰๊ธฐ + ์ƒ์„ฑ๊ธฐ ์—†์Œ)39.3161.61-
+์Šคํ‚ค๋งˆ ๊ฒ€์ƒ‰๊ธฐ43.1664.28+3.85/+2.67
+SFT58.2183.56+18.90/+21.95
+RFT60.5684.35+21.25/+22.74

์Šคํ‚ค๋งˆ ๋งํ‚น ์„ฑ๋Šฅ ๋ถ„์„

๊ธฐ์ค€์„  ๋ฐฉ๋ฒ•๊ณผ์˜ ๋น„๊ต(BIRD ๋ฐ์ดํ„ฐ์…‹ ๋ถ€๋ถ„ ์ƒ˜ํ”Œ๋ง):

  • LitE-SQL: TPR=95.23%, FPR=80.28%, SLR=82.31%, EX=56.46%
  • CHESS: TPR=87.15%, FPR=8.27%, SLR=61.9%, EX=57.14%
  • CodeS: TPR=89.64%, FPR=74.16%, SLR=65.31%, EX=51.70%

FPR์ด ๋†’์Œ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ , SLR์˜ ์šฐ์œ„๊ฐ€ ๊ฑฐ์ง“ ์–‘์„ฑ์˜ ์˜ํ–ฅ์„ ๋ณด์ƒํ•˜๋ฉฐ, 0.6B ๋งค๊ฐœ๋ณ€์ˆ˜๋งŒ์œผ๋กœ 200B ๋ชจ๋ธ๊ณผ ๋™๋“ฑํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค.

์ž์ฒด ์ˆ˜์ • ํšจ๊ณผ ๋ถ„์„

  • ๋ฐ˜๋ณต ์ˆ˜์ต ๊ฐ์†Œ: ์ฒซ ๋ฒˆ์งธ ์ž์ฒด ์ˆ˜์ •์ด ์ตœ๋Œ€ ํ–ฅ์ƒ์„ ๊ฐ€์ ธ์˜ค๊ณ , ํ›„์† ๋ฐ˜๋ณต์€ ์ˆ˜์ต์ด ์ ์ง„์ ์œผ๋กœ ๊ฐ์†Œ
  • ์˜ค๋ฅ˜ ์œ ํ˜• ๊ฐœ์„ : ๊ตฌ๋ฌธ ์˜ค๋ฅ˜, ์—ด ์กด์žฌ ์•ˆ ํ•จ, ํ…Œ์ด๋ธ” ์กด์žฌ ์•ˆ ํ•จ ๋“ฑ์˜ ์˜ค๋ฅ˜ ์œ ํ˜•์ด ๋ชจ๋‘ ํ˜„์ €ํžˆ ๊ฐ์†Œ
  • ๊ทœ๋ชจ ํšจ๊ณผ: ๋” ํฐ ๋ชจ๋ธ์ด ์˜๋ฏธ๋ก ์  ์ •๋ ฌ ์ธก๋ฉด์—์„œ ๋” ๋งŽ์€ ์ด์ ์„ ์–ป์Œ

๊ด€๋ จ ์—ฐ๊ตฌ

์Šคํ‚ค๋งˆ ๋งํ‚น ์—ฐ๊ตฌ

  1. ์ดˆ๊ธฐ ๋ฐฉ๋ฒ•: ๋ถ„๋ฅ˜๊ธฐ ๊ธฐ๋ฐ˜ ์—ด ์ˆœ์œ„ ์ง€์ •
  2. LLM ๋ฐฉ๋ฒ•: ๋‹ค๋‹จ๊ณ„ ํ”„๋กฌํ”„ํŒ…, ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ”„๋ ˆ์ž„์›Œํฌ(CHESS)
  3. ๋ณธ ๋…ผ๋ฌธ ํ˜์‹ : ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๊ธฐ๋ฐ˜ ์Šคํ‚ค๋งˆ ๋งํ‚น์˜ ์ฒซ ๋ฒˆ์งธ ์™„์ „ ๊ตฌํ˜„

SQL ์ƒ์„ฑ ์—ฐ๊ตฌ

  1. ๋ฌธ๋งฅ ํ•™์Šต: ๊ตฌ์กฐํ™”๋œ ํ”„๋กฌํ”„ํŒ…, ์†Œ์ˆ˜ ์ƒ˜ํ”Œ ํ•™์Šต, ์ž์ฒด ์ผ๊ด€์„ฑ
  2. ๋ฏธ์„ธ ์กฐ์ • ๋ฐฉ๋ฒ•: ๋„๋ฉ”์ธ ์ ์‘, ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•, ์ž‘์—… ๋ถ„ํ•ด
  3. ๋ณธ ๋…ผ๋ฌธ ๊ธฐ์—ฌ: ์‹คํ–‰ ์œ ๋„ ๊ฐ•ํ™” ํ•™์Šต ์ž์ฒด ์ˆ˜์ • ๋ฉ”์ปค๋‹ˆ์ฆ˜

๊ฒฐ๋ก  ๋ฐ ๋…ผ์˜

์ฃผ์š” ๊ฒฐ๋ก 

  1. ๊ฒฝ๋Ÿ‰ ๊ฐ€๋Šฅ์„ฑ: ๊ณ ํ’ˆ์งˆ ํ…์ŠคํŠธ-SQL ์ƒ์„ฑ์ด ๊ฒฝ๋Ÿ‰ ๋ชจ๋ธ์„ ํ†ตํ•ด ๊ตฌํ˜„ ๊ฐ€๋Šฅํ•จ์„ ์ฆ๋ช…
  2. ํšจ์œจ์„ฑ๊ณผ ์„ฑ๋Šฅ ๊ท ํ˜•: ๋งค๊ฐœ๋ณ€์ˆ˜ ์ˆ˜๋ฅผ ํ˜„์ €ํžˆ ์ค„์ด๋ฉด์„œ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ ์œ ์ง€
  3. ์‹ค์šฉ์  ๊ฐ€์น˜: ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ์— ๋ฏผ๊ฐํ•˜๊ณ  ๋ฆฌ์†Œ์Šค๊ฐ€ ์ œํ•œ๋œ ์‹œ๋‚˜๋ฆฌ์˜ค์— ์‹ค์šฉ์  ์†”๋ฃจ์…˜ ์ œ๊ณต

ํ•œ๊ณ„

  1. ๊ณ ์ • k๊ฐ’ ๋ฌธ์ œ: ๊ณ ์ • ์ˆ˜์˜ ์—ด ๊ฒ€์ƒ‰์€ ๋ถˆ๊ฐ€ํ”ผํ•˜๊ฒŒ ๊ฑฐ์ง“ ์–‘์„ฑ ๋„์ž…
  2. ์˜๋ฏธ๋ก ์  ์˜ค๋ฅ˜ ๊ฐ์ง€: ํ˜„์žฌ ์ž์ฒด ์ˆ˜์ • ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ์ฃผ๋กœ ๊ตฌ๋ฌธ ์˜ค๋ฅ˜๋ฅผ ์ฒ˜๋ฆฌํ•˜๋ฉฐ, ์˜๋ฏธ๋ก ์ ์œผ๋กœ ์˜ฌ๋ฐ”๋ฅด์ง€๋งŒ ๋…ผ๋ฆฌ์ ์œผ๋กœ ์ž˜๋ชป๋œ ์ฟผ๋ฆฌ์— ๋Œ€ํ•œ ํšจ๊ณผ๊ฐ€ ์ œํ•œ์ 

ํ–ฅํ›„ ๋ฐฉํ–ฅ

  1. ๋™์  ๊ฒ€์ƒ‰ ์ „๋žต: ์งˆ๋ฌธ ๋ณต์žก๋„์— ๋”ฐ๋ผ ๊ฒ€์ƒ‰ ์—ด ์ˆ˜๋ฅผ ์ž์ ์‘์ ์œผ๋กœ ์กฐ์ •
  2. ์˜๋ฏธ๋ก ์  ์˜ค๋ฅ˜ ๊ฐ์ง€: ์˜๋ฏธ๋ก ์  ์˜ค๋ฅ˜๋ฅผ ํฌ์ฐฉํ•˜๋Š” ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๊ฐœ๋ฐœ
  3. ๋‹ค์ค‘ ๋ชจ๋‹ฌ ํ™•์žฅ: ํ…Œ์ด๋ธ” ๋‚ด์šฉ ๋ฐ ์Šคํ‚ค๋งˆ ์ •๋ณด ๊ฒฐํ•ฉ

์‹ฌ์ธต ํ‰๊ฐ€

์žฅ์ 

  1. ๋†’์€ ํ˜์‹ ์„ฑ: ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค๋ฅผ ํ…์ŠคํŠธ-SQL์˜ ์Šคํ‚ค๋งˆ ๋งํ‚น์— ์ฒด๊ณ„์ ์œผ๋กœ ์ ์šฉํ•œ ์ฒซ ๋ฒˆ์งธ ์‚ฌ๋ก€
  2. ๋†’์€ ์‹ค์šฉ์  ๊ฐ€์น˜: LLM ๋ฐฉ๋ฒ•์˜ ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ ๋ฐ ๋ฐฐํฌ ๋ฌธ์ œ ํ•ด๊ฒฐ
  3. ์ถฉ๋ถ„ํ•œ ์‹คํ—˜: ํฌ๊ด„์ ์ธ ์†Œ๊ฑฐ ์‹คํ—˜ ๋ฐ ์˜ค๋ฅ˜ ๋ถ„์„
  4. ๊ฒฌ๊ณ ํ•œ ๊ธฐ์ˆ : HN-SupCon ์†์‹ค ๋ฐ ๋‘ ๋‹จ๊ณ„ ํ›ˆ๋ จ ์ „๋žต์˜ ํ•ฉ๋ฆฌ์  ์„ค๊ณ„

๋ถ€์กฑํ•œ ์ 

  1. ๋‹จ์ˆœํ•œ ๊ฒ€์ƒ‰ ์ „๋žต: ๊ณ ์ • k๊ฐ’ ๊ฒ€์ƒ‰์ด ์ตœ์  ์ „๋žต์ด ์•„๋‹ ์ˆ˜ ์žˆ์Œ
  2. ์˜ค๋ฅ˜ ์œ ํ˜• ์ œํ•œ: ์ž์ฒด ์ˆ˜์ •์ด ์ฃผ๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅ ๊ฐ์ง€ ์˜ค๋ฅ˜์— ์ดˆ์ 
  3. ๋ฐ์ดํ„ฐ์…‹ ์ œํ•œ: ์ฃผ๋กœ ์˜์–ด ๋ฐ์ดํ„ฐ์…‹์—์„œ ๊ฒ€์ฆ, ๋‹ค๊ตญ์–ด ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ ๋ฏธ์ง€์ˆ˜

์˜ํ–ฅ๋ ฅ

  1. ํ•™์ˆ ์  ๊ฐ€์น˜: ๊ฒฝ๋Ÿ‰ ํ…์ŠคํŠธ-SQL ์—ฐ๊ตฌ์— ์ƒˆ๋กœ์šด ์‚ฌ๊ณ  ์ œ๊ณต
  2. ์‹ค์šฉ์  ๊ฐ€์น˜: ์—ฃ์ง€ ์ปดํ“จํŒ… ๋ฐ ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ ์‹œ๋‚˜๋ฆฌ์˜ค์— ์ ์šฉ ๊ฐ€๋Šฅ
  3. ์žฌํ˜„์„ฑ: ์˜คํ”ˆ ์†Œ์Šค ๋ชจ๋ธ ๊ธฐ๋ฐ˜์œผ๋กœ ์žฌํ˜„ ๋ฐ ํ™•์žฅ ์šฉ์ด

์ ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค

  1. ๋ฆฌ์†Œ์Šค ์ œํ•œ ํ™˜๊ฒฝ: ์—ฃ์ง€ ๋””๋ฐ”์ด์Šค, ๋ชจ๋ฐ”์ผ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜
  2. ๊ฐœ์ธ์ •๋ณด ๋ณดํ˜ธ ๋ฏผ๊ฐ ์‹œ๋‚˜๋ฆฌ์˜ค: ๊ธฐ์—… ๋‚ด๋ถ€ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค, ์˜๋ฃŒ ๊ธˆ์œต ๋“ฑ ๋ถ„์•ผ
  3. ์‹ค์‹œ๊ฐ„ ์‘์šฉ: ๋น ๋ฅธ ์‘๋‹ต์ด ํ•„์š”ํ•œ ๋Œ€ํ™”ํ˜• ์ฟผ๋ฆฌ ์‹œ์Šคํ…œ

์ฐธ๊ณ  ๋ฌธํ—Œ

๋…ผ๋ฌธ์€ ํ…์ŠคํŠธ-SQL ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์—ฐ๊ตฌ๋ฅผ ์ธ์šฉํ•˜๊ณ  ์žˆ์œผ๋ฉฐ, ๋‹ค์Œ์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค:

  • Spider ๋ฐ BIRD ๋ฒค์น˜๋งˆํฌ ๋ฐ์ดํ„ฐ์…‹์˜ ์›๋ณธ ๋…ผ๋ฌธ
  • ์ฃผ์š” LLM ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•(DIN-SQL, CHESS, CHASE-SQL ๋“ฑ)
  • ๋ฏธ์„ธ ์กฐ์ • ๋ฐฉ๋ฒ•์˜ ๋Œ€ํ‘œ ์—ฐ๊ตฌ(CodeS, OmniSQL ๋“ฑ)
  • ๊ด€๋ จ ๊ธฐ์ˆ  ๊ธฐ์ดˆ(DPO, LoRA, ๋Œ€์กฐ ํ•™์Šต ๋“ฑ)