2025-11-19T01:19:13.619140

An approach for systematic decomposition of complex llm tasks

Zhou, Xu, Liu et al.
Large Language Models (LLMs) suffer from reliability issues on complex tasks, as existing decomposition methods are heuristic and rely on agent or manual decomposition. This work introduces a novel, systematic decomposition framework that we call Analysis of CONstraint-Induced Complexity (ACONIC), which models the task as a constraint problem and leveraging formal complexity measures to guide decomposition. On combinatorial (SATBench) and LLM database querying tasks (Spider), we find that by decomposing the tasks following the measure of complexity, agent can perform considerably better (10-40 percentage point).
academic

๋ณต์žกํ•œ LLM ์ž‘์—…์˜ ์ฒด๊ณ„์  ๋ถ„ํ•ด ์ ‘๊ทผ๋ฒ•

๊ธฐ๋ณธ ์ •๋ณด

  • ๋…ผ๋ฌธ ID: 2510.07772
  • ์ œ๋ชฉ: An Approach for Systematic Decomposition of Complex LLM Tasks
  • ์ €์ž: Tianle Zhou, Jiakai Xu, Guanhong Liu, Jiaxiang Liu, Haonan Wang, Eugene Wu (Columbia University)
  • ๋ถ„๋ฅ˜: cs.AI
  • ๋ฐœํ‘œ ์‹œ๊ฐ„: 2025๋…„ 10์›” 13์ผ (arXiv v2)
  • ๋…ผ๋ฌธ ๋งํฌ: https://arxiv.org/abs/2510.07772v2

์ดˆ๋ก

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLMs)์€ ๋ณต์žกํ•œ ์ž‘์—…์—์„œ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ๋ณด์ด๊ณ  ์žˆ์œผ๋ฉฐ, ๊ธฐ์กด์˜ ๋ถ„ํ•ด ๋ฐฉ๋ฒ•์€ ํœด๋ฆฌ์Šคํ‹ฑ์— ์˜์กดํ•˜๊ฑฐ๋‚˜ ์—์ด์ „ํŠธ ๋˜๋Š” ์ˆ˜๋™ ๋ถ„ํ•ด์— ์˜์กดํ•ฉ๋‹ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ์ œ์•ฝ ์กฐ๊ฑด ์œ ๋„ ๋ณต์žก๋„ ๋ถ„์„(ACONIC)์ด๋ผ๋Š” ์ƒˆ๋กœ์šด ์ฒด๊ณ„์  ๋ถ„ํ•ด ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค. ์ด ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ์ž‘์—…์„ ์ œ์•ฝ ์กฐ๊ฑด ๋ฌธ์ œ๋กœ ๋ชจ๋ธ๋งํ•˜๊ณ  ํ˜•์‹ํ™”๋œ ๋ณต์žก๋„ ์ธก์ •์„ ํ™œ์šฉํ•˜์—ฌ ๋ถ„ํ•ด๋ฅผ ์ง€๋„ํ•ฉ๋‹ˆ๋‹ค. ์กฐํ•ฉ ๋ฌธ์ œ(SAT-Bench)์™€ LLM ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ฟผ๋ฆฌ ์ž‘์—…(Spider)์—์„œ ๋ณต์žก๋„ ์ธก์ •์— ๋”ฐ๋ผ ์ž‘์—…์„ ๋ถ„ํ•ดํ•จ์œผ๋กœ์จ ์—์ด์ „ํŠธ์˜ ์„ฑ๋Šฅ์ด ํ˜„์ €ํžˆ ํ–ฅ์ƒ๋˜์—ˆ์Šต๋‹ˆ๋‹ค(10-40 ๋ฐฑ๋ถ„ํฌ์ธํŠธ).

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋™๊ธฐ

1. ํ•ด๊ฒฐํ•ด์•ผ ํ•  ๋ฌธ์ œ

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์€ ๊นŠ์ด ์žˆ๋Š” ๋‹ค๋‹จ๊ณ„ ์ถ”๋ก ์ด๋‚˜ ์กฐํ•ฉ ํƒ์ƒ‰์ด ํ•„์š”ํ•œ ๋ณต์žกํ•œ ์ž‘์—…์„ ์ฒ˜๋ฆฌํ•  ๋•Œ ๋‹จ์ผ ์ˆœ์ „ํŒŒ์—์„œ ์˜ฌ๋ฐ”๋ฅธ ๊ฒฐ๊ณผ๋ฅผ ์ƒ์„ฑํ•˜์ง€ ๋ชปํ•˜๋Š” ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ๋ณด์ž…๋‹ˆ๋‹ค.

2. ๋ฌธ์ œ์˜ ์ค‘์š”์„ฑ

LLMs์ด ๋‹ค์–‘ํ•œ ์ถ”๋ก , ํ”„๋กœ๊ทธ๋ž˜๋ฐ ๋ฐ ๋ฌธ์ œ ํ•ด๊ฒฐ ์ž‘์—…์—์„œ ๊ด‘๋ฒ”์œ„ํ•˜๊ฒŒ ์ ์šฉ๋˜๋ฉด์„œ, ๋ณต์žกํ•œ ์ž‘์—…์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋ถ„ํ•ดํ•˜์—ฌ ๋ชจ๋ธ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•์ด ํ•ต์‹ฌ ๊ณผ์ œ๊ฐ€ ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ๊ธฐ์กด ๋ฐฉ๋ฒ•์€ ์›์น™์ ์ธ ๋ณต์žก๋„ ์ธก์ • ๋ฐ ๋ถ„ํ•ด ์ „๋žต์ด ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.

3. ๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„

  • ํœด๋ฆฌ์Šคํ‹ฑ ๋ถ„ํ•ด: Chain-of-Thought์™€ ๊ฐ™์€ ๊ธฐ์กด ๋ฐฉ๋ฒ•์€ ์ฃผ๋กœ LLM ์ž์ฒด์˜ ๋ถ„ํ•ด์— ์˜์กดํ•˜๋ฉฐ ์ด๋ก ์  ๊ธฐ์ดˆ๊ฐ€ ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.
  • ์ˆ˜๋™ ๋ถ„ํ•ด: ์˜์—ญ ์ „๋ฌธ๊ฐ€๊ฐ€ ์ˆ˜๋™์œผ๋กœ ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ์„ค๊ณ„ํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์ฒด๊ณ„์„ฑ์ด ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.
  • ๋ณต์žก๋„ ์ธก์ • ๋ถ€์žฌ: ์ž‘์—… ๋ณต์žก๋„๋ฅผ ์ •๋Ÿ‰ํ™”ํ•  ์ˆ˜ ์—†์–ด ๋ถ„ํ•ด ํ•„์š”์„ฑ๊ณผ ๋ฐฉ๋ฒ•์„ ๊ฒฐ์ •ํ•˜๊ธฐ ์–ด๋ ต์Šต๋‹ˆ๋‹ค.

4. ์—ฐ๊ตฌ ๋™๊ธฐ

ํ˜•์‹ํ™”๋œ ์ž‘์—… ๋ณต์žก๋„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ˆ˜๋ฆฝํ•˜์—ฌ ์ฒด๊ณ„์ ์ธ ๋ถ„ํ•ด ์ „๋žต์„ ์ œ๊ณตํ•˜๊ณ , ๋น„๊ต ๊ฐ€๋Šฅํ•œ ๋‚œ์ด๋„์˜ ์ž‘์—… ์—ฐ๊ตฌ ๋Šฅ๋ ฅ์„ ์ œ๊ณตํ•˜๋ฉฐ, ๋„๊ตฌ ๋ณด์กฐ๊ฐ€ ํ•„์š”ํ•œ ์‹œ์ ์„ ์ง€๋„ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ

  1. ACONIC ํ”„๋ ˆ์ž„์›Œํฌ ์ œ์‹œ: LLM ์ž‘์—…์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ ๋ฌธ์ œ๋กœ ์ถ•์•ฝํ•˜๋Š” ์ฒซ ๋ฒˆ์งธ ํ˜•์‹ํ™”๋œ ๋ณต์žก๋„ ํ”„๋ ˆ์ž„์›Œํฌ
  2. ๋ณต์žก๋„ ์ธก์ • ์ˆ˜๋ฆฝ: ์ œ์•ฝ ์กฐ๊ฑด ๊ทธ๋ž˜ํ”„์˜ ๊ทธ๋ž˜ํ”„ ํฌ๊ธฐ ๋ฐ ํŠธ๋ฆฌ ๋„ˆ๋น„๋ฅผ ์ž‘์—… ๋ณต์žก๋„์˜ ์ธก์ • ๊ธฐ์ค€์œผ๋กœ ์‚ฌ์šฉ
  3. ์ฒด๊ณ„์  ๋ถ„ํ•ด ๋ฐฉ๋ฒ•: ํŠธ๋ฆฌ ๋ถ„ํ•ด ๊ธฐ๋ฐ˜ ๋ถ„ํ•ด ์ „๋žต์œผ๋กœ ๋ถ€๋ถ„ ์ž‘์—… ๋ณต์žก๋„๋ฅผ ์ตœ์†Œํ™”ํ•˜๋ฉด์„œ ์ „์—ญ ๋งŒ์กฑ์„ฑ ์œ ์ง€
  4. ์‹ค์ฆ ๊ฒ€์ฆ: SAT-Bench ๋ฐ Spider ๋ฒค์น˜๋งˆํฌ์—์„œ ๋ณต์žก๋„ ์ธก์ •์ด ์ •์˜ํ•œ ๋‚œ์ด๋„ ๊ฒฝ๊ณ„ ๋ฐ ๋ถ„ํ•ด ํšจ๊ณผ ๊ฒ€์ฆ
  5. ์„ฑ๋Šฅ ํ–ฅ์ƒ: Chain-of-Thought ๋ฐฉ๋ฒ• ๋Œ€๋น„ SAT-Bench์—์„œ 9-15% ํ–ฅ์ƒ, Spider์—์„œ 30-40% ํ–ฅ์ƒ

๋ฐฉ๋ฒ• ์ƒ์„ธ ์„ค๋ช…

์ž‘์—… ์ •์˜

ACONIC์€ LLM ์ž‘์—…์„ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์ •์˜ํ•ฉ๋‹ˆ๋‹ค: ์ œ์•ฝ ์กฐ๊ฑด ์ง‘ํ•ฉ์„ ์„ค๋ช…ํ•˜๋Š” ์ปจํ…์ŠคํŠธ์™€ ์ œ์•ฝ ์กฐ๊ฑด์— ๋Œ€ํ•ด ์ถ”๋ก ํ•ด์•ผ ํ•˜๋Š” ์ฟผ๋ฆฌ๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ, ์ด๋ฅผ ํ˜•์‹ํ™”๋œ ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ ๋ฌธ์ œ๋กœ ์ถ•์•ฝํ•œ ํ›„ ๋ถ„ํ•ดํ•˜๊ณ  ๋ถ€๋ถ„ ์ž‘์—… ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ๊ตฌ์„ฑํ•ฉ๋‹ˆ๋‹ค.

๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜

1. ๊ณ„ํš ๋ฌธ์ œ๋กœ์˜ ์ถ•์•ฝ

์ƒํƒœ ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ ์ž‘์—… ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ž‘์—…์„ ๊ณ„ํš์œผ๋กœ์„œ์˜ ๋งŒ์กฑ์„ฑ(PaS) ๋ฌธ์ œ๋กœ ํ˜•์‹ํ™”ํ•ฉ๋‹ˆ๋‹ค:

P = โŸจF, A, I, GโŸฉ

์—ฌ๊ธฐ์„œ:

  • F: ์„ธ๊ณ„ ์‚ฌ์‹ค์„ ์„ค๋ช…ํ•˜๋Š” ๋ช…์ œ ์œ ์ฐฝ์„ฑ์˜ ์œ ํ•œ ์ง‘ํ•ฉ
  • A: ๋™์ž‘์˜ ์œ ํ•œ ์ง‘ํ•ฉ
  • I, G: ์ดˆ๊ธฐ ๋ฐ ๋ชฉํ‘œ ์œ ์ฐฝ์„ฑ
  • ๋™์ž‘ a์— ๋Œ€ํ•ด: P(a)๋Š” ์ „์ œ ์กฐ๊ฑด์„ ๊ฒฐ์ •ํ•˜๊ณ , A(a)๋Š” ์ฐธ์ด ๋˜๋Š” ์œ ์ฐฝ์„ฑ์„ ๊ฒฐ์ •ํ•˜๋ฉฐ, D(a)๋Š” ๊ฑฐ์ง“์ด ๋˜๋Š” ์œ ์ฐฝ์„ฑ์„ ๊ฒฐ์ •ํ•ฉ๋‹ˆ๋‹ค.

2. ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ ๋ฌธ์ œ๋กœ์˜ ์ถ•์•ฝ

PaS ๋ฌธ์ œ๋ฅผ CSP ์ธ์Šคํ„ด์Šค๋กœ ์ถ•์•ฝํ•˜๋ฉฐ, ๋‹ค์Œ์„ ์ธ์ฝ”๋”ฉํ•ฉ๋‹ˆ๋‹ค:

  • ์ „์ œ ์กฐ๊ฑด fp โˆˆ P(a)
  • ์ถ”๊ฐ€ ํšจ๊ณผ fa โˆˆ A(a)
  • ์‚ญ์ œ ํšจ๊ณผ fd โˆˆ D(a) ์œ ์ฐฝ์„ฑ๊ณผ ๋™์ž‘ ๊ฐ„์˜ ๋ถ€์šธ ์˜์กด์„ฑ ์ œ์•ฝ์œผ๋กœ์„œ.

3. ํŠธ๋ฆฌ ๋ถ„ํ•ด ์ „๋žต

Bodlaender(1998)์˜ ํŠธ๋ฆฌ ๋ถ„ํ•ด ์ด๋ก ์„ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค:

  • ์ตœ์†Œ ์ตœ๋Œ€ ๋ฐฑ ํฌ๊ธฐ์˜ ํŠธ๋ฆฌ ๋ถ„ํ•ด D* ํƒ์ƒ‰(ํŠธ๋ฆฌ ๋„ˆ๋น„)
  • ํŠธ๋ฆฌ ๋„ˆ๋น„๋Š” ๋‚ด์žฌ์  ๋ฌธ์ œ ๋ณต์žก๋„๋ฅผ ํŠน์ง•์ง€์Œ
  • ๊ตญ์†Œ ์ผ๊ด€์„ฑ์ด ์ „์—ญ ์ผ๊ด€์„ฑ์„ ๋ณด์žฅํ•จ

๊ธฐ์ˆ ์  ํ˜์‹ ์ 

  1. ํ˜•์‹ํ™”๋œ ๋ณต์žก๋„ ์ธก์ •: ๊ทธ๋ž˜ํ”„ ์ด๋ก ์˜ ํŠธ๋ฆฌ ๋„ˆ๋น„๋ฅผ LLM ์ž‘์—… ๋ณต์žก๋„์˜ ์ •๋Ÿ‰ํ™” ์ง€ํ‘œ๋กœ ์ฒ˜์Œ ์‚ฌ์šฉ
  2. ์ „์—ญ ์ผ๊ด€์„ฑ ๋ณด์žฅ: ํŠธ๋ฆฌ ๋ถ„ํ•ด๋Š” ๊ตญ์†Œ ๋ถ€๋ถ„๊ทธ๋ž˜ํ”„์˜ ์ผ๊ด€์„ฑ์ด ์ „์—ญ CSP ํ•ด์˜ ์ผ๊ด€์„ฑ์„ ์˜๋ฏธํ•จ์„ ๋ณด์žฅ
  3. ์ตœ์  ๋ถ„ํ•ด ์ „๋žต: ์ตœ์†Œ ํŠธ๋ฆฌ ๋„ˆ๋น„ ๊ธฐ๋ฐ˜ ๋ถ„ํ•ด๋กœ ๊ตญ์†Œ ๋ณต์žก๋„ ์ตœ์†Œํ™”
  4. ์ž๋™ ์ถ•์•ฝ ์ ˆ์ฐจ: ํŠน์ • ๋ฒค์น˜๋งˆํฌ๋ฅผ ์œ„ํ•œ ์ž๋™ ์ถ•์•ฝ ์ ˆ์ฐจ ๊ฐœ๋ฐœ๋กœ ์ˆ˜๋™ ๋ชจ๋ธ๋ง ๊ฐ์†Œ

์‹คํ—˜ ์„ค์ •

๋ฐ์ดํ„ฐ์…‹

1. SAT-Bench

  • SAT ๋ฌธ์ œ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ตฌ์„ฑ๋œ ์ž์—ฐ์–ด ์Šคํ† ๋ฆฌ ๋ฌธ์ œ
  • CNF ํ‘œํ˜„, ์ž์—ฐ์–ด ์„ค๋ช… ๋ฐ ์—”ํ‹ฐํ‹ฐ-SAT ์ •๋ ฌ ๋งคํ•‘ ํฌํ•จ
  • Claude3.5-Sonnet(์ž‘์—…์˜ ์ ˆ๋ฐ˜์„ ๋ฌด์ž‘์œ„ ์ƒ˜ํ”Œ๋ง) ๋ฐ Llama-3-70B(์ „์ฒด ์ž‘์—…) ํ‰๊ฐ€

2. Spider

  • ์ธ๊ธฐ ์žˆ๋Š” NL2SQL ๋ฒค์น˜๋งˆํฌ ๋ฐ์ดํ„ฐ์…‹
  • ์ˆ˜๋ฐฑ ๊ฐœ์˜ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํฌํ•จ, ๊ฐ๊ฐ ์ตœ๋Œ€ 37๊ฐœ ํ…Œ์ด๋ธ”, 90๊ฐœ ์™ธ๋ž˜ ํ‚ค, 100๊ฐœ ์ด์ƒ์˜ ์—ด
  • ์ž‘์—…์—๋Š” ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์Šคํ‚ค๋งˆ S, ์ž์—ฐ์–ด ์ฟผ๋ฆฌ q ๋ฐ ์‹ค์ œ SQL ์ฟผ๋ฆฌ q* ํฌํ•จ

ํ‰๊ฐ€ ์ง€ํ‘œ

  • SAT-Bench: ์ž‘์—… ์™„๋ฃŒ์œจ(์„ฑ๊ณต/์‹คํŒจ)
  • Spider: SQL ์ฟผ๋ฆฌ ์ •ํ™•๋„, ๋‚œ์ด๋„ ๋“ฑ๊ธ‰๋ณ„ ํ‰๊ฐ€(Easy/Medium/Hard/Extra)

๋น„๊ต ๋ฐฉ๋ฒ•

  • Chain-of-Thought (CoT): ํ‘œ์ค€ ์‚ฌ๊ณ ์˜ ์—ฐ์‡„ ํ”„๋กฌํ”„ํŒ… ๋ฐฉ๋ฒ•์„ ๊ธฐ์ค€์„ ์œผ๋กœ ์‚ฌ์šฉ
  • ์™„์ „ ๊ด€์ฐฐ vs ๋ถ„ํ•ด ๊ด€์ฐฐ: ์ „์—ญ ์ •๋ณด ์ ‘๊ทผ๊ณผ ๊ตญ์†Œ ๋ถ„ํ•ด ์ •๋ณด ์ ‘๊ทผ ๋น„๊ต

๊ตฌํ˜„ ์„ธ๋ถ€์‚ฌํ•ญ

  • SageMath๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํŠธ๋ฆฌ ๋ถ„ํ•ด ๊ณ„์‚ฐ, ์ตœ์†Œ ์ฑ„์šฐ๊ธฐ ํœด๋ฆฌ์Šคํ‹ฑ ๋ฐ ์ •ํ™• ํ•ด๊ฒฐ๊ธฐ ์ฑ„ํƒ
  • SAT-Bench๋Š” ๋‹จ๊ณ„์  ๋ณ€์ˆ˜ ํ• ๋‹น ์ „๋žต ์‚ฌ์šฉ
  • Spider๋Š” WITH ์ ˆ์˜ ์ฆ๋ถ„ ๊ตฌ์„ฑ ์ „๋žต ์‚ฌ์šฉ

์‹คํ—˜ ๊ฒฐ๊ณผ

์ฃผ์š” ๊ฒฐ๊ณผ

1. SAT-Bench ๊ฒฐ๊ณผ

  • Claude3.5-Sonnet: 49.3%์—์„œ 58.1%๋กœ ํ–ฅ์ƒ(+8.8%)
  • Llama-3-70B: 21.5%์—์„œ 36.5%๋กœ ํ–ฅ์ƒ(+15.0%)
  • ๋ณต์žก๋„ ์ธก์ •์ด ๋‚œ์ด๋„ ๊ฒฝ๊ณ„๋ฅผ ๋ช…ํ™•ํžˆ ์ •์˜ํ•˜๋ฉฐ, ACONIC์€ ๊ฒฝ๊ณ„๋ฅผ ๋” ๋ณต์žกํ•œ ๋ฌธ์ œ๋กœ ์ด๋™

2. Spider ๊ฒฐ๊ณผ

CoT ๊ธฐ์ค€์„  ๋Œ€๋น„ ACONIC์€ ๋ชจ๋“  ๋‚œ์ด๋„ ๋“ฑ๊ธ‰์—์„œ ํ˜„์ €ํ•œ ํ–ฅ์ƒ์„ ๋ณด์ž…๋‹ˆ๋‹ค:

  • Easy: 42.7%์—์„œ 75.8%๋กœ ํ–ฅ์ƒ(+33.1%)
  • Medium: 38.1%์—์„œ 58.1%๋กœ ํ–ฅ์ƒ(+20.0%)
  • Hard: 36.2%์—์„œ 62.7%๋กœ ํ–ฅ์ƒ(+26.5%)
  • Extra: 19.3%์—์„œ 37.9%๋กœ ํ–ฅ์ƒ(+18.6%)

์‹คํ—˜ ๋ฐœ๊ฒฌ

  1. ๋ณต์žก๋„ ๊ฒฝ๊ณ„: ์‹คํ—˜์€ ๋ฌธ์ œ ํŠธ๋ฆฌ ๋„ˆ๋น„ ๋ฐ ๋ฐฑ ๊ฐœ์ˆ˜๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๊ณ ์ •๋œ "์ด ์ž‘์—… ๋ณต์žก๋„" ๊ฒฝ๊ณ„๋ฅผ ๋“œ๋Ÿฌ๋ƒ…๋‹ˆ๋‹ค.
  2. ์ผ๊ด€๋œ ๊ฐœ์„ : ACONIC ๋ถ„ํ•ด๋Š” ๋‘ ๊ฐ€์ง€ ๋‹ค๋ฅธ ๋ชจ๋ธ(Claude ๋ฐ LLaMA)์—์„œ ์ผ๊ด€๋œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์ž…๋‹ˆ๋‹ค.
  3. ๋‚œ์ด๋„ ๊ตฌ๋ฐฐ: ๋” ๊ฐ•๋ ฅํ•œ ๋ชจ๋ธ(์˜ˆ: Claude)์€ ๊ฒฝ๊ณ„๋ฅผ ๋” ๋ณต์žกํ•œ ๋ฌธ์ œ ๋ฐฉํ–ฅ์œผ๋กœ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค.
  4. ๋ถ„ํ•ด ํšจ๊ณผ: ๊ถค์  ์ˆ˜ ์ฆ๊ฐ€๋Š” ์ •ํ™•๋„๋ฅผ ์•ฝ๊ฐ„ ๊ฐœ์„ ํ•˜์ง€๋งŒ, ๋ณต์žก๋„ ์œ ๋„ ๋ถ„ํ•ด๋Š” ๋” ํ˜„์ €ํ•œ ํ–ฅ์ƒ์„ ๊ฐ€์ ธ์˜ต๋‹ˆ๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ

1. ์ž‘์—… ๋ถ„ํ•ด ๋ฐฉ๋ฒ•

  • Chain-of-Thought ์‹œ๋ฆฌ์ฆˆ: Wei et al.(2022), Yao et al.(2023), Khot et al.(2023)
  • ๋„๊ตฌ ๋ณด์กฐ ๋ฐฉ๋ฒ•: Wang et al.(2024), Singh et al.(2024)
  • ์˜์—ญ ํŠน์ • ๋ถ„ํ•ด: Pourreza and Rafiei(2023), Chen et al.(2024)

2. ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ ๋ฐ ๊ณ„ํš

  • ๊ณ„ํš์œผ๋กœ์„œ์˜ ๋งŒ์กฑ์„ฑ: Selman et al.์˜ ๊ณ ์ „ ์—ฐ๊ตฌ
  • ํŠธ๋ฆฌ ๋ถ„ํ•ด ์ด๋ก : Bodlaender(1998)์˜ ๊ทธ๋ž˜ํ”„ ์ด๋ก  ๊ธฐ์ดˆ
  • ๋‹ค์ค‘ ์—์ด์ „ํŠธ ๊ฒฝ๋กœ ๊ณ„ํš: Surynek et al.(2016)

3. ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ด๋ก  ์‘์šฉ

  • ์ œ์•ฝ ์กฐ๊ฑด ๊ทธ๋ž˜ํ”„ ๋ชจ๋ธ๋ง: Gottlob et al.(2001)
  • NL2SQL ๋ฐฉ๋ฒ•: Wang et al.(2019)์˜ ๊ด€๊ณ„ ์ธ์‹ ์ธ์ฝ”๋”ฉ

๊ฒฐ๋ก  ๋ฐ ๋…ผ์˜

์ฃผ์š” ๊ฒฐ๋ก 

  1. ํ˜•์‹ํ™” ํ”„๋ ˆ์ž„์›Œํฌ์˜ ํšจ๊ณผ์„ฑ: ACONIC์€ ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ LLM ์ž‘์—… ๋ณต์žก๋„ ์ •๋Ÿ‰ํ™”์˜ ์ฒซ ๋ฒˆ์งธ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
  2. ์ฒด๊ณ„์  ๋ถ„ํ•ด์˜ ์žฅ์ : ๋ณต์žก๋„ ๊ธฐ๋ฐ˜ ๋ถ„ํ•ด๋Š” ํœด๋ฆฌ์Šคํ‹ฑ ๋ฐฉ๋ฒ•์„ ํ˜„์ €ํžˆ ๋Šฅ๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.
  3. ํ†ต์šฉ์„ฑ: ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—… ์œ ํ˜•(์กฐํ•ฉ ๋ฌธ์ œ ๋ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ฟผ๋ฆฌ)์—์„œ ํšจ๊ณผ์ ์ž…๋‹ˆ๋‹ค.
  4. ์ด๋ก ์ด ์‹ค์ œ๋ฅผ ์ง€๋„ํ•จ: ํŠธ๋ฆฌ ๋„ˆ๋น„ ๋“ฑ์˜ ๊ทธ๋ž˜ํ”„ ์ด๋ก  ๊ฐœ๋…์€ LLM ์ž‘์—… ๋ถ„ํ•ด์— ์ด๋ก ์  ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

ํ•œ๊ณ„

  1. ์ ์šฉ ๋ฒ”์œ„ ์ œํ•œ: ์ œ์•ฝ ์กฐ๊ฑด ๋งŒ์กฑ ๋ฌธ์ œ๋กœ ํŽธ๋ฆฌํ•˜๊ฒŒ ๋ชจ๋ธ๋งํ•  ์ˆ˜ ์žˆ๋Š” ์ž‘์—…์—๋งŒ ์ ์šฉ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
  2. ์™„์ „ํ•œ ํ‘œํ˜„์˜ ์–ด๋ ค์›€: ์‹ค์ œ ๋ฌธ์ œ๋Š” ๋ฌธ์ œ ๋ชจํ˜ธ์„ฑ, ์—์ด์ „ํŠธ ๋™์ž‘์˜ ๋ถˆํˆฌ๋ช…์„ฑ ๋˜๋Š” ๋ชจํ˜ธํ•œ ์ปจํ…์ŠคํŠธ ์ •๋ณด๋กœ ์ธํ•ด ์™„์ „ํžˆ ๋…ผ๋ฆฌ์ ์œผ๋กœ ํ‘œํ˜„๋  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค.
  3. ์™„์ „ ์ž์œจ์„ฑ ๋ถ€์žฌ: ACONIC์€ ์™„์ „ํžˆ ์ž์œจ์ ์ธ ๋ถ„ํ•ด ๋˜๋Š” ์ถ”๋ก  ์‹œ์Šคํ…œ์„ ๊ตฌ์„ฑํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.
  4. ๋ฒค์น˜๋งˆํฌ ํŠน์ด์„ฑ: ํ‰๊ฐ€ ์ž‘์—…์€ ์ œ์•ฝ ์กฐ๊ฑด ํ•ด๊ฒฐ๊ธฐ ๋˜๋Š” ๊ฐ„๋‹จํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์ง์ ‘ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

ํ–ฅํ›„ ๋ฐฉํ–ฅ

  1. ํ˜ผํ•ฉ ๋ถ„ํ•ด ๋ฐฉ๋ฒ•: ๋…ผ๋ฆฌ์  ์ œ์•ฝ ์กฐ๊ฑด๊ณผ ์ƒ์‹ ์ œ์•ฝ ์กฐ๊ฑด์„ ๊ฒฐํ•ฉํ•˜๋Š” ํ˜ผํ•ฉ ๋ถ„ํ•ด ๋ฐฉ๋ฒ• ์—ฐ๊ตฌ
  2. ๋” ๊ด‘๋ฒ”์œ„ํ•œ ์ž‘์—… ์œ ํ˜•: ๊ต์ฐฉ ์ƒํƒœ ๊ฐ์ง€, ์ž์› ์Šค์ผ€์ค„๋ง ๋“ฑ ๋” ๋งŽ์€ ์‹ค์ œ ๋ฌธ์ œ๋กœ ํ™•์žฅ
  3. ์™„์ „ ์ž์œจ ์‹œ์Šคํ…œ: ์™„์ „ํžˆ ์ž์œจ์ ์ธ ๋ถ„ํ•ด ๋ฐ ์ถ”๋ก  ์‹œ์Šคํ…œ์œผ๋กœ ๋ฐœ์ „
  4. ํ•™์Šต ๊ธฐ๋ฐ˜ ๋ถ„ํ•ด: ๋‹ค๋ฅธ ์ด๋ก ์  ๊ธฐ์ดˆ ๋˜๋Š” ํ•™์Šต ๊ธฐ๋ฐ˜ ๋ถ„ํ•ด ํ”„๋ ˆ์ž„์›Œํฌ์™€์˜ ๋น„๊ต ์—ฐ๊ตฌ

์‹ฌ์ธต ํ‰๊ฐ€

์žฅ์ 

  1. ์ด๋ก ์  ํ˜์‹ : ๊ทธ๋ž˜ํ”„ ์ด๋ก ์˜ ํŠธ๋ฆฌ ๋ถ„ํ•ด ์ด๋ก ์„ LLM ์ž‘์—… ๋ถ„ํ•ด์— ์ฒด๊ณ„์ ์œผ๋กœ ์ฒ˜์Œ ์ ์šฉ
  2. ํ˜•์‹ํ™” ์—„๋ฐ€์„ฑ: PaS์—์„œ CSP๋ฅผ ๊ฑฐ์ณ ํŠธ๋ฆฌ ๋ถ„ํ•ด๊นŒ์ง€์˜ ์™„์ „ํ•œ ์ถ•์•ฝ ์ฒด์ธ์„ ์ œ๊ณตํ•˜๋Š” ์—„๊ฒฉํ•œ ์ˆ˜ํ•™์  ํ”„๋ ˆ์ž„์›Œํฌ
  3. ์ถฉ๋ถ„ํ•œ ์‹ค์ฆ: ๋‘ ๊ฐ€์ง€ ๋‹ค๋ฅธ ์œ ํ˜•์˜ ๋ฒค์น˜๋งˆํฌ์—์„œ ๊ฒ€์ฆ๋˜์—ˆ์œผ๋ฉฐ ๊ฒฐ๊ณผ๊ฐ€ ์ผ๊ด€๋˜๊ณ  ํ˜„์ €ํ•จ
  4. ๊ฐ•ํ•œ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ: ๋ณต์žก๋„ ์ธก์ •์ด ์ž‘์—… ๋‚œ์ด๋„์— ๋Œ€ํ•œ ์ง๊ด€์  ์ดํ•ด๋ฅผ ์ œ๊ณต
  5. ํ†ต์šฉ ํ”„๋ ˆ์ž„์›Œํฌ: ํŠน์ • ์ž‘์—… ์œ ํ˜•์— ๊ตญํ•œ๋˜์ง€ ์•Š์œผ๋ฉฐ ์ข‹์€ ํ†ต์šฉ์„ฑ์„ ๊ฐ€์ง

๋ถ€์กฑํ•œ ์ 

  1. ๋ชจ๋ธ๋ง ๋ณต์žก์„ฑ: ์‹ค์ œ ์ž‘์—…์„ CSP๋กœ ์ถ•์•ฝํ•˜๋ ค๋ฉด ์ „๋ฌธ ์ง€์‹๊ณผ ์ˆ˜๋™ ์—”์ง€๋‹ˆ์–ด๋ง์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
  2. ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ: ํŠธ๋ฆฌ ๋ถ„ํ•ด ๊ณ„์‚ฐ ์ž์ฒด๊ฐ€ ๋†’์€ ๋ณต์žก๋„๋ฅผ ๊ฐ€์งˆ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ์ œํ•œ๋œ ๊ธฐ์ค€์„  ๋น„๊ต: ์ฃผ๋กœ CoT์™€ ๋น„๊ตํ•˜๋ฉฐ ๋‹ค๋ฅธ ์ฒด๊ณ„์  ๋ถ„ํ•ด ๋ฐฉ๋ฒ•๊ณผ์˜ ๋น„๊ต๊ฐ€ ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.
  4. ์ž‘์—… ์œ ํ˜• ์ œํ•œ: ๋‘ ๊ฐ€์ง€ ์ž‘์—… ์œ ํ˜•์—์„œ๋งŒ ๊ฒ€์ฆ๋˜์—ˆ์œผ๋ฉฐ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์€ ๋” ๊ด‘๋ฒ”์œ„ํ•œ ๊ฒ€์ฆ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

์˜ํ–ฅ๋ ฅ

  1. ์ด๋ก ์  ๊ธฐ์—ฌ: LLM ์ž‘์—… ๋ถ„ํ•ด์— ์ƒˆ๋กœ์šด ์ด๋ก ์  ๊ด€์  ์ œ๊ณต
  2. ๋ฐฉ๋ฒ•๋ก ์  ๊ฐ€์น˜: ACONIC ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ํ˜•์‹ํ™” ๋ฐฉ๋ฒ• ๊ธฐ๋ฐ˜์˜ ๋” ๋งŽ์€ LLM ์—ฐ๊ตฌ์— ์˜๊ฐ์„ ์ค„ ์ˆ˜ ์žˆ์Œ
  3. ์‹ค์šฉ์  ๊ฐ€์น˜: ํŠน์ • ์œ ํ˜• ์ž‘์—…์—์„œ์˜ ํ˜„์ €ํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์€ ์‹ค์ œ ์‘์šฉ ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง
  4. ์—ฐ๊ตฌ ๋ฐฉํ–ฅ: LLM๊ณผ ์ „ํ†ต์  AI ๊ธฐํ˜ธ ๋ฐฉ๋ฒ• ๊ฒฐํ•ฉ์˜ ์ƒˆ๋กœ์šด ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ๊ฐœ์ฒ™ํ•  ์ˆ˜ ์žˆ์Œ

์ ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค

  1. ์กฐํ•ฉ ์ตœ์ ํ™” ๋ฌธ์ œ: ์Šค์ผ€์ค„๋ง, ์ž์› ํ• ๋‹น ๋“ฑ CSP๋กœ ๋ชจ๋ธ๋งํ•  ์ˆ˜ ์žˆ๋Š” ๋ฌธ์ œ
  2. ๊ตฌ์กฐํ™”๋œ ์ฟผ๋ฆฌ ์ž‘์—…: ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ฟผ๋ฆฌ, ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์ถ”๋ก  ๋“ฑ
  3. ๋‹ค์ค‘ ์ œ์•ฝ ๊ณ„ํš: ์—ฌ๋Ÿฌ ์ œ์•ฝ ์กฐ๊ฑด์„ ๋งŒ์กฑํ•ด์•ผ ํ•˜๋Š” ๊ณ„ํš ์ž‘์—…
  4. ๋…ผ๋ฆฌ ์ถ”๋ก  ์ž‘์—…: ๋…ผ๋ฆฌ ์ œ์•ฝ์œผ๋กœ ํ˜•์‹ํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ์ถ”๋ก  ๋ฌธ์ œ

์ฐธ๊ณ ๋ฌธํ—Œ

  1. Bodlaender, H. L. (1998). A partial k-arboretum of graphs with bounded treewidth. Theoretical computer science, 209(1-2):1โ€“45.
  2. Wei, J., et al. (2022). Chain-of-thought prompting elicits reasoning in large language models. arXiv preprint arXiv:2201.11903.
  3. Yu, T., et al. (2019). Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task.
  4. Gottlob, G., Leone, N., & Scarcello, F. (2001). Hypertree decompositions: A survey. International Symposium on Mathematical Foundations of Computer Science.

์š”์•ฝ: ๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์‹œํ•œ ACONIC ํ”„๋ ˆ์ž„์›Œํฌ๋Š” LLM ์ž‘์—… ๋ถ„ํ•ด ์˜์—ญ์˜ ์ค‘์š”ํ•œ ์ด๋ก ์  ์ง„์ „์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. ํ˜•์‹ํ™”๋œ ๋ณต์žก๋„ ์ธก์ •๊ณผ ์ฒด๊ณ„์  ๋ถ„ํ•ด ์ „๋žต์„ ๋„์ž…ํ•จ์œผ๋กœ์จ ๋ณต์žกํ•œ LLM ์ž‘์—… ํ•ด๊ฒฐ์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์‚ฌ๊ณ ๋ฐฉ์‹์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์ ์šฉ ๋ฒ”์œ„ ๋ฐ ๋ชจ๋ธ๋ง ๋ณต์žก์„ฑ์˜ ํ•œ๊ณ„๊ฐ€ ์žˆ์Œ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ , ํŠน์ • ์ž‘์—…์—์„œ์˜ ํ˜„์ €ํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ๊ณผ ์ด๋ก ์  ๊ธฐ์—ฌ๋กœ ์ธํ•ด ๋ณธ ์—ฐ๊ตฌ๋Š” ํ•ด๋‹น ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์—…์ ์ž…๋‹ˆ๋‹ค.