2025-11-16T01:40:12.068255

Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning

Su
Large language models (LLMs) have been widely applied to assist in finding solutions for diverse questions. Prior work has proposed representing a method as a pair of a question and its corresponding solution, enabling method reuse. However, existing approaches typically require the questions to be highly similar. In this paper, we extend the scope of method reuse to address questions with low similarity or with hidden similarities that are not explicitly observable. For questions that are similar in a general-specific sense (i.e., broader or narrower in scope), we propose to first separate the question and solution, rather than directly feeding the pair to the LLM. The LLM is then guided to adapt the solution to new but related questions, allowing it to focus on solution transfer rather than question recognition. Furthermore, we extend this approach to cases where questions only share partial features or hidden characteristics. This enables cross-question method reuse beyond conventional similarity constraints. Experimental verification shows that our scope-extension approach increases the probability of filtering out reusable solutions, thereby improving the effectiveness of cross-question method reuse.
academic

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์—์„œ์˜ ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ: ๋‹จ์–ด ์ˆ˜์ค€ ์˜ˆ์ธก์—์„œ ํ•ฉ๋ฆฌ์  ๋…ผ๋ฆฌ์ธต ์ถ”๋ก ์œผ๋กœ

๊ธฐ๋ณธ ์ •๋ณด

  • ๋…ผ๋ฌธ ID: 2509.05660
  • ์ œ๋ชฉ: Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning
  • ์ €์ž: Hong Su (์„ฑ๋„์ •๋ณด๊ณตํ•™๋Œ€ํ•™๊ต ์ปดํ“จํ„ฐ๊ณผํ•™ํ•™๋ถ€)
  • ๋ถ„๋ฅ˜: cs.CL (๊ณ„์‚ฐ์–ธ์–ดํ•™)
  • ๊ฒŒ์žฌ ์ €๋„: Journal of LaTeX Class Files, Vol. 14, No. 8, August 2015
  • ๋…ผ๋ฌธ ๋งํฌ: https://arxiv.org/abs/2509.05660v2

์ดˆ๋ก

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLMs)์€ ๋‹ค์–‘ํ•œ ๋ฌธ์ œ ํ•ด๊ฒฐ์„ ์ง€์›ํ•˜๊ธฐ ์œ„ํ•ด ๊ด‘๋ฒ”์œ„ํ•˜๊ฒŒ ์ ์šฉ๋˜์–ด ์™”๋‹ค. ์„ ํ–‰ ์—ฐ๊ตฌ์—์„œ๋Š” ๋ฐฉ๋ฒ•์„ ๋ฌธ์ œ์™€ ๊ทธ์— ํ•ด๋‹นํ•˜๋Š” ํ•ด๊ฒฐ์ฑ…์˜ ์Œ์œผ๋กœ ํ‘œํ˜„ํ•˜์—ฌ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์„ ๊ตฌํ˜„ํ•  ๊ฒƒ์„ ์ œ์•ˆํ–ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์€ ์ผ๋ฐ˜์ ์œผ๋กœ ๋ฌธ์ œ ๊ฐ„์˜ ๋†’์€ ์œ ์‚ฌ์„ฑ์„ ์š”๊ตฌํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์˜ ๋ฒ”์œ„๋ฅผ ํ™•์žฅํ•˜์—ฌ ์œ ์‚ฌ์„ฑ์ด ๋‚ฎ๊ฑฐ๋‚˜ ์•”๋ฌต์  ์œ ์‚ฌ์„ฑ์„ ๊ฐ€์ง„ ๋ฌธ์ œ๋“ค์„ ์ฒ˜๋ฆฌํ•œ๋‹ค. ์ผ๋ฐ˜-ํŠน์ˆ˜ ์˜๋ฏธ์—์„œ ์œ ์‚ฌํ•œ ๋ฌธ์ œ๋“ค์˜ ๊ฒฝ์šฐ, ์ €์ž๋“ค์€ ๋จผ์ € ๋ฌธ์ œ์™€ ํ•ด๊ฒฐ์ฑ…์„ ๋ถ„๋ฆฌํ•˜๊ณ  ์ด๋ฅผ ์ง์ ‘ LLM์— ์ž…๋ ฅํ•˜์ง€ ์•Š์„ ๊ฒƒ์„ ์ œ์•ˆํ•œ๋‹ค. ๊ทธ ํ›„ LLM์„ ์œ ๋„ํ•˜์—ฌ ํ•ด๊ฒฐ์ฑ…์„ ์ƒˆ๋กœ์šด ๊ด€๋ จ ๋ฌธ์ œ์— ์ ์‘์‹œํ‚ค๊ณ , ๋ฌธ์ œ ์‹๋ณ„์ด ์•„๋‹Œ ํ•ด๊ฒฐ์ฑ… ์ „์ด์— ์ง‘์ค‘ํ•˜๋„๋ก ํ•œ๋‹ค. ๋˜ํ•œ ์ด ๋ฐฉ๋ฒ•์€ ๋ถ€๋ถ„์  ํŠน์„ฑ๋งŒ ๊ณต์œ ํ•˜๊ฑฐ๋‚˜ ์ˆจ๊ฒจ์ง„ ํŠน์„ฑ์„ ๊ฐ€์ง„ ๋ฌธ์ œ๋“ค๋กœ๋„ ํ™•์žฅ๋œ๋‹ค. ์‹คํ—˜ ๊ฒ€์ฆ์€ ์ด๋Ÿฌํ•œ ๋ฒ”์œ„ ํ™•์žฅ ๋ฐฉ๋ฒ•์ด ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ํ•ด๊ฒฐ์ฑ…์„ ์„ ๋ณ„ํ•  ํ™•๋ฅ ์„ ๋†’์—ฌ ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์˜ ํšจ๊ณผ์„ฑ์„ ๊ฐœ์„ ํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค.

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋™๊ธฐ

๋ฌธ์ œ ์ •์˜

์ „ํ†ต์ ์ธ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์€ ์ฃผ๋กœ ๋‹จ์–ด ์ˆ˜์ค€์—์„œ ํ›ˆ๋ จ๋˜๋ฉฐ, ๋‹ค์Œ ํ† ํฐ ์˜ˆ์ธก ๋˜๋Š” ๋ˆ„๋ฝ๋œ ํ† ํฐ ์ฑ„์šฐ๊ธฐ๋ฅผ ํ†ตํ•ด ํ•™์Šตํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ํ›ˆ๋ จ ๋ฐฉ์‹์€ ์ฃผ๋กœ ํ†ต๊ณ„์  ๊ณต์ถœํ˜„์„ฑ์„ ๋ฐ˜์˜ํ•˜๋ฉฐ, ๊ณ ์ˆ˜์ค€์˜ ๋…ผ๋ฆฌ ์ถ”๋ก ์ด ์•„๋‹ˆ๋ผ ์ง๊ด€์ด๋‚˜ ํŒจํ„ด ๋งค์นญ์— ๋” ๊ฐ€๊น๊ณ  ํ•ฉ๋ฆฌ์  ์˜์‚ฌ๊ฒฐ์ •์ด ์•„๋‹ˆ๋‹ค.

์—ฐ๊ตฌ ๋™๊ธฐ

  1. ๋‹จ์–ด ์ˆ˜์ค€ ์ถ”๋ก ์˜ ํ•œ๊ณ„: ํ˜„์žฌ์˜ ํŠธ๋žœ์Šคํฌ๋จธ ๊ธฐ๋ฐ˜ LLM์€ ๋ฐฉ๋ฒ• ์ˆ˜์ค€์˜ ์ถ”๋ก ์— ์–ด๋ ค์›€์„ ๊ฒช์œผ๋ฉฐ, ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์—์„œ ์ž์ฃผ ๋‚˜ํƒ€๋‚˜๋Š” ๋ฐฉ๋ฒ•์„ ์„ ํ˜ธํ•˜๋Š” ๊ฒฝํ–ฅ์ด ์žˆ๋‹ค. ์ด๋Š” ์ด๋Ÿฌํ•œ ๋ฐฉ๋ฒ•๋“ค์ด ์ตœ์ ์ด ์•„๋‹ ์ˆ˜๋„ ์žˆ์Œ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ๊ทธ๋ ‡๋‹ค.
  2. ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์˜ ํ•œ๊ณ„: ๊ธฐ์กด์˜ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋ฌธ์ œ ๊ฐ„์˜ ๋†’์€ ์œ ์‚ฌ์„ฑ์„ ์š”๊ตฌํ•˜์—ฌ ๊ทธ ์ ์šฉ ๋ฒ”์œ„๋ฅผ ์ œํ•œํ•œ๋‹ค.
  3. ๊ต์ฐจ ์˜์—ญ ์ง€์‹ ์ด์ „์˜ ํ•„์š”์„ฑ: ์ธ๊ฐ„์€ ํ•œ ๋ฌธ์ œ์˜ ํ•ด๊ฒฐ์ฑ…์„ ๊ฒ‰๋ณด๊ธฐ์— ๋ฌด๊ด€ํ•œ ์ƒˆ๋กœ์šด ๋ฌธ์ œ์— ์œ ์ถ”์ ์œผ๋กœ ์ ์šฉํ•  ์ˆ˜ ์žˆ์ง€๋งŒ, ํ˜„์žฌ์˜ LLM์€ ์ด๋Ÿฌํ•œ ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ๋Šฅ๋ ฅ์ด ๋ถ€์กฑํ•˜๋‹ค.

ํ•ต์‹ฌ ๊ณผ์ œ

LLM์ด ๋ฌธ์ œ ๊ฐ„ ์œ ์‚ฌ์„ฑ์ด ๋‚ฎ๊ฑฐ๋‚˜ ๋ช…๋ฐฑํ•œ ์—ฐ๊ด€์„ฑ์ด ์—†์„ ๋•Œ์—๋„ ๊ธฐ์กด์˜ ํ•ด๊ฒฐ์ฑ…์„ ํšจ๊ณผ์ ์œผ๋กœ ์žฌ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ๋ฐฉ๋ฒ•.

ํ•ต์‹ฌ ๊ธฐ์—ฌ

  1. ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ๋ฒ”์œ„ ํ™•์žฅ: ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์„ ๋†’์€ ์œ ์‚ฌ์„ฑ ์‚ฌ๋ก€์—์„œ ์ผ๋ฐ˜-ํŠน์ˆ˜ ๋งคํ•‘ ๋ฐ ํŠน์„ฑ ๊ธฐ๋ฐ˜ ์ˆจ๊ฒจ์ง„ ๊ด€๊ณ„๋กœ ํ™•์žฅ.
  2. ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ๋ชจ๋ธ ์ œ์•ˆ:
    • ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ: ์ผ๋ฐ˜-ํŠน์ˆ˜ ๊ด€๊ณ„ ๋ฐ ๋ณ‘๋ ฌ ๊ด€๊ณ„ ์ฒ˜๋ฆฌ
    • ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ: ๋ถ€๋ถ„ ํŠน์„ฑ ๋งค์นญ ๋ฐ ์ˆจ๊ฒจ์ง„ ํŠน์„ฑ ์‹๋ณ„ ์ง€์›
  3. "๋ฐฉ๋ฒ•์˜ ๋ฐฉ๋ฒ•"(Method of Methods, MoM) ๊ฐœ๋… ๋„์ž…: ํ˜„์žฌ ์ ์šฉ ๋ฐฉ๋ฒ•์˜ ํšจ๊ณผ์„ฑ์„ ๊ฒ€์ฆ, ๊ฐœ์„  ๋ฐ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•œ ๊ณ ์ˆ˜์ค€ ๋ฐฉ๋ฒ• ์ œ๊ณต.
  4. ์ด๋ก ์  ํ”„๋ ˆ์ž„์›Œํฌ: ๋‹จ์–ด ์ˆ˜์ค€ ์˜ˆ์ธก์—์„œ ๋…ผ๋ฆฌ์ธต ์ถ”๋ก ์œผ๋กœ ์ƒํ–ฅ์‹ ์ „ํ™˜, ์ˆœ์ˆ˜ ํ†ต๊ณ„๊ฐ€ ์•„๋‹Œ ํ•ฉ๋ฆฌ์  ํ•ด๊ฒฐ์ฑ… ์ ์šฉ ์‹คํ˜„.

๋ฐฉ๋ฒ• ์ƒ์„ธ ์„ค๋ช…

์ž‘์—… ์ •์˜

๋ชฉํ‘œ ์งˆ๋ฌธ Qt๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ, ์ง์ ‘์ ์ธ ํ•ด๊ฒฐ์ฑ…์ด ์—†๋Š” ๊ฒฝ์šฐ ๊ธฐ์กด์˜ ๋ฐฉ๋ฒ• ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์—์„œ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ํ•ด๊ฒฐ์ฑ…์„ ์ฐพ๋Š”๋‹ค. ์ด๋Š” ์ด๋Ÿฌํ•œ ๋ฐฉ๋ฒ•๋“ค์˜ ์›๋ž˜ ์งˆ๋ฌธ์ด ๋ชฉํ‘œ ์งˆ๋ฌธ๊ณผ ์œ ์‚ฌ์„ฑ์ด ๋‚ฎ๊ฑฐ๋‚˜ ์•”๋ฌต์  ๊ด€๊ณ„๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์„ ์ˆ˜๋„ ์žˆ๋‹ค.

๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜

1. ๊ด€๊ณ„ํ˜• ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ

์ผ๋ฐ˜-ํŠน์ˆ˜ ๋ฐฉ๋ฒ•: ๋‘ ๋ฐฉ๋ฒ• Ma์™€ Mb๊ฐ€ ๊ฐ๊ฐ ์งˆ๋ฌธ ์ง‘ํ•ฉ Qma์™€ Qmb๋ฅผ ํ•ด๊ฒฐํ•œ๋‹ค๊ณ  ํ•˜์ž. ๋‹ค์Œ์„ ๋งŒ์กฑํ•˜๋ฉด:

Qma โŠƒ Qmb  (1)

Ma๋Š” Mb๋ณด๋‹ค ๋” ์ผ๋ฐ˜์ ์ด๋ฉฐ, ์ˆ˜์ง ์žฌ์‚ฌ์šฉ์ด ๊ฐ€๋Šฅํ•˜๋‹ค.

๋ณ‘๋ ฌ ๋ฐฉ๋ฒ•: ๋‘ ๋ฐฉ๋ฒ•์ด ๋ณ‘๋ ฌ์ด ๋˜๋ ค๋ฉด ๊ทธ๋“ค์˜ ์งˆ๋ฌธ ์ง‘ํ•ฉ์ด ๋™์ผํ•œ ๋” ๊ด‘๋ฒ”์œ„ํ•œ ๋ฒ”์ฃผ์˜ ๋ถ„๋ฆฌ๋œ ๋ถ€๋ถ„์ง‘ํ•ฉ์ด์–ด์•ผ ํ•œ๋‹ค:

Qma โˆฉ Qmb = โˆ…, Qma โŠ‚ Qg, Qmb โŠ‚ Qg  (2)

2. ํŠน์„ฑํ˜• ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ

ํŠน์„ฑ ๊ณต๊ฐ„ ์ •์˜: ์งˆ๋ฌธ Q์— ๋Œ€ํ•ด, ๊ทธ ํŠน์„ฑ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์ •์˜๋œ๋‹ค:

F(Q) โІ F, F(Q) = Fmeas(Q) โˆช Ftext(Q)  (3)

์—ฌ๊ธฐ์„œ Fmeas(Q)๋Š” ๋ช…์‹œ์  ์ˆ˜์น˜ ์†์„ฑ์ด๊ณ , Ftext(Q)๋Š” ํ•™์Šต ์ธ์ฝ”๋” h(ยท)๋ฅผ ํ†ตํ•ด ํ…์ŠคํŠธ์—์„œ ์ถ”์ถœํ•œ ํŠน์„ฑ์ด๋‹ค.

ํŠน์„ฑ ์œ ์‚ฌ์„ฑ:

Simfeat(Qa, Qb) = S(F(Qa), F(Qb))  (5)

์žฌ์‚ฌ์šฉ ์กฐ๊ฑด:

Reusefeat(Qb; Sa) = {
    1, if Simfeat(Qa, Qb) โ‰ฅ ฯ„ and Valid(Sa, Qb) = 1
    0, otherwise
}  (6)

3. ์ „์—ญ ๋ฐฉ๋ฒ•

์ „์—ญ ๋ฐฉ๋ฒ• Gi = (Qgi, Sgi)๋Š” ๊ด‘๋ฒ”์œ„ํ•œ ์ ์šฉ์„ฑ์„ ๊ฐ€์ง€๋ฉฐ, ๋ฐฉ๋ฒ• ์‹คํ–‰์˜ ์‹ ๋ขฐ์„ฑ๊ณผ ์ผ๊ด€์„ฑ์„ ๋†’์ด๊ธฐ ์œ„ํ•œ ๋ฒ”์šฉ ํ”„๋กœ๊ทธ๋žจ ๊ฐ•ํ™”๋กœ ์ž‘์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

4. ๋ฐฉ๋ฒ•์˜ ๋ฐฉ๋ฒ•(MoM)

MoM์€ ๊นŠ์ด๋ณ„๋กœ ๊ณ„์ธต์ ์œผ๋กœ ์กฐ์ง๋œ๋‹ค:

  • M(0): ์ง์ ‘ ๋ฐฉ๋ฒ•, Q โ†ฆ S
  • M(1): 1์ฐจ ๋ฐฉ๋ฒ•, M(0) โ†ฆ M(0)'
  • M(i+1): (i+1)์ฐจ ๋ฐฉ๋ฒ•, M(i) โ†ฆ M(i)'

๊ธฐ์ˆ  ํ˜์‹  ํฌ์ธํŠธ

  1. ์งˆ๋ฌธ-ํ•ด๊ฒฐ์ฑ… ๋ถ„๋ฆฌ ์ „๋žต: ์งˆ๋ฌธ-ํ•ด๊ฒฐ์ฑ… ์Œ์„ ์ง์ ‘ LLM์— ์ž…๋ ฅํ•˜์ง€ ์•Š๊ณ , ๋จผ์ € ๋ถ„๋ฆฌํ•œ ํ›„ LLM์„ ์œ ๋„ํ•˜์—ฌ ํ•ด๊ฒฐ์ฑ… ์ด์ „์„ ์ˆ˜ํ–‰.
  2. ๋‹ค์ธต ์œ ์‚ฌ์„ฑ ์‹๋ณ„:
    • ๋ช…์‹œ์  ํŠน์„ฑ ๋งค์นญ
    • ์ˆจ๊ฒจ์ง„ ํŠน์„ฑ ์ถ”๋ก 
    • ์ „์ฒด ๋ฐฉ๋ฒ• ํ…œํ”Œ๋ฆฟ ์žฌ์‚ฌ์šฉ
  3. ๊ณ„์ธต์  ๊ฒ€์ฆ ๋ฉ”์ปค๋‹ˆ์ฆ˜: Valid ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ์ƒˆ๋กœ์šด ๋ฌธ๋งฅ์—์„œ ํ•ด๊ฒฐ์ฑ…์˜ ๋…ผ๋ฆฌ์  ํƒ€๋‹น์„ฑ ๋ณด์žฅ.

์‹คํ—˜ ์„ค์ •

๋ฐ์ดํ„ฐ์…‹

์‹คํ—˜์€ ๋‘ ๊ฐ€์ง€ ํ…Œ์ŠคํŠธ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค:

  1. ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ ํ…Œ์ŠคํŠธ: ๋ฐ”๋‚˜๋‚˜ ์‹ ์„ ๋„ ํŒ๋‹จ ๋ฌธ์ œ, ๊ณผ์ผ ์‹ ์„ ๋„์˜ ์ผ๋ฐ˜ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ
  2. ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ ํ…Œ์ŠคํŠธ: ํ•˜๋“œ ๋“œ๋ผ์ด๋ธŒ ์‚ฌ์šฉ ์‹œ๊ฐ„ ์žฌ์„ค์ • ๋ฌธ์ œ, MP3 ํŒŒ์ผ ์ฒ˜๋ฆฌ ๊ฒฝํ—˜ ์žฌ์‚ฌ์šฉ

ํ‰๊ฐ€ ์ง€ํ‘œ

  • ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„: ์ƒ์„ฑ๋œ ํ•ด๊ฒฐ์ฑ…๊ณผ ๋ชฉํ‘œ ๋ฐฉ๋ฒ•์˜ ์ •๋ ฌ ์ •๋„ ์ธก์ •
  • ํ†ต๊ณ„์  ์œ ์˜์„ฑ ๊ฒ€์ •: ๋…๋ฆฝ ํ‘œ๋ณธ t ๊ฒ€์ •์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐฉ๋ฒ• ๊ฐ„ ์ฐจ์ด ํ‰๊ฐ€

๋น„๊ต ๋ฐฉ๋ฒ•

  1. RelaMethod vs CompareRela: ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ ํšจ๊ณผ ํ‰๊ฐ€
  2. featureMethd vs compareMP3Method: ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ ํšจ๊ณผ ํ‰๊ฐ€

๊ตฌํ˜„ ์„ธ๋ถ€์‚ฌํ•ญ

  • ๊ฐ ๋ฐฉ๋ฒ•๋งˆ๋‹ค 20ํšŒ ํ…Œ์ŠคํŠธ
  • Welch์˜ t ๊ฒ€์ •์„ ์‚ฌ์šฉํ•œ ํ†ต๊ณ„ ๋ถ„์„
  • ๋…ธ์ด์ฆˆ ๊ฐ์†Œ๋ฅผ ์œ„ํ•ด ๋ชฉํ‘œ ๋ฐฉ๋ฒ•๊ณผ ๊ด€๋ จ๋œ ํ…์ŠคํŠธ ๋ถ€๋ถ„๋งŒ ๋น„๊ต

์‹คํ—˜ ๊ฒฐ๊ณผ

์ฃผ์š” ๊ฒฐ๊ณผ

๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ ์‹คํ—˜:

  • RelaMethod ํ‰๊ท  ์œ ์‚ฌ๋„: 0.4835 (ํ‘œ์ค€ํŽธ์ฐจ: 0.0801)
  • CompareRela ํ‰๊ท  ์œ ์‚ฌ๋„: 0.2820 (ํ‘œ์ค€ํŽธ์ฐจ: 0.0558)
  • t๊ฐ’: 9.23, p๊ฐ’: 8.98ร—10^-11 (p < 0.05)
  • ๊ฒฐ๋ก : RelaMethod๊ฐ€ ๊ธฐ์ค€์„  ๋ฐฉ๋ฒ•์„ ํฌ๊ฒŒ ๋Šฅ๊ฐ€ํ•จ

ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ ์‹คํ—˜:

  • featureMethd ํ‰๊ท  ์œ ์‚ฌ๋„: 0.2945 (ํ‘œ์ค€ํŽธ์ฐจ: 0.0698)
  • compareMP3Method ํ‰๊ท  ์œ ์‚ฌ๋„: 0.3983 (ํ‘œ์ค€ํŽธ์ฐจ: 0.0670)
  • t๊ฐ’: -4.80, p๊ฐ’: 2.52ร—10^-5 (p < 0.05)
  • ๊ฒฐ๋ก : ๋‘ ๋ฐฉ๋ฒ• ๊ฐ„์— ์œ ์˜๋ฏธํ•œ ์ฐจ์ด ์กด์žฌ

๋น„๊ต ๋ถ„์„

๋ฐฉ๋ฒ• ๋น„๊ตํ‰๊ท  ์ฐจ์ดํ‰๊ท  ์œ ์‚ฌ๋„์ƒ๋Œ€ ๋น„์œจ์žฌ์‚ฌ์šฉ ์œ ํ˜•
RelaMethod vs CompareRela0.20150.351057.4%์˜์กดํ˜• ์žฌ์‚ฌ์šฉ
featureMethd vs compareMP3Method0.10380.072614.3%๋ถ€๋ถ„ ๊ด€๋ จ

์‹คํ—˜ ๋ฐœ๊ฒฌ

  1. ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ์ด ๋” ์•ˆ์ •์ : ๊ตฌ์กฐ์  ์—ฐ๊ฒฐ์— ์˜์กดํ•˜๋Š” ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ์ด ๋ถ€๋ถ„์  ์ค‘๋ณต์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ๋ณด๋‹ค ๋” ์•ˆ์ •์ ์œผ๋กœ ์ˆ˜ํ–‰๋จ.
  2. ๋ช…์‹œ์  ๋ถ„๋ฆฌ์˜ ํšจ๊ณผ: LLM์— ์งˆ๋ฌธ-ํ•ด๊ฒฐ์ฑ… ์Œ์„ ์ง์ ‘ ์ œ๊ณตํ•˜๋Š” ๊ฒƒ๋ณด๋‹ค ๋ช…ํ™•ํ•˜๊ฒŒ ์ƒ์„ฑํ•˜๋„๋ก ์ง€์‹œํ•˜๋Š” ๊ฒƒ์ด ๋” ํšจ๊ณผ์ .
  3. ํ†ต๊ณ„์  ์œ ์˜์„ฑ: ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ์˜ ํ†ต๊ณ„์  ๋ถ„๋ฆฌ๊ฐ€ ๋” ๊ฐ•ํ•จ (t๊ฐ’ 9.23 vs 4.80), ํšจ๊ณผ๊ฐ€ ๋” ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ์Œ์„ ๋‚˜ํƒ€๋ƒ„.

์ด๋ก ์  ๋ถ„์„

๋…ผ๋ฆฌ์ธต ์žฌ์‚ฌ์šฉ

์ „ํ†ต์ ์ธ LLM์€ ํ† ํฐ ์ˆ˜์ค€์—์„œ ๋ถ„ํฌ P(wt+1|w1,w2,...,wt)๋ฅผ ํ•™์Šตํ•˜๋ฉฐ, ์ฃผ๋กœ ํ†ต๊ณ„์  ๊ณต์ถœํ˜„ ํŒจํ„ด์„ ํฌ์ฐฉํ•œ๋‹ค. ๋ณธ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋ฐฉ๋ฒ• M=(Q,S)์˜ ํ‘œํ˜„์„ ํ†ตํ•ด ๋…ผ๋ฆฌ์ธต์˜ ์žฌ์‚ฌ์šฉ ๋งคํ•‘์„ ์‹คํ˜„ํ•œ๋‹ค:

R: (Qa, Sa) โ†’ (Qb, Sa)  (15)

ํ•ฉ๋ฆฌ์  ์žฌ์‚ฌ์šฉ

ํ† ํฐ ํ™•๋ฅ  ๊ธฐ๋ฐ˜ ์„ ํƒ๊ณผ ๋‹ฌ๋ฆฌ, ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์€ ๋…ผ๋ฆฌ์  ์ ์šฉ์„ฑ์— ๊ธฐ๋ฐ˜ํ•œ๋‹ค:

Preuse(Ss|Qt) โˆ Simlogic(Qt, Qs) ยท I[Ss valid]  (19)

์žฌ์‚ฌ์šฉ์ด ํ†ต๊ณ„์  ๋นˆ๋„๊ฐ€ ์•„๋‹Œ ๋…ผ๋ฆฌ์  ์ด์ „์„ฑ์— ๊ธฐ๋ฐ˜ํ•จ์„ ๋ณด์žฅํ•œ๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ

LLM ์ถ”๋ก  ์—ฐ๊ตฌ

  • ์‚ฌ๊ณ ์˜ ์—ฐ์‡„ ํ”„๋กฌํ”„ํŒ…: ์ค‘๊ฐ„ ๋‹จ๊ณ„ ์ƒ์„ฑ์„ ํ†ตํ•œ ์ถ”๋ก  ์„ฑ๋Šฅ ๊ฐœ์„ 
  • ์ž๊ธฐ ์ผ๊ด€์„ฑ: ๋‹ค์ค‘ ๊ฒฝ๋กœ ์ƒ˜ํ”Œ๋ง์„ ํ†ตํ•œ ๊ฒฌ๊ณ ์„ฑ ํ–ฅ์ƒ
  • ์‚ฌ๊ณ ์˜ ๋‚˜๋ฌด/๊ทธ๋ž˜ํ”„: ๋” ๋ณต์žกํ•œ ๊ฒ€์ƒ‰ ๊ตฌ์กฐ๋กœ ํ™•์žฅ

๋ฐฉ๋ฒ• ํ‘œํ˜„ ๋ฐ ์žฌ์‚ฌ์šฉ

  • ๊ธฐํ˜ธ AI: ์ง€์‹์„ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ๊ตฌ์„ฑ ์š”์†Œ๋กœ ๋ถ„ํ•ด
  • ํ”„๋กœ๊ทธ๋žจ ํ•ฉ์„ฑ: ์ถ”์ƒ ์—ฐ์‚ฐ์ž ์žฌ์‚ฌ์šฉ์„ ํ†ตํ•œ ์ƒˆ๋กœ์šด ์ž‘์—… ํ•ด๊ฒฐ
  • ์‚ฌ๋ก€ ๊ธฐ๋ฐ˜ ์ถ”๋ก (CBR): ์œ ์ถ”๋ฅผ ํ†ตํ•œ ์ƒˆ๋กœ์šด ๋ฌธ์ œ ํ•ด๊ฒฐ

์ด์ „ ํ•™์Šต ๋ฐ ๋ฉ”ํƒ€ ์ถ”๋ก 

  • ์‚ฌ์ „ ํ›ˆ๋ จ ๋ชจ๋ธ: T5, GPT-4 ๋“ฑ์˜ ์ž‘์—… ์ด์ „ ๋Šฅ๋ ฅ
  • ๊ฒ€์ƒ‰ ๊ฐ•ํ™” ํ”„๋กฌํ”„ํŒ…: ์œ ์‚ฌ ์˜ˆ์ œ ๊ฒ€์ƒ‰์„ ํ†ตํ•œ ์ถ”๋ก  ์œ ๋„
  • ๋ฐ˜์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜: ๋ฐ˜๋ณต์  ์ž๊ธฐ ํ”ผ๋“œ๋ฐฑ์„ ํ†ตํ•œ ๊ฐœ์„ 

๊ฒฐ๋ก  ๋ฐ ๋…ผ์˜

์ฃผ์š” ๊ฒฐ๋ก 

  1. ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ํ”„๋ ˆ์ž„์›Œํฌ๋Š” LLM์˜ ์ ์šฉ ๋ฒ”์œ„๋ฅผ ์„ฑ๊ณต์ ์œผ๋กœ ํ™•์žฅํ•˜์—ฌ ์œ ์‚ฌ์„ฑ์ด ๋‚ฎ์€ ๋ฌธ์ œ๋ฅผ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•จ.
  2. ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ์€ ์ผ๋ฐ˜-ํŠน์ˆ˜ ์˜์กด์„ฑ ์ฒ˜๋ฆฌ ์‹œ ๋” ์•ˆ์ •์ ์œผ๋กœ ์ˆ˜ํ–‰๋˜๋ฉฐ, ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ์€ ์•”๋ฌต์  ์ค‘๋ณต ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ๋ณด์™„ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ œ๊ณตํ•จ.
  3. ๊ตฌ์กฐํ™”๋œ ์งˆ๋ฌธ-ํ•ด๊ฒฐ์ฑ… ๋ถ„๋ฆฌ ์ „๋žต์€ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์˜ ํšจ๊ณผ์„ฑ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ด.

ํ•œ๊ณ„

  1. ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ ํšจ๊ณผ ์ œํ•œ: ๊ด€๊ณ„ํ˜• ์žฌ์‚ฌ์šฉ๊ณผ ๋น„๊ตํ•˜์—ฌ ํŠน์„ฑํ˜• ์žฌ์‚ฌ์šฉ์˜ ๊ฐœ์„  ํญ์ด ์ž‘์Œ.
  2. ๊ฒ€์ฆ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์˜์กด์„ฑ: Valid ํ•จ์ˆ˜์˜ ๊ตฌํ˜„์ด ์žฌ์‚ฌ์šฉ ํšจ๊ณผ์— ์˜ํ–ฅ์„ ๋ฏธ์น  ์ˆ˜ ์žˆ์Œ.
  3. ๊ณ„์‚ฐ ๋ณต์žก๋„: ๋Œ€๊ทœ๋ชจ ํŠน์„ฑ ๊ณต๊ฐ„์˜ ์œ ์‚ฌ๋„ ๊ณ„์‚ฐ์ด ์‹œ๊ฐ„ ์†Œ๋ชจ์ ์ผ ์ˆ˜ ์žˆ์Œ.

ํ–ฅํ›„ ๋ฐฉํ–ฅ

  1. ํŠน์„ฑ ์ถ”์ถœ ๋ฐ ์œ ์‚ฌ๋„ ๊ณ„์‚ฐ ๋ฐฉ๋ฒ• ๊ฐœ์„ 
  2. ๋” ์ง€๋Šฅํ˜• ๊ฒ€์ฆ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ๊ฐœ๋ฐœ
  3. ๋” ๋ณต์žกํ•œ ๋‹ค๋‹จ๊ณ„ ๋ฌธ์ œ ํ•ด๊ฒฐ ์‹œ๋‚˜๋ฆฌ์˜ค๋กœ ํ™•์žฅ

์‹ฌ์ธต ํ‰๊ฐ€

์žฅ์ 

  1. ๋†’์€ ํ˜์‹ ์„ฑ: LLM์—์„œ ๋‚ฎ์€ ์œ ์‚ฌ์„ฑ ๋ฌธ์ œ์˜ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ์„ ์ฒ˜์Œ์œผ๋กœ ์ฒด๊ณ„์ ์œผ๋กœ ํ•ด๊ฒฐ
  2. ๊ฒฌ๊ณ ํ•œ ์ด๋ก ์  ๊ธฐ์ดˆ: ๋‹จ์–ด ์ˆ˜์ค€ ์˜ˆ์ธก์—์„œ ๋…ผ๋ฆฌ์ธต ์ถ”๋ก ์œผ๋กœ์˜ ์ด๋ก ์  ํ”„๋ ˆ์ž„์›Œํฌ ์ œ๊ณต
  3. ํ•ฉ๋ฆฌ์  ์‹คํ—˜ ์„ค๊ณ„: ๊ตฌ์ฒด์  ์‚ฌ๋ก€๋ฅผ ํ†ตํ•œ ๋ฐฉ๋ฒ•์˜ ํšจ๊ณผ์„ฑ ๊ฒ€์ฆ
  4. ๋†’์€ ์‹ค์šฉ ๊ฐ€์น˜: LLM์˜ ์‹ค์ œ ์ ์šฉ์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์‚ฌ๊ณ  ์ œ๊ณต

๋ถ€์กฑํ•œ ์ 

  1. ์ œํ•œ๋œ ์‹คํ—˜ ๊ทœ๋ชจ: ๋‘ ๊ฐ€์ง€ ํŠน์ • ์‹œ๋‚˜๋ฆฌ์˜ค์—์„œ๋งŒ ๊ฒ€์ฆ๋˜์—ˆ์œผ๋ฉฐ, ๋Œ€๊ทœ๋ชจ ์‹คํ—˜ ๋ถ€์กฑ
  2. ๋ชจํ˜ธํ•œ ํŠน์„ฑ ์ •์˜: ํŠน์„ฑ ๊ณต๊ฐ„ ๊ตฌ์ถ•์— ๋Œ€ํ•œ ์ฒด๊ณ„์  ์ง€์นจ ๋ถ€์กฑ
  3. ๊ณ„์‚ฐ ํšจ์œจ์„ฑ ๋ฏธํ‰๊ฐ€: ๋ฐฉ๋ฒ•์˜ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ ๋ฐ ํ™•์žฅ์„ฑ ๋ถ„์„ ๋ฏธ์‹ค์‹œ
  4. ๋‹จ์ผ ๋น„๊ต ๋ฐฉ๋ฒ•: ๋‹ค๋ฅธ ์„ ์ง„ ๋ฐฉ๋ฒ•๊ณผ์˜ ๋น„๊ต ๋ถ€์กฑ

์˜ํ–ฅ๋ ฅ

  1. ์ด๋ก ์  ๊ธฐ์—ฌ: LLM ์ถ”๋ก  ๋Šฅ๋ ฅ ํ–ฅ์ƒ์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์ด๋ก ์  ๊ด€์  ์ œ๊ณต
  2. ์‹ค๋ฌด ๊ฐ€์น˜: ๊ต์ฐจ ์˜์—ญ ์ง€์‹ ์ด์ „์ด ํ•„์š”ํ•œ ์‹ค์ œ ์‹œ๋‚˜๋ฆฌ์˜ค์— ์ ์šฉ ๊ฐ€๋Šฅ
  3. ์˜๊ฐ ์ œ๊ณต: ํ›„์† ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ ๊ฐ€์น˜ ์žˆ๋Š” ๋ฐฉํ–ฅ ์ œ์‹œ

์ ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค

  1. ์ง€์‹ ์ด์ „: ํ•œ ์˜์—ญ์˜ ํ•ด๊ฒฐ์ฑ…์„ ๋‹ค๋ฅธ ์˜์—ญ์— ์ ์šฉํ•ด์•ผ ํ•˜๋Š” ๊ฒฝ์šฐ
  2. ์ฐฝ์˜์  ๋ฌธ์ œ ํ•ด๊ฒฐ: ์ƒˆ๋กœ์šด ๋ฌธ์ œ์— ์ง๋ฉดํ–ˆ์„ ๋•Œ ์œ ์ถ” ํ•ด๊ฒฐ์ฑ… ์ฐพ๊ธฐ
  3. ๊ต์œก ๋ณด์กฐ: ํ•™์Šต์ž๊ฐ€ ์„œ๋กœ ๋‹ค๋ฅธ ๋ฌธ์ œ ๊ฐ„์˜ ๋‚ด์žฌ์  ์—ฐ๊ด€์„ฑ์„ ์ดํ•ดํ•˜๋„๋ก ์ง€์›
  4. ์ „๋ฌธ๊ฐ€ ์‹œ์Šคํ…œ: ๊ธฐ์กด ์ง€์‹์„ ์œ ์—ฐํ•˜๊ฒŒ ์ ์šฉํ•  ์ˆ˜ ์žˆ๋Š” ์ง€๋Šฅํ˜• ์‹œ์Šคํ…œ ๊ตฌ์ถ•

์ฐธ๊ณ ๋ฌธํ—Œ

  1. Wei, J. et al. "Chain-of-thought prompting elicits reasoning in large language models." NeurIPS 2022.
  2. Wang, X. et al. "Self-consistency improves chain of thought reasoning in language models." arXiv 2022.
  3. Yao, S. et al. "Tree of thoughts: Deliberate problem solving with large language models." NeurIPS 2023.
  4. Su, H. "Method-based reasoning for large language models: Extraction, reuse, and continuous improvement." arXiv 2025.

์ข…ํ•ฉ ํ‰๊ฐ€: ๋ณธ ๋…ผ๋ฌธ์€ ํ˜์‹ ์ ์ธ ๊ต์ฐจ ์งˆ๋ฌธ ๋ฐฉ๋ฒ• ์žฌ์‚ฌ์šฉ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜์—ฌ ๋‚ฎ์€ ์œ ์‚ฌ์„ฑ ์‹œ๋‚˜๋ฆฌ์˜ค์—์„œ LLM์˜ ์ ์šฉ ๋Šฅ๋ ฅ์„ ์„ฑ๊ณต์ ์œผ๋กœ ํ™•์žฅํ–ˆ๋‹ค. ์‹คํ—˜ ๊ทœ๋ชจ ๋ฐ ์ผ๋ถ€ ๊ธฐ์ˆ ์  ์„ธ๋ถ€์‚ฌํ•ญ์—์„œ ๊ฐœ์„  ์—ฌ์ง€๊ฐ€ ์žˆ์ง€๋งŒ, ๊ทธ ์ด๋ก ์  ๊ธฐ์—ฌ์™€ ์‹ค์šฉ์  ๊ฐ€์น˜๋Š” LLM ์ถ”๋ก  ์—ฐ๊ตฌ ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์ž‘์—…์œผ๋กœ ๋งŒ๋“ ๋‹ค.