2025-11-22T20:07:15.604385

Semantic-Condition Tuning: Fusing Graph Context with Large Language Models for Knowledge Graph Completion

Liu, Wen, Sun et al.
Fusing Knowledge Graphs with Large Language Models is crucial for knowledge-intensive tasks like knowledge graph completion. The prevailing paradigm, prefix-tuning, simply concatenates knowledge embeddings with text inputs. However, this shallow fusion overlooks the rich relational semantics within KGs and imposes a significant implicit reasoning burden on the LLM to correlate the prefix with the text. To address these, we propose Semantic-condition Tuning (SCT), a new knowledge injection paradigm comprising two key modules. First, a Semantic Graph Module employs a Graph Neural Network to extract a context-aware semantic condition from the local graph neighborhood, guided by knowledge-enhanced relations. Subsequently, this condition is passed to a Condition-Adaptive Fusion Module, which, in turn, adaptively modulates the textual embedding via two parameterized projectors, enabling a deep, feature-wise, and knowledge-aware interaction. The resulting pre-fused embedding is then fed into the LLM for fine-tuning. Extensive experiments on knowledge graph benchmarks demonstrate that SCT significantly outperforms prefix-tuning and other strong baselines. Our analysis confirms that by modulating the input representation with semantic graph context before LLM inference, SCT provides a more direct and potent signal, enabling more accurate and robust knowledge reasoning.
academic

์˜๋ฏธ๋ก ์  ์กฐ๊ฑด ํŠœ๋‹: ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์™„์„ฑ์„ ์œ„ํ•œ ๊ทธ๋ž˜ํ”„ ์ปจํ…์ŠคํŠธ์™€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์˜ ์œตํ•ฉ

๊ธฐ๋ณธ ์ •๋ณด

  • ๋…ผ๋ฌธ ID: 2510.08966
  • ์ œ๋ชฉ: Semantic-Condition Tuning: Fusing Graph Context with Large Language Models for Knowledge Graph Completion
  • ์ €์ž: Ruitong Liu, Yan Wen, Te Sun, Yunjia Wu, Pingyang Huang, Zihang Yu, Siyuan Li
  • ๋ถ„๋ฅ˜: cs.AI cs.CL
  • ๋ฐœํ‘œ ์‹œ๊ฐ„/ํ•™ํšŒ: The ACM Web Conference, 2026๋…„ 4์›” 13-17์ผ, ๋‘๋ฐ”์ด, UAE
  • ๋…ผ๋ฌธ ๋งํฌ: https://arxiv.org/abs/2510.08966

์ดˆ๋ก

๋ณธ ๋…ผ๋ฌธ์€ ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์™„์„ฑ ์ž‘์—…์—์„œ ์ง€์‹ ๊ทธ๋ž˜ํ”„์™€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM) ์œตํ•ฉ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์ง€์‹ ์ฃผ์ž… ํŒจ๋Ÿฌ๋‹ค์ž„์ธ ์˜๋ฏธ๋ก ์  ์กฐ๊ฑด ํŠœ๋‹(Semantic-Condition Tuning, SCT)์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ์ ‘๋‘์‚ฌ ํŠœ๋‹ ๋ฐฉ๋ฒ•์€ ์ง€์‹ ์ž„๋ฒ ๋”ฉ๊ณผ ํ…์ŠคํŠธ ์ž…๋ ฅ์„ ๋‹จ์ˆœํžˆ ์—ฐ๊ฒฐํ•˜๋Š”๋ฐ, ์ด๋Ÿฌํ•œ ์–•์€ ์ˆ˜์ค€์˜ ์œตํ•ฉ์€ ์ง€์‹ ๊ทธ๋ž˜ํ”„์˜ ํ’๋ถ€ํ•œ ๊ด€๊ณ„ ์˜๋ฏธ๋ก ์„ ๋ฌด์‹œํ•˜๊ณ  LLM์— ๋ฌด๊ฑฐ์šด ์•”๋ฌต์  ์ถ”๋ก  ๋ถ€๋‹ด์„ ์ค๋‹ˆ๋‹ค. SCT๋Š” ๋‘ ๊ฐ€์ง€ ํ•ต์‹ฌ ๋ชจ๋“ˆ์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค: ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ์€ ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ตญ์†Œ ๊ทธ๋ž˜ํ”„ ์ด์›ƒ์—์„œ ์ปจํ…์ŠคํŠธ ์ธ์‹ ์˜๋ฏธ๋ก ์  ์กฐ๊ฑด์„ ์ถ”์ถœํ•˜๊ณ , ์กฐ๊ฑด ์ ์‘ํ˜• ์œตํ•ฉ ๋ชจ๋“ˆ์€ ๋‘ ๊ฐœ์˜ ๋งค๊ฐœ๋ณ€์ˆ˜ํ™”๋œ ํ”„๋กœ์ ํ„ฐ๋ฅผ ํ†ตํ•ด ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์„ ์ ์‘์ ์œผ๋กœ ์กฐ์ ˆํ•˜์—ฌ ๊นŠ์€ ์ˆ˜์ค€์˜ ํŠน์„ฑ ๊ธฐ๋ฐ˜ ์ง€์‹ ์ธ์‹ ์ƒํ˜ธ์ž‘์šฉ์„ ์‹คํ˜„ํ•ฉ๋‹ˆ๋‹ค.

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋™๊ธฐ

ํ•ต์‹ฌ ๋ฌธ์ œ

  1. ์ง€์‹ ๊ทธ๋ž˜ํ”„์˜ ๋ถˆ์™„์ „์„ฑ: ํ˜„์‹ค์˜ ์ง€์‹ ๊ทธ๋ž˜ํ”„๋Š” ๋ณธ์งˆ์ ์œผ๋กœ ๋ถˆ์™„์ „ํ•˜์—ฌ ํ•˜์œ„ ์‘์šฉ ํ”„๋กœ๊ทธ๋žจ์—์„œ์˜ ์œ ์šฉ์„ฑ์„ ์ œํ•œํ•ฉ๋‹ˆ๋‹ค
  2. ์–•์€ ์ˆ˜์ค€ ์œตํ•ฉ์˜ ํ•œ๊ณ„: ๊ธฐ์กด์˜ ์ ‘๋‘์‚ฌ ํŠœ๋‹ ๋ฐฉ๋ฒ•์€ ๋‹จ์ˆœํ•œ ์—ฐ๊ฒฐ ์ž‘์—…๋งŒ ์ˆ˜ํ–‰ํ•˜์—ฌ ์ง€์‹ ๊ทธ๋ž˜ํ”„์˜ ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ์ถฉ๋ถ„ํžˆ ํ™œ์šฉํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค
  3. ๊ด€๊ณ„ ์˜๋ฏธ๋ก ์˜ ๋™์ ์„ฑ: ๊ด€๊ณ„์˜ ์˜๋ฏธ๋Š” ์ฃผ๋ณ€์˜ ์˜๋ฏธ๋ก ์  ์ปจํ…์ŠคํŠธ์— ๋”ฐ๋ผ ๋™์ ์œผ๋กœ ๋ณ€ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋ฆผ 1์— ํ‘œ์‹œ๋œ "treats" ๊ด€๊ณ„๋Š” ์„œ๋กœ ๋‹ค๋ฅธ ์ปจํ…์ŠคํŠธ์—์„œ ๋‹ค์–‘ํ•œ ์น˜๋ฃŒ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค

์—ฐ๊ตฌ์˜ ์ค‘์š”์„ฑ

  • ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์™„์„ฑ์€ ์ถ”์ฒœ ์‹œ์Šคํ…œ, ์ •๋ณด ์ถ”์ถœ, ์งˆ์˜์‘๋‹ต ์‹œ์Šคํ…œ ๋“ฑ์˜ ์‘์šฉ์— ๋งค์šฐ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค
  • LLM์€ ๊นŠ์ด ์žˆ๊ณ  ์ •ํ™•ํ•œ ์‚ฌ์‹ค ์ง€์‹์ด ๋ถ€์กฑํ•˜์—ฌ ํ™˜๊ฐ ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒํ•˜๊ธฐ ์‰ฝ์Šต๋‹ˆ๋‹ค
  • ์ง€์‹ ๊ทธ๋ž˜ํ”„์˜ ๋ช…์‹œ์  ๊ตฌ์กฐํ™”๋œ ์ง€์‹๊ณผ LLM์˜ ์•”๋ฌต์  ๋งค๊ฐœ๋ณ€์ˆ˜ํ™”๋œ ์ง€์‹์„ ํšจ๊ณผ์ ์œผ๋กœ ์œตํ•ฉํ•  ํ•„์š”๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค

๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„

  1. ์ ‘๋‘์‚ฌ ํŠœ๋‹์˜ ์–•์€ ํŠน์„ฑ: ๋‹จ์ˆœํ•œ ์—ฐ๊ฒฐ ์ž‘์—…์œผ๋กœ๋Š” ๊นŠ์€ ์ˆ˜์ค€์˜ ํ†ตํ•ฉ์„ ์‹คํ˜„ํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค
  2. ๊ด€๊ณ„ ์˜๋ฏธ๋ก  ๋ฌด์‹œ: ์ง€์‹ ๊ทธ๋ž˜ํ”„์˜ ํ’๋ถ€ํ•œ ๊ด€๊ณ„ ์˜๋ฏธ๋ก ์„ ํฌ์ฐฉํ•˜์ง€ ๋ชปํ•ฉ๋‹ˆ๋‹ค
  3. ์ถ”๋ก  ๋ถ€๋‹ด: LLM์— ์ ‘๋‘์‚ฌ์™€ ํ…์ŠคํŠธ๋ฅผ ์—ฐ๊ด€์‹œํ‚ค๊ธฐ ์œ„ํ•œ ๋ฌด๊ฑฐ์šด ์•”๋ฌต์  ์ถ”๋ก  ๋ถ€๋‹ด์„ ์ค๋‹ˆ๋‹ค

ํ•ต์‹ฌ ๊ธฐ์—ฌ

  1. SCT ํ”„๋ ˆ์ž„์›Œํฌ ์ œ์•ˆ: ์ปจํ…์ŠคํŠธ ์ธ์‹๊ณผ ์ ์‘ํ˜• ์ž„๋ฒ ๋”ฉ ์œตํ•ฉ์„ ํ†ตํ•ฉํ•œ ์ตœ์ดˆ์˜ ์˜๋ฏธ๋ก ์  ์กฐ๊ฑด ํŠœ๋‹ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, ๊ธฐ์กด์˜ ๋‹จ์ˆœํ•œ ์ ‘๋‘์‚ฌ ํŠœ๋‹ ์—ฐ๊ฒฐ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•ฉ๋‹ˆ๋‹ค
  2. ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ: ์ง€์‹ ๊ฐ•ํ™” ๊ด€๊ณ„ ์„ค๋ช…์˜ ๋ช…์‹œ์  ์˜๋ฏธ๋ก ์  ์œ ์‚ฌ๋„ ์ ์ˆ˜๋กœ ์ด์›ƒ ์„ ํƒ์ด ์•ˆ๋‚ด๋˜๋Š” ์ƒˆ๋กœ์šด ๊ด€๊ณ„ ์ค‘์‹ฌ ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค
  3. ์กฐ๊ฑด ์ ์‘ํ˜• ์œตํ•ฉ ๋ชจ๋“ˆ: ์˜๋ฏธ๋ก ์  ์กฐ๊ฑด์„ ์‚ฌ์šฉํ•˜์—ฌ ์ž…๋ ฅ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์˜ ์ง์ ‘ ํŠน์„ฑ ์ˆ˜์ค€ ์•„ํ•€ ๋ณ€ํ™˜์„ ํ•™์Šตํ•˜๋Š” ์œตํ•ฉ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋„์ž…ํ•˜์—ฌ ๊ทธ๋ž˜ํ”„ ์ปจํ…์ŠคํŠธ์˜ ๊นŠ์€ ํ˜‘๋ ฅ์  ํ†ตํ•ฉ์„ ์‹คํ˜„ํ•ฉ๋‹ˆ๋‹ค
  4. ์„ฑ๋Šฅ ๊ฒ€์ฆ: ์—ฌ๋Ÿฌ ๋ฒค์น˜๋งˆํฌ์—์„œ SCT์˜ ์ตœ์ฒจ๋‹จ ์„ฑ๋Šฅ๊ณผ ๋†’์€ ๋งค๊ฐœ๋ณ€์ˆ˜ ํšจ์œจ์„ฑ์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค

๋ฐฉ๋ฒ•๋ก  ์ƒ์„ธ ์„ค๋ช…

์ž‘์—… ์ •์˜

์ง€์‹ ๊ทธ๋ž˜ํ”„ G๋Š” ์‚ผ์ค‘ํ•ญ ์ง‘ํ•ฉ T = {(h, r, t) | h, t โˆˆ E, r โˆˆ R}์œผ๋กœ ์ •์˜๋˜๋ฉฐ, ์—ฌ๊ธฐ์„œ E์™€ R์€ ๊ฐ๊ฐ ์‹ค์ฒด์™€ ๊ด€๊ณ„ ์ง‘ํ•ฉ์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์™„์„ฑ ์ž‘์—…์€ ์ฃผ์–ด์ง„ ์‚ผ์ค‘ํ•ญ์—์„œ ๋ˆ„๋ฝ๋œ ์š”์†Œ๋ฅผ ์ถ”๋ก ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ์ฟผ๋ฆฌ(h, r, ?)์—์„œ ๊ผฌ๋ฆฌ ์‹ค์ฒด t๋ฅผ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค. LLM ๊ธฐ๋ฐ˜ KGC์—์„œ ์ด ์ž‘์—…์€ ํ…์ŠคํŠธ ์ƒ์„ฑ ๋ฌธ์ œ๋กœ ํ˜•์‹ํ™”๋ฉ๋‹ˆ๋‹ค.

๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜

1. ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ (Semantic Graph Module)

์ง€์‹ ๊ฐ•ํ™”:

  • ๊ฐ•๋ ฅํ•œ LLM(GPT-4O)์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ฐ ๊ด€๊ณ„ ์œ ํ˜•์— ๋Œ€ํ•œ ๊ทœ๋ฒ”์  ํ…์ŠคํŠธ ์„ค๋ช… ์ƒ์„ฑ
  • ์‚ฌ์ „ ํ•™์Šต๋œ ๋ฌธ์žฅ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ(Sentence-BERT)์„ ์‚ฌ์šฉํ•˜์—ฌ ์„ค๋ช…์„ ์˜๋ฏธ๋ก ์  ๋ฒกํ„ฐ๋กœ ์ธ์ฝ”๋”ฉ

๊ด€๊ณ„ ์ค‘์‹ฌ ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ:

  • KG์˜ ๊ด€๊ณ„ ๊ตฌ์กฐ๋ฅผ ์ฃผ์š” ๊ณ„์‚ฐ ๊ทธ๋ž˜ํ”„๋กœ ์‚ฌ์šฉ
  • ๊ฐ„์„ (๊ด€๊ณ„)์€ ์ธ์ ‘ ๊ฐ„์„ ์˜ ์ •๋ณด๋ฅผ ์ง‘๊ณ„ํ•˜์—ฌ ์ƒํƒœ๋ฅผ ์—…๋ฐ์ดํŠธ
  • Top-K ์„ ํƒ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์‚ฌ์šฉํ•˜์—ฌ ์˜๋ฏธ๋ก ์ ์œผ๋กœ ๊ฐ€์žฅ ๊ด€๋ จ์„ฑ ๋†’์€ ์ด์›ƒ์„ ํ•„ํ„ฐ๋ง:
Score(ec, en) = (sc ยท sn) / (||sc||2 ||sn||2)

Transformer ๊ณ„์ธต ์—…๋ฐ์ดํŠธ:

s^(l+1)_c = TransformerLayer(s^l_c, sฬ„_N_K(ec))

์˜๋ฏธ๋ก ์  ์กฐ๊ฑด ์ƒ์„ฑ:

cS = MeanPool({s^L_h,i}_i โˆช {s^L_t,j}_j)

2. ์กฐ๊ฑด ์ ์‘ํ˜• ์œตํ•ฉ ๋ชจ๋“ˆ (Condition-Adaptive Fusion Module)

ํŠน์„ฑ๋ณ„ ์„ ํ˜• ์กฐ์ ˆ(Feature-wise Linear Modulation, FiLM) ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์‚ฌ์šฉ:

X' = X โŠ™ ฮณ + ฮฒ
ฮณ = ฯƒ(MLP1(cS))
ฮฒ = MLP2(cS)

์—ฌ๊ธฐ์„œ ฮณ๋Š” ์Šค์ผ€์ผ ๋ฒกํ„ฐ, ฮฒ๋Š” ์˜คํ”„์…‹ ๋ฒกํ„ฐ์ด๋ฉฐ, ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์˜ ํŠน์„ฑ ์ˆ˜์ค€ ์•„ํ•€ ๋ณ€ํ™˜์„ ์‹คํ˜„ํ•ฉ๋‹ˆ๋‹ค.

๊ธฐ์ˆ  ํ˜์‹ ์ 

  1. ๊นŠ์€ ์ˆ˜์ค€ ์œตํ•ฉ vs ์–•์€ ์ˆ˜์ค€ ์—ฐ๊ฒฐ: ๋‹จ์ˆœํ•œ ์ ‘๋‘์‚ฌ ์—ฐ๊ฒฐ๊ณผ ๋‹ฌ๋ฆฌ SCT๋Š” ํŠน์„ฑ ์ˆ˜์ค€์˜ ๊นŠ์€ ์ƒํ˜ธ์ž‘์šฉ์„ ์‹คํ˜„ํ•ฉ๋‹ˆ๋‹ค
  2. ์˜๋ฏธ๋ก  ๊ธฐ๋ฐ˜ ์ด์›ƒ ์„ ํƒ: ์ž‘์—… ํŠน์ • ํ•™์Šต ํ‘œํ˜„์ด ์•„๋‹Œ LLM ๊ฐ•ํ™” ๊ด€๊ณ„ ์„ค๋ช…์„ ์‚ฌ์šฉํ•œ ์˜๋ฏธ๋ก ์  ์œ ์‚ฌ๋„ ๊ณ„์‚ฐ
  3. ๊ด€๊ณ„ ์ค‘์‹ฌ ๊ทธ๋ž˜ํ”„ ์ฒ˜๋ฆฌ: ์‹ค์ฒด๊ฐ€ ์•„๋‹Œ ๊ด€๊ณ„์— ์ดˆ์ ์„ ๋งž์ถฐ ๋” ํšจ์œจ์ ์ด๊ณ  ์˜๋ฏธ๋ก ์ ์œผ๋กœ ์ง€์‹œ์ ์ž…๋‹ˆ๋‹ค

์‹คํ—˜ ์„ค์ •

๋ฐ์ดํ„ฐ์…‹

๋งํฌ ์˜ˆ์ธก:

  • WN18RR: 40,943๊ฐœ ์‹ค์ฒด, 11๊ฐœ ๊ด€๊ณ„, 86,835๊ฐœ ํ•™์Šต ์‚ผ์ค‘ํ•ญ
  • FB15k-237: 14,541๊ฐœ ์‹ค์ฒด, 237๊ฐœ ๊ด€๊ณ„, 272,115๊ฐœ ํ•™์Šต ์‚ผ์ค‘ํ•ญ

์‚ผ์ค‘ํ•ญ ๋ถ„๋ฅ˜:

  • UMLS: 135๊ฐœ ์‹ค์ฒด, 46๊ฐœ ๊ด€๊ณ„
  • CoDeX-S: 2,034๊ฐœ ์‹ค์ฒด, 42๊ฐœ ๊ด€๊ณ„
  • FB15k-237N: 13,104๊ฐœ ์‹ค์ฒด, 93๊ฐœ ๊ด€๊ณ„

ํ‰๊ฐ€ ์ง€ํ‘œ

  • ๋งํฌ ์˜ˆ์ธก: ํ‰๊ท  ์—ญ์ˆœ์œ„(Mean Reciprocal Rank, MRR)์™€ Hits@N
  • ์‚ผ์ค‘ํ•ญ ๋ถ„๋ฅ˜: ์ •ํ™•๋„(Accuracy, Acc), ์ •๋ฐ€๋„(Precision, P), ์žฌํ˜„์œจ(Recall, R), F1-์ ์ˆ˜

๋น„๊ต ๋ฐฉ๋ฒ•

์ž„๋ฒ ๋”ฉ ๋ฐฉ๋ฒ•: TransE, CompGCN, AdaProp, MA-GNN ๋“ฑ LLM ๋ฐฉ๋ฒ•: KICGPT, KG-FIT, MKGL, SSQR-LLaMA2, KoPA ๋“ฑ

๊ตฌํ˜„ ์„ธ๋ถ€์‚ฌํ•ญ

  • Alpaca-7B ๊ธฐ๋ฐ˜ ๊ตฌํ˜„
  • ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ: 2๊ณ„์ธต Transformer, Top-K=10
  • LoRA(rank=64)๋ฅผ ์‚ฌ์šฉํ•œ LLM ๋ฏธ์„ธ ์กฐ์ •
  • AdamW ์ตœ์ ํ™”๊ธฐ, ๋ฐฐ์น˜ ํฌ๊ธฐ 12
  • 2๋‹จ๊ณ„ ํ•™์Šต ์ „๋žต

์‹คํ—˜ ๊ฒฐ๊ณผ

์ฃผ์š” ๊ฒฐ๊ณผ

๋งํฌ ์˜ˆ์ธก ์„ฑ๋Šฅ:

  • WN18RR ๋ฐ์ดํ„ฐ์…‹: ์ตœ๊ฐ• ๊ธฐ์ค€์„  SSQR-LLaMA2 ๋Œ€๋น„ MRR 2.2% ํ–ฅ์ƒ, Hits@1 2.4% ํ–ฅ์ƒ, Hits@3 2.6% ํ–ฅ์ƒ
  • FB15k-237 ๋ฐ์ดํ„ฐ์…‹: MRR 4.9% ๋Œ€ํญ ํ–ฅ์ƒ, Hits@1 1.6% ํ–ฅ์ƒ, Hits@10 4.4% ํ–ฅ์ƒ

์‚ผ์ค‘ํ•ญ ๋ถ„๋ฅ˜ ์„ฑ๋Šฅ:

  • UMLS ๋ฐ์ดํ„ฐ์…‹: ์ •ํ™•๋„ 93.15%, F1 ์ ์ˆ˜ 93.18%, ์ตœ๊ณ  ์„ฑ๋Šฅ ๋‹ฌ์„ฑ
  • FB15k-237N ๋ฐ์ดํ„ฐ์…‹: ์ •ํ™•๋„ 78.02%, ์ •๋ฐ€๋„ 71.10%, F1 ์ ์ˆ˜ 80.93%, ๋ชจ๋‘ ์ตœ๊ณ  ์„ฑ๋Šฅ
  • CoDeX-S ๋ฐ์ดํ„ฐ์…‹: ์ •๋ฐ€๋„ 78.52% ์ตœ๊ณ , ๊ธฐํƒ€ ์ง€ํ‘œ๋Š” ๊ฐ•๋ ฅํ•œ ๊ธฐ์ค€์„ ๊ณผ ๋™๋“ฑ

์†Œ๊ฑฐ ์‹คํ—˜

๊ตฌ์„ฑ ์š”์†Œ ์œ ํšจ์„ฑ ๊ฒ€์ฆ:

  1. ์˜๋ฏธ๋ก  ์ œ๊ฑฐ (w/o Semantics): ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ์„ ์ œ๊ฑฐํ•˜๊ณ  ๊ธฐ์กด KGE๋กœ ๋Œ€์ฒด
    • FB15k-237์—์„œ MRR์ด 0.471์—์„œ 0.433์œผ๋กœ ๊ฐ์†Œ, Hits@1์ด 0.380์—์„œ 0.327๋กœ ๊ฐ์†Œ
  2. ์œตํ•ฉ ์ œ๊ฑฐ (w/o Fusion): ์กฐ๊ฑด ์ ์‘ํ˜• ์œตํ•ฉ ๋ชจ๋“ˆ์„ ์ œ๊ฑฐํ•˜๊ณ  ์ ‘๋‘์‚ฌ ํŠœ๋‹์œผ๋กœ ๋ณ€๊ฒฝ
    • ์„ฑ๋Šฅ ์ €ํ•˜๊ฐ€ ๊ฐ€์žฅ ์‹ฌ๊ฐํ•˜๋ฉฐ, MRR๊ณผ Hits@1์ด ๊ฐ๊ฐ 0.062์™€ 0.081 ๊ฐ์†Œ

์ ์ˆ˜ ํ•จ์ˆ˜ ๋น„๊ต:

  • RotatE ์Šคํƒ€์ผ ํ•จ์ˆ˜๊ฐ€ ์ตœ๊ณ  ์„ฑ๋Šฅ ๋‹ฌ์„ฑ, MRR 0.471
  • ๋‹จ์ˆœํ•œ DistMult์™€ MLP๋Š” ์„ฑ๋Šฅ ๋ช…๋ฐฑํžˆ ์ €ํ•˜

์‚ฌ๋ก€ ๋ถ„์„

์˜๋ฏธ๋ก  ๊ฐ•ํ™” ํšจ๊ณผ: ์ฟผ๋ฆฌ(Barack Obama, /government/politician/government_positions_held..., ?)์˜ ์˜ˆ:

  • ์ง€์‹ ๊ฐ•ํ™” ์—†์Œ: ์–ดํœ˜ ์ค‘๋ณต์„ ๊ธฐ๋ฐ˜์œผ๋กœ Gov Position (Title) ๋“ฑ์ด ์ƒ์œ„ ์ˆœ์œ„
  • ์ง€์‹ ๊ฐ•ํ™” ์žˆ์Œ: Person (Nationality) ๋“ฑ ์˜๋ฏธ๋ก ์ ์œผ๋กœ ๊ด€๋ จ๋œ ๊ฐœ๋…์˜ ์ˆœ์œ„ ํ–ฅ์ƒ, ์–•์€ ํ…์ŠคํŠธ ๋งค์นญ์—์„œ ์ง„์ •ํ•œ ์˜๋ฏธ๋ก ์  ๊ด€๋ จ์„ฑ์œผ๋กœ์˜ ์ „ํ™˜ ์ฒดํ˜„

ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋ฏผ๊ฐ๋„: Top-K ๋งค๊ฐœ๋ณ€์ˆ˜๋Š” K=10์ผ ๋•Œ ์ตœ๊ณ  ์„ฑ๋Šฅ ๋‹ฌ์„ฑ(MRR=0.471, Hit@1=0.380), ๋„ˆ๋ฌด ์ž‘์Œ(K=4)์€ ์ •๋ณด ๋ถ€์กฑ, ๋„ˆ๋ฌด ํผ(K=32)์€ ๋…ธ์ด์ฆˆ ๋„์ž….

๊ด€๋ จ ์—ฐ๊ตฌ

์ง€์‹ ๊ทธ๋ž˜ํ”„ ์™„์„ฑ

  1. ์ž„๋ฒ ๋”ฉ ๋ฐฉ๋ฒ•: TransE, ComplEx ๋“ฑ์˜ ๊ธฐํ•˜ํ•™์  ๋ชจ๋ธ์—์„œ RotE, HAKE ๋“ฑ ๋” ๋ณต์žกํ•œ ๊ธฐํ•˜ํ•™์  ๊ณต๊ฐ„ ๋ฐฉ๋ฒ•์œผ๋กœ ๋ฐœ์ „
  2. GNN ๋ฐฉ๋ฒ•: PathCon, CBLiP ๋“ฑ์€ ๋‹ค์ค‘ ํ™‰ ๊ฒฝ๋กœ ์ •๋ณด๋ฅผ ์ง‘๊ณ„ํ•˜์ง€๋งŒ ์—ฌ์ „ํžˆ ์ •์  ํ‘œํ˜„ ๊ธฐ๋ฐ˜
  3. LLM ๋ฐฉ๋ฒ•: KG-BERT, SimKGC ๋“ฑ์€ ์‚ผ์ค‘ํ•ญ์„ ํ…์ŠคํŠธ ์‹œํ€€์Šค๋กœ ๋ณ€ํ™˜ํ•˜์ง€๋งŒ ์ƒํ˜ธ์ž‘์šฉ์€ ํ‘œ๋ฉด ์ˆ˜์ค€

LLM๊ณผ ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์œตํ•ฉ

๋‘ ๊ฐ€์ง€ ์ฃผ์š” ๋ฐฉํ–ฅ:

  1. KG๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ LLM์— ์‚ฌ์‹ค ๊ธฐ๋ฐ˜ ์ œ๊ณต, ํ™˜๊ฐ ๊ฐ์†Œ
  2. LLM์˜ ์ƒ์„ฑ ๋ฐ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ํ™œ์šฉํ•˜์—ฌ KG ๊ด€๋ จ ์ž‘์—… ํ•ด๊ฒฐ

๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ๊ณตํ†ต ํ•œ๊ณ„: ์ง€์‹ ๊ทธ๋ž˜ํ”„์™€์˜ ์ƒํ˜ธ์ž‘์šฉ์€ ์ข…์ข… ํ…์ŠคํŠธ ๋˜๋Š” ํ‘œ๋ฉด ์ˆ˜์ค€์— ๋จธ๋ฌผ๋Ÿฌ ์žˆ์Šต๋‹ˆ๋‹ค.

๊ฒฐ๋ก  ๋ฐ ๋…ผ์˜

์ฃผ์š” ๊ฒฐ๋ก 

  1. SCT๋Š” ๊นŠ์€ ์ˆ˜์ค€์˜ ํŠน์„ฑ ๊ธฐ๋ฐ˜ ์œตํ•ฉ์„ ํ†ตํ•ด ์–•์€ ์ˆ˜์ค€์˜ ์ ‘๋‘์‚ฌ ํŠœ๋‹ ๋ฐฉ๋ฒ•์„ ํฌ๊ฒŒ ๋Šฅ๊ฐ€ํ•ฉ๋‹ˆ๋‹ค
  2. ์˜๋ฏธ๋ก ์  ๊ทธ๋ž˜ํ”„ ๋ชจ๋“ˆ์€ ์ปจํ…์ŠคํŠธ ์ธ์‹ ๊ด€๊ณ„ ์˜๋ฏธ๋ก ์„ ํšจ๊ณผ์ ์œผ๋กœ ํฌ์ฐฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค
  3. ์กฐ๊ฑด ์ ์‘ํ˜• ์œตํ•ฉ ๋ชจ๋“ˆ์€ ์ง€์‹๊ณผ ํ…์ŠคํŠธ์˜ ๊นŠ์€ ์ˆ˜์ค€ ํ˜‘๋ ฅ์  ํ†ตํ•ฉ์„ ์‹คํ˜„ํ•ฉ๋‹ˆ๋‹ค
  4. ์—ฌ๋Ÿฌ ๋ฒค์น˜๋งˆํฌ์—์„œ ์ตœ์ฒจ๋‹จ ๋˜๋Š” ๋†’์€ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค

ํ•œ๊ณ„

  1. ์ œํ•œ๋œ ์ถ”๋ก  ๊นŠ์ด: ํ˜„์žฌ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ์ถ”๋ก  ๊นŠ์ด๋Š” ์—ฌ์ „ํžˆ ์ œํ•œ์ ์ž…๋‹ˆ๋‹ค
  2. ๋™์  ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์ ์‘์„ฑ ๋ถ€์กฑ: ๋™์ ์œผ๋กœ ๋ณ€ํ•˜๋Š” ์ง€์‹ ๊ทธ๋ž˜ํ”„์— ๋Œ€ํ•œ ์ ์‘์„ฑ ๊ฐœ์„  ํ•„์š”
  3. ๊ณ„์‚ฐ ๋ณต์žก๋„: 2๋‹จ๊ณ„ ํ•™์Šต ๋ฐ ๋ณต์žกํ•œ ์œตํ•ฉ ๋ฉ”์ปค๋‹ˆ์ฆ˜์ด ๊ณ„์‚ฐ ๋น„์šฉ์„ ์ฆ๊ฐ€์‹œํ‚ต๋‹ˆ๋‹ค

ํ–ฅํ›„ ๋ฐฉํ–ฅ

  1. ๊ณ„์ธต์  ์˜๋ฏธ๋ก ์  ์กฐ๊ฑด ์ƒ์„ฑ: ๊ณ„์ธต์  ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋„์ž…ํ•˜์—ฌ ์ถ”๋ก  ๊นŠ์ด ๊ฐ•ํ™”
  2. ์‹œ๊ฐ„ ์ธ์‹: ๋™์  ์ง€์‹์„ ์ฒ˜๋ฆฌํ•˜๊ธฐ ์œ„ํ•œ ์‹œ๊ฐ„ ์ธ์‹ ๋Šฅ๋ ฅ ํ†ตํ•ฉ
  3. ์‘์šฉ ๋ฒ”์œ„ ํ™•์žฅ: ์‹œ๊ฐ„ ์ง€์‹ ๊ทธ๋ž˜ํ”„ ๋“ฑ ๋” ๋ณต์žกํ•œ ์‹œ๋‚˜๋ฆฌ์˜ค์—์„œ์˜ ์‘์šฉ ํƒ์ƒ‰

์‹ฌ์ธต ํ‰๊ฐ€

์žฅ์ 

  1. ๋ฐฉ๋ฒ•์˜ ๋†’์€ ํ˜์‹ ์„ฑ: ํŠน์„ฑ ์ˆ˜์ค€ ๊นŠ์€ ์œตํ•ฉ ํŒจ๋Ÿฌ๋‹ค์ž„์„ ์ตœ์ดˆ๋กœ ์ œ์•ˆํ•˜์—ฌ ๊ธฐ์กด ์ ‘๋‘์‚ฌ ํŠœ๋‹์˜ ํ•œ๊ณ„ ๋ŒํŒŒ
  2. ํ•ฉ๋ฆฌ์ ์ธ ๊ธฐ์ˆ  ์„ค๊ณ„: ๊ด€๊ณ„ ์ค‘์‹ฌ ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ๊ณผ ์˜๋ฏธ๋ก  ๊ธฐ๋ฐ˜ ์ด์›ƒ ์„ ํƒ ์„ค๊ณ„๊ฐ€ ์ •๊ตํ•ฉ๋‹ˆ๋‹ค
  3. ์ถฉ๋ถ„ํ•˜๊ณ  ํฌ๊ด„์ ์ธ ์‹คํ—˜: ๋งํฌ ์˜ˆ์ธก๊ณผ ์‚ผ์ค‘ํ•ญ ๋ถ„๋ฅ˜ ๋‘ ๊ฐ€์ง€ ์ž‘์—… ์œ ํ˜•, ์—ฌ๋Ÿฌ ๋ฐ์ดํ„ฐ์…‹ ๊ฒ€์ฆ
  4. ์ƒ์„ธํ•œ ์†Œ๊ฑฐ ์‹คํ—˜: ๊ฐ ๊ตฌ์„ฑ ์š”์†Œ์˜ ๊ธฐ์—ฌ๋„๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๊ฒ€์ฆ
  5. ์‹ฌ์ธต์  ์‚ฌ๋ก€ ๋ถ„์„: ๊ตฌ์ฒด์ ์ธ ์˜ˆ์‹œ๋ฅผ ํ†ตํ•ด ์˜๋ฏธ๋ก  ๊ฐ•ํ™” ํšจ๊ณผ ์ž…์ฆ

๋ถ€์กฑํ•œ ์ 

  1. ๊ณ„์‚ฐ ๋ณต์žก๋„ ๋ถ„์„ ๋ถ€์กฑ: 2๋‹จ๊ณ„ ํ•™์Šต์˜ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ์— ๋Œ€ํ•œ ์ƒ์„ธ ๋ถ„์„ ์—†์Œ
  2. ํ™•์žฅ์„ฑ ๋…ผ์˜ ์ œํ•œ์ : ์ดˆ๋Œ€๊ทœ๋ชจ ์ง€์‹ ๊ทธ๋ž˜ํ”„์— ๋Œ€ํ•œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ ๋ถ„์„ ๋ถ€์กฑ
  3. ์˜ค๋ฅ˜ ๋ถ„์„ ๊ฒฐ์—ฌ: ์‹คํŒจ ์‚ฌ๋ก€์— ๋Œ€ํ•œ ์‹ฌ์ธต ๋ถ„์„ ๋ถ€์žฌ
  4. ๊ธฐ์ค€์„  ์„ ํƒ: ์ผ๋ถ€ ๊ธฐ์ค€์„  ๋ฐฉ๋ฒ•์ด ์ตœ์‹  ์ตœ๊ฐ• ๋ฐฉ๋ฒ•์ด ์•„๋‹ ์ˆ˜ ์žˆ์Œ

์˜ํ–ฅ๋ ฅ

  1. ์ด๋ก ์  ๊ธฐ์—ฌ: ์ง€์‹ ๊ทธ๋ž˜ํ”„์™€ LLM ์œตํ•ฉ์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ ์ œ๊ณต
  2. ์‹ค์šฉ์  ๊ฐ€์น˜: ์—ฌ๋Ÿฌ ๋ฒค์น˜๋งˆํฌ์—์„œ์˜ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์ด ์‹ค์šฉ์„ฑ ์ž…์ฆ
  3. ์žฌํ˜„์„ฑ: ์ƒ์„ธํ•œ ๊ตฌํ˜„ ์„ธ๋ถ€์‚ฌํ•ญ ์ œ๊ณต์œผ๋กœ ์žฌํ˜„ ์šฉ์ด
  4. ์˜๊ฐ: ํŠน์„ฑ ์ˆ˜์ค€ ์œตํ•ฉ ์•„์ด๋””์–ด๊ฐ€ ๊ด€๋ จ ์—ฐ๊ตฌ์— ์˜๊ฐ ์ œ๊ณต ๊ฐ€๋Šฅ

์ ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค

  1. ์ง€์‹ ์ง‘์•ฝ์  ์ž‘์—…: ๊ตฌ์กฐํ™”๋œ ์ง€์‹์ด ํ•„์š”ํ•œ ์ถ”๋ก  ์ž‘์—…์— ํŠนํžˆ ์ ํ•ฉ
  2. ์ค‘๋“ฑ ๊ทœ๋ชจ ์ง€์‹ ๊ทธ๋ž˜ํ”„: ํ˜„์žฌ ์‹คํ—˜ ๊ทœ๋ชจ๋Š” ์ค‘๋“ฑ ๊ทœ๋ชจ KG ์‘์šฉ์— ์ ํ•ฉํ•จ์„ ์‹œ์‚ฌ
  3. ๋†’์€ ์ •ํ™•๋„ ์š”๊ตฌ ์‹œ๋‚˜๋ฆฌ์˜ค: ์ •ํ™•๋„๊ฐ€ ํšจ์œจ์„ฑ๋ณด๋‹ค ์ค‘์š”ํ•œ ์‘์šฉ์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ ๋ฐœํœ˜
  4. ๋‹ค์ค‘ ํ™‰ ์ถ”๋ก  ํ•„์š”: ๋ณต์žกํ•œ ์ฟผ๋ฆฌ์˜ ๋‹ค์ค‘ ํ™‰ ์ถ”๋ก ์„ ํšจ๊ณผ์ ์œผ๋กœ ์ฒ˜๋ฆฌ ๊ฐ€๋Šฅ

์ฐธ๊ณ ๋ฌธํ—Œ

๋ณธ ๋…ผ๋ฌธ์€ 80ํŽธ์˜ ๊ด€๋ จ ๋ฌธํ—Œ์„ ์ธ์šฉํ•˜์˜€์œผ๋ฉฐ, ์ง€์‹ ๊ทธ๋ž˜ํ”„ ์ž„๋ฒ ๋”ฉ, ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง, ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ ๋“ฑ ์—ฌ๋Ÿฌ ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์—ฐ๊ตฌ๋ฅผ ํฌํ•จํ•˜์—ฌ ์—ฐ๊ตฌ์— ๊ฒฌ๊ณ ํ•œ ์ด๋ก ์  ๊ธฐ์ดˆ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์ฃผ์š” ์ฐธ๊ณ  ๋ฌธํ—Œ์—๋Š” TransE, RotatE ๋“ฑ ๊ณ ์ „์  KG ์ž„๋ฒ ๋”ฉ ๋ฐฉ๋ฒ•๊ณผ KG-BERT, KoPA ๋“ฑ LLM-KG ์œตํ•ฉ์˜ ๋Œ€ํ‘œ์  ์—ฐ๊ตฌ๊ฐ€ ํฌํ•จ๋ฉ๋‹ˆ๋‹ค.