2025-11-21T07:40:15.798625

Artificial Impressions: Evaluating Large Language Model Behavior Through the Lens of Trait Impressions

Deas, McKeown
We introduce and study artificial impressions--patterns in LLMs' internal representations of prompts that resemble human impressions and stereotypes based on language. We fit linear probes on generated prompts to predict impressions according to the two-dimensional Stereotype Content Model (SCM). Using these probes, we study the relationship between impressions and downstream model behavior as well as prompt features that may inform such impressions. We find that LLMs inconsistently report impressions when prompted, but also that impressions are more consistently linearly decodable from their hidden representations. Additionally, we show that artificial impressions of prompts are predictive of the quality and use of hedging in model responses. We also investigate how particular content, stylistic, and dialectal features in prompts impact LLM impressions.
academic

์ธ๊ณต์  ์ธ์ƒ: ํŠน์„ฑ ์ธ์ƒ์˜ ๋ Œ์ฆˆ๋ฅผ ํ†ตํ•œ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ ํ–‰๋™ ํ‰๊ฐ€

๊ธฐ๋ณธ ์ •๋ณด

  • ๋…ผ๋ฌธ ID: 2510.08915
  • ์ œ๋ชฉ: Artificial Impressions: Evaluating Large Language Model Behavior Through the Lens of Trait Impressions
  • ์ €์ž: Nicholas Deas, Kathleen McKeown (Columbia University)
  • ๋ถ„๋ฅ˜: cs.CL (๊ณ„์‚ฐ ์–ธ์–ดํ•™)
  • ๋ฐœํ‘œ ์‹œ๊ฐ„: 2025๋…„ 10์›” 10์ผ (arXiv ์‚ฌ์ „ ์ธ์‡„๋ณธ)
  • ๋…ผ๋ฌธ ๋งํฌ: https://arxiv.org/abs/2510.08915

์ดˆ๋ก

๋ณธ ๋…ผ๋ฌธ์€ "์ธ๊ณต์  ์ธ์ƒ(artificial impressions)" ๊ฐœ๋…์„ ๋„์ž…ํ•˜๊ณ  ์—ฐ๊ตฌํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLMs)์˜ ๋‚ด๋ถ€ ํ‘œํ˜„์—์„œ ๋ฐœ๊ฒฌ๋˜๋Š” ํŒจํ„ด์œผ๋กœ, ์ธ๊ฐ„์ด ์–ธ์–ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ˜•์„ฑํ•˜๋Š” ์ธ์ƒ ๋ฐ ๊ณ ์ •๊ด€๋…๊ณผ ์œ ์‚ฌํ•ฉ๋‹ˆ๋‹ค. ์—ฐ๊ตฌ์ž๋“ค์€ ์ƒ์„ฑ๋œ ํ”„๋กฌํ”„ํŠธ์— ๋Œ€ํ•ด ์„ ํ˜• ํ”„๋กœ๋ธŒ๋ฅผ ํ›ˆ๋ จํ•˜์—ฌ 2์ฐจ์› ๊ณ ์ •๊ด€๋… ๋‚ด์šฉ ๋ชจ๋ธ(Stereotype Content Model, SCM)์— ๋”ฐ๋ผ ์ธ์ƒ์„ ์˜ˆ์ธกํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ํ”„๋กœ๋ธŒ๋ฅผ ํ†ตํ•ด ์ธ์ƒ๊ณผ ํ•˜์œ„ ๋ชจ๋ธ ํ–‰๋™ ๊ฐ„์˜ ๊ด€๊ณ„, ๊ทธ๋ฆฌ๊ณ  ์ด๋Ÿฌํ•œ ์ธ์ƒ์— ์˜ํ–ฅ์„ ๋ฏธ์น  ์ˆ˜ ์žˆ๋Š” ํ”„๋กฌํ”„ํŠธ ํŠน์„ฑ์„ ์—ฐ๊ตฌํ–ˆ์Šต๋‹ˆ๋‹ค. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ, LLMs์ด ํ”„๋กฌํ”„ํŠธ๋  ๋•Œ ์ธ์ƒ์„ ๋ถˆ์ผ์น˜ํ•˜๊ฒŒ ๋ณด๊ณ ํ•˜์ง€๋งŒ, ์ธ์ƒ์€ ์ˆจ๊ฒจ์ง„ ํ‘œํ˜„์—์„œ ๋” ์ผ๊ด€๋˜๊ฒŒ ์„ ํ˜•์œผ๋กœ ๋””์ฝ”๋”ฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ฐœ๊ฒฌํ–ˆ์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ ํ”„๋กฌํ”„ํŠธ์˜ ์ธ๊ณต์  ์ธ์ƒ์ด ๋ชจ๋ธ ์‘๋‹ต์˜ ํ’ˆ์งˆ๊ณผ ์™„ํ™” ์–ธ์–ด ์‚ฌ์šฉ์„ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค.

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋™๊ธฐ

๋ฌธ์ œ ์ •์˜

์ธ๊ฐ„์€ ์ƒํ˜ธ์ž‘์šฉ ์ค‘์— ํƒ€์ธ์— ๋Œ€ํ•œ ์ดˆ๊ธฐ ์ธ์ƒ์„ ๋น ๋ฅด๊ฒŒ ํ˜•์„ฑํ•˜๋ฉฐ, ์ด๋Ÿฌํ•œ ์ธ์ƒ์€ ํƒœ๋„์™€ ํ–‰๋™์— ์ง€์†์ ์ธ ์˜ํ–ฅ์„ ๋ฏธ์นฉ๋‹ˆ๋‹ค. ์œ ์‚ฌํ•˜๊ฒŒ, ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์€ ํ›ˆ๋ จ ๊ณผ์ •์—์„œ ๋‹ค์–‘ํ•œ ์ €์ž์˜ ๋Œ€๋Ÿ‰ ํ…์ŠคํŠธ์— ๋…ธ์ถœ๋˜์–ด ์–ธ์–ด ํŠน์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์œ ์‚ฌํ•œ "์ธ์ƒ"์„ ํ˜•์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

์—ฐ๊ตฌ์˜ ์ค‘์š”์„ฑ

  1. ํŽธํ–ฅ ๋ฐ ๊ณต์ •์„ฑ: LLMs์ด ์–ธ์–ด ํŠน์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์ธ์ƒ์„ ํ˜•์„ฑํ•˜๋Š” ๋ฐฉ์‹์„ ์ดํ•ดํ•˜๋Š” ๊ฒƒ์€ ํŽธํ–ฅ์„ ์‹๋ณ„ํ•˜๊ณ  ์™„ํ™”ํ•˜๋Š” ๋ฐ ํ•„์ˆ˜์ ์ž…๋‹ˆ๋‹ค.
  2. ๋ชจ๋ธ ํ–‰๋™ ์˜ˆ์ธก: ์ธ๊ณต์  ์ธ์ƒ์€ ์‘๋‹ต ํ’ˆ์งˆ ๋ฐ ์–ธ์–ด ์‚ฌ์šฉ๊ณผ ๊ฐ™์€ ๋ชจ๋ธ์˜ ํ•˜์œ„ ์„ฑ๋Šฅ์— ์˜ํ–ฅ์„ ๋ฏธ์น  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ์‚ฌํšŒ์–ธ์–ดํ•™์  ์˜ํ–ฅ: ๋‹ค์–‘ํ•œ ๋ฐฉ์–ธ๊ณผ ์–ธ์–ด ๋ณ€ํ˜•์€ ์„œ๋กœ ๋‹ค๋ฅธ ์ธ์ƒ์„ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ด๋Š” ์†Œ์™ธ๋œ ์ง‘๋‹จ์˜ ์‚ฌ์šฉ ๊ฒฝํ—˜์— ์˜ํ–ฅ์„ ๋ฏธ์นฉ๋‹ˆ๋‹ค.

๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„

  • LLMs์— ์ง์ ‘ ํ”„๋กฌํ”„ํŠธํ•˜์—ฌ ์ธ์ƒ์„ ๋ณด๊ณ ํ•˜๋„๋ก ํ•˜๋ฉด ๋ถˆ์ผ์น˜์„ฑ๊ณผ ๊ธ์ •์  ํŽธํ–ฅ์ด ๋ฐœ์ƒํ•ฉ๋‹ˆ๋‹ค.
  • LLMs์˜ ๋‚ด์žฌ์  ์ธ์ƒ์„ ์ •๋Ÿ‰ํ™”ํ•˜๊ณ  ๋ถ„์„ํ•˜๋Š” ์ฒด๊ณ„์  ๋ฐฉ๋ฒ•์ด ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.
  • ์ธ์ƒ์ด ํ•˜์œ„ ํ–‰๋™์— ์–ด๋–ป๊ฒŒ ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š”์ง€์— ๋Œ€ํ•œ ์ดํ•ด๊ฐ€ ์ œํ•œ์ ์ž…๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ

  1. "์ธ๊ณต์  ์ธ์ƒ" ๊ฐœ๋… ์ œ์‹œ: ํ”„๋กฌํ”„ํŠธ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ LLMs์ด ํ˜•์„ฑํ•˜๋Š” ๋‚ด์žฌ์  ์ธ์ƒ์„ ์ฒ˜์Œ์œผ๋กœ ์ฒด๊ณ„์ ์œผ๋กœ ์—ฐ๊ตฌํ•ฉ๋‹ˆ๋‹ค.
  2. ์„ ํ˜• ํ”„๋กœ๋ธŒ ๋ฐฉ๋ฒ• ๊ฐœ๋ฐœ: SCM ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ˆจ๊ฒจ์ง„ ์ƒํƒœ์—์„œ ์ธ์ƒ์„ ๋””์ฝ”๋”ฉํ•˜๋Š” ํ”„๋กœ๋ธŒ๋ฅผ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค.
  3. ์ธ์ƒ-ํ–‰๋™ ์—ฐ๊ด€์„ฑ ํ™•๋ฆฝ: ์ธ๊ณต์  ์ธ์ƒ์ด ์‘๋‹ต ํ’ˆ์งˆ๊ณผ ์™„ํ™” ์–ธ์–ด ์‚ฌ์šฉ์„ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ์Œ์„ ์ฆ๋ช…ํ•ฉ๋‹ˆ๋‹ค.
  4. ์˜ํ–ฅ ์š”์ธ ์‹๋ณ„: LLM ์ธ์ƒ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š” ๋‚ด์šฉ, ์Šคํƒ€์ผ ๋ฐ ๋ฐฉ์–ธ ํŠน์„ฑ์„ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค.
  5. ๋ฐฉ์–ธ ํŽธํ–ฅ ๋…ธ์ถœ: LLMs์ด ์•„ํ”„๋ฆฌ์นด๊ณ„ ๋ฏธ๊ตญ์ธ ์–ธ์–ด(AAL)์— ๋Œ€ํ•ด ๋” ๋ถ€์ •์ ์ธ ์ธ์ƒ์„ ๊ฐ€์ง€๊ณ  ์žˆ์Œ์„ ๋ฐœ๊ฒฌํ•ฉ๋‹ˆ๋‹ค.

๋ฐฉ๋ฒ•๋ก  ์ƒ์„ธ ์„ค๋ช…

์ž‘์—… ์ •์˜

์‚ฌ์šฉ์ž ํ”„๋กฌํ”„ํŠธ๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ, ๋ชฉํ‘œ๋Š”:

  1. LLM ์ˆจ๊ฒจ์ง„ ํ‘œํ˜„์—์„œ SCM ๊ธฐ๋ฐ˜ ์ธ์ƒ ์ ์ˆ˜๋ฅผ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค.
  2. ์ธ์ƒ๊ณผ ๋ชจ๋ธ ํ–‰๋™ ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค.
  3. ์ธ์ƒ ํ˜•์„ฑ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š” ํ”„๋กฌํ”„ํŠธ ํŠน์„ฑ์„ ์‹๋ณ„ํ•ฉ๋‹ˆ๋‹ค.

๊ณ ์ •๊ด€๋… ๋‚ด์šฉ ๋ชจ๋ธ(SCM)

SCM์€ ๋‘ ๊ฐ€์ง€ ์ฐจ์›์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค:

  • ๋”ฐ๋œปํ•จ(Warmth): ๋ชฉํ‘œ์˜ ์˜๋„์— ๋Œ€ํ•œ ์ธ์‹(์˜ˆ: ์นœ์ ˆํ•จ, ์ „ํˆฌ์„ฑ)
  • ๋Šฅ๋ ฅ(Competence): ๋ชฉํ‘œ๊ฐ€ ์˜๋„๋ฅผ ์„ฑ๊ณต์ ์œผ๋กœ ์‹คํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ๋Šฅ๋ ฅ(์˜ˆ: ์ง€๋Šฅ, ๊ถŒ๋ ฅ)

๋ฐ์ดํ„ฐ ์ƒ์„ฑ ํ”„๋กœ์„ธ์Šค

1. ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ

๋‹จ๊ณ„ 1: ํŠน์„ฑ ์–ดํœ˜ โ†’ ์ธ์ƒ ์‚ฌ์–‘(์˜ˆ: "์นœ์ ˆํ•˜๊ณ  ์„ธ์‹ฌํ•จ")
๋‹จ๊ณ„ 2: ์ธ์ƒ ์‚ฌ์–‘์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•ฉ์„ฑ ์‚ฌ์šฉ์ž ํ”„๋กฌํ”„ํŠธ ์ƒ์„ฑ
๋‹จ๊ณ„ 3: LLM ์ˆจ๊ฒจ์ง„ ํ‘œํ˜„ ์ถ”์ถœ
๋‹จ๊ณ„ 4: ํ”„๋กœ๋ธŒ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ ๊ตฌ์„ฑ(ํ‘œํ˜„-๋ ˆ์ด๋ธ” ์Œ)

2. ํ”„๋กœ๋ธŒ ํ›ˆ๋ จ

  • ๋‹ค์ธต ํผ์…‰ํŠธ๋ก (MLP) ํ™œ์„ฑํ™”๋ฅผ ์ž…๋ ฅ ํŠน์„ฑ์œผ๋กœ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
  • ๋…๋ฆฝ์ ์ธ ๋”ฐ๋œปํ•จ ๋ฐ ๋Šฅ๋ ฅ ํ”„๋กœ๋ธŒ๋ฅผ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค.
  • 5-ํด๋“œ ๊ต์ฐจ ๊ฒ€์ฆ์„ ์‚ฌ์šฉํ•˜์—ฌ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.
  • ๋‹ค์–‘ํ•œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ ๋น„์œจ(100%, 10%, 1%)์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

๊ธฐ์ˆ ์  ํ˜์‹ ์ 

  1. ์‹ฌ๋ฆฌํ•™ ์ด๋ก  ๊ธฐ๋ฐ˜: ์‹ฌ๋ฆฌํ•™์˜ SCM ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ LLM ๋ถ„์„์— ์ ์šฉํ•ฉ๋‹ˆ๋‹ค.
  2. ํ”„๋กœ๋ธŒ ๋Œ€ ํ”„๋กฌํ”„ํŠธ ๋น„๊ต: ํ”„๋กœ๋ธŒ ๋ฐฉ๋ฒ•๊ณผ ์ง์ ‘ ํ”„๋กฌํ”„ํŠธ์˜ ์‹ ๋ขฐ์„ฑ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋น„๊ตํ•ฉ๋‹ˆ๋‹ค.
  3. ๋‹ค์ธต ๋ถ„์„: ๋‹ค์–‘ํ•œ ๋ชจ๋ธ ์ธต์—์„œ ์ธ์ƒ ์ •๋ณด์˜ ๋ถ„ํฌ๋ฅผ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค.
  4. ํ–‰๋™ ์˜ˆ์ธก ๊ฒ€์ฆ: ํ•˜์œ„ ์ž‘์—…์„ ํ†ตํ•ด ์ธ์ƒ์˜ ์œ ํšจ์„ฑ์„ ๊ฒ€์ฆํ•ฉ๋‹ˆ๋‹ค.

์‹คํ—˜ ์„ค์ •

๋ชจ๋ธ

  • Llama-3.1 (8B): 32์ธต, 4096 ์ˆจ๊ฒจ์ง„ ์ฐจ์›
  • Llama-3.2 (1B): 16์ธต, 2048 ์ˆจ๊ฒจ์ง„ ์ฐจ์›
  • OLMo-2 (7B): 32์ธต, 4096 ์ˆจ๊ฒจ์ง„ ์ฐจ์›

๋ฐ์ดํ„ฐ์…‹

ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ

  • 131๊ฐœ์˜ ๋”ฐ๋œปํ•จ ํŠน์„ฑ๊ณผ 104๊ฐœ์˜ ๋Šฅ๋ ฅ ํŠน์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•ฉ๋‹ˆ๋‹ค.
  • ๊ฐ ์ธ์ƒ ์‚ฌ์–‘์— ๋Œ€ํ•ด 10๊ฐœ์˜ ์ƒ˜ํ”Œ ์ƒ์„ฑ(์˜จ๋„=0.9)
  • ์ด 274,830๊ฐœ์˜ ํ”„๋กฌํ”„ํŠธ/๋ชจ๋ธ

์‹ค์ œ ๋ฐ์ดํ„ฐ

  • LMSysChat: 100๋งŒ ๊ฐœ์˜ ์‹ค์ œ ๋Œ€ํ™”์—์„œ 2,000๊ฐœ์˜ ์ฒซ ๋ผ์šด๋“œ ํ”„๋กฌํ”„ํŠธ ์ƒ˜ํ”Œ๋ง
  • TwitterAAE: 400๊ฐœ์˜ ํŠธ์œ—(200๊ฐœ AAL, 200๊ฐœ WME)
  • Counterparts ๋ฐ์ดํ„ฐ์…‹: ๋‹ค๋ฅธ ๋ณ€์ˆ˜๋ฅผ ์ œ์–ดํ•˜๋Š” ๋ณ‘๋ ฌ ์ฝ”ํผ์Šค

ํ‰๊ฐ€ ์ง€ํ‘œ

  • ํ”„๋กœ๋ธŒ ์„ฑ๋Šฅ: F1 ์ ์ˆ˜, ์ •ํ™•๋„
  • ์ž๊ธฐ ์ผ๊ด€์„ฑ: ๋ณด๊ณ ๋œ ์ธ์ƒ๊ณผ ์ œ๊ณต๋œ ํŠน์„ฑ์˜ ์ผ์น˜๋„
  • ์ธ๊ฐ„ ํ‰๊ฐ€: 4์  Likert ์ฒ™๋„, Krippendorff's ฮฑ = 0.71

์‹คํ—˜ ๊ฒฐ๊ณผ

์ฃผ์š” ๋ฐœ๊ฒฌ

๋ฐœ๊ฒฌ 1: ํ”„๋กฌํ”„ํŠธ ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„

LLM์ด ๋ณด๊ณ ํ•˜๋Š” ์ธ์ƒ์€ ์ผ๋ฐ˜์ ์œผ๋กœ ๊ธ์ •์  ํŠน์„ฑ(๋”ฐ๋œปํ•จ/๋Šฅ๋ ฅ)์œผ๋กœ ํŽธํ–ฅ๋˜์–ด ์žˆ์œผ๋ฉฐ, ํŠนํžˆ 1์ธ์นญ ์ƒํ™ฉ์—์„œ ๊ทธ๋ ‡์Šต๋‹ˆ๋‹ค:

  • Llama-3.1 (8B) 1์ธ์นญ ๋”ฐ๋œปํ•จ ์ž๊ธฐ ์ผ๊ด€์„ฑ์€ 51.67%์— ๋ถˆ๊ณผํ•ฉ๋‹ˆ๋‹ค.
  • 3์ธ์นญ ์ƒํ™ฉ์—์„œ๋Š” ๊ฐœ์„ ๋˜์ง€๋งŒ ์—ฌ์ „ํžˆ ์ œํ•œ์ ์ž…๋‹ˆ๋‹ค(์ตœ๋Œ€ 80.77%).

๋ฐœ๊ฒฌ 2: ์ธ๊ฐ„-๋ชจ๋ธ ์ธ์ƒ ์ผ๊ด€์„ฑ

์ธ๊ฐ„ ์ฃผ์„๊ณผ ์›๋ณธ ํŠน์„ฑ ๊ฐ„์˜ ์ผ๊ด€์„ฑ:

  • ์ „์ฒด Cohen's ฮบ = 0.68, Spearman r = 0.68
  • ํŠน์„ฑ ์–ดํœ˜ ๋ฐ SCM ๋ ˆ์ด๋ธ”์˜ ์œ ํšจ์„ฑ์„ ๊ฒ€์ฆํ•ฉ๋‹ˆ๋‹ค.

๋ฐœ๊ฒฌ 3: ํ”„๋กœ๋ธŒ ๋ฐฉ๋ฒ•์˜ ์œ ํšจ์„ฑ

์„ ํ˜• ํ”„๋กœ๋ธŒ๋Š” ์ˆจ๊ฒจ์ง„ ํ‘œํ˜„์—์„œ ์ธ์ƒ์„ ์„ฑ๊ณต์ ์œผ๋กœ ๋””์ฝ”๋”ฉํ•ฉ๋‹ˆ๋‹ค:

  • ๋”ฐ๋œปํ•จ ํ”„๋กœ๋ธŒ F1 ์ ์ˆ˜: 75-90%
  • ๋Šฅ๋ ฅ ํ”„๋กœ๋ธŒ F1 ์ ์ˆ˜: 75-85%
  • ์„ฑ๋Šฅ์€ ๋ชจ๋ธ์˜ ์ค‘๊ฐ„ ์ธต์—์„œ ์ตœ๊ณ ์กฐ์— ๋„๋‹ฌํ•ฉ๋‹ˆ๋‹ค.

๋ฐœ๊ฒฌ 4: ๋”ฐ๋œปํ•จ ์šฐ์œ„ ํšจ๊ณผ

๋ชจ๋ธ์€ ๋”ฐ๋œปํ•จ ์ฐจ์›์—์„œ ๋” ๋‚˜์€ ์„ฑ๋Šฅ์„ ๋ณด์ž…๋‹ˆ๋‹ค:

  • ๋”ฐ๋œปํ•จ ํ”„๋กœ๋ธŒ ์„ฑ๋Šฅ์ด ๋Šฅ๋ ฅ ํ”„๋กœ๋ธŒ๋ณด๋‹ค ์ง€์†์ ์œผ๋กœ ๋†’์Šต๋‹ˆ๋‹ค.
  • ์ธ๊ฐ„ ์ธ์ƒ ํ˜•์„ฑ์˜ "๋”ฐ๋œปํ•จ ์šฐ์„  ํšจ๊ณผ"๋ฅผ ๋ชจ๋ฐฉํ•ฉ๋‹ˆ๋‹ค.

์ธ์ƒ-ํ–‰๋™ ์—ฐ๊ด€์„ฑ ์‹คํ—˜

์‘๋‹ต ํ’ˆ์งˆ ์˜ˆ์ธก

์ˆœ์„œ ๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ธ์ƒ์ด ์‘๋‹ต ํ’ˆ์งˆ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค:

๋ชจ๋ธ๋”ฐ๋œปํ•จ ๊ณ„์ˆ˜๋Šฅ๋ ฅ ๊ณ„์ˆ˜
Llama-3.2-1B1.07**0.90**
Llama-3.1-8B0.49*0.39*
OLMo-2-7B0.76**0.35*

๋ฐœ๊ฒฌ 5: ๋”ฐ๋œปํ•จ ๋ฐ ๋Šฅ๋ ฅ ์ธ์ƒ์ด ์‘๋‹ต ํ’ˆ์งˆ์„ ์œ ์˜๋ฏธํ•˜๊ฒŒ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.

์™„ํ™” ์–ธ์–ด ๋ถ„์„

์Œ์ดํ•ญ ํšŒ๊ท€๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ธ์ƒ์ด ์™„ํ™” ์–ธ์–ด ์‚ฌ์šฉ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค:

๋ชจ๋ธ๋”ฐ๋œปํ•จ ๊ณ„์ˆ˜๋Šฅ๋ ฅ ๊ณ„์ˆ˜
Llama-3.2-1B-0.46*-1.06**
Llama-3.1-8B-0.14-1.18**
OLMo-2-7B0.40**-0.69**

๋ฐœ๊ฒฌ 6: ๋‚ฎ์€ ๋Šฅ๋ ฅ ์ธ์ƒ์ด ๋” ๋งŽ์€ ์™„ํ™” ์–ธ์–ด ์‚ฌ์šฉ์„ ์œ ์˜๋ฏธํ•˜๊ฒŒ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.

์˜ํ–ฅ ์š”์ธ ๋ถ„์„

๋‚ด์šฉ ๋ฐ ์Šคํƒ€์ผ ํŠน์„ฑ

LIWC ๋ฐ IDP๋ฅผ ์‚ฌ์šฉํ•œ ๋ถ„์„ ๊ฒฐ๊ณผ:

๋†’์€ ๋”ฐ๋œปํ•จ ํŠน์„ฑ:

  • ํƒ์ƒ‰์  ์–ดํœ˜("wondering", "might", "seem")
  • ์ฐจ์ด ์–ดํœ˜("would", "could", "hope")
  • ์ •์ค‘ํ•จ๊ณผ ์‹ฌ๋ฆฌ์  ๊ฑฐ๋ฆฌ๋ฅผ ๊ตฌํ˜„ํ•ฉ๋‹ˆ๋‹ค.

๋‚ฎ์€ ๋”ฐ๋œปํ•จ ํŠน์„ฑ:

  • ์˜๋ฌธ์‚ฌ("what", "how")
  • ์ธ๊ณผ ์–ดํœ˜("because", "effect")

๋†’์€ ๋Šฅ๋ ฅ ํŠน์„ฑ:

  • ํ†ต์ฐฐ ์–ดํœ˜("rethink", "know", "informed")
  • ๊ณต์‹์  ์–ธ์–ด ๊ตฌ์กฐ

๋‚ฎ์€ ๋Šฅ๋ ฅ ํŠน์„ฑ:

  • ๋น„๊ณต์‹ ๋งˆ์ปค("yeah", "sure", ์ด๋ชจ์ง€)
  • ์ธํ„ฐ๋„ท ์–ธ์–ด("aight", "gonna")

๋ฐฉ์–ธ ํŽธํ–ฅ ๋ถ„์„

๋ฐœ๊ฒฌ 8: ๋ชจ๋ธ์€ AAL ํ…์ŠคํŠธ์— ๋Œ€ํ•ด ๋” ๋ถ€์ •์ ์ธ ์ธ์ƒ์„ ๊ฐ€์ง‘๋‹ˆ๋‹ค.

  • AAL vs WME ๋”ฐ๋œปํ•จ ์ƒ๊ด€๊ด€๊ณ„: r = -0.32 (p โ‰ค 0.001)
  • AAL vs WME ๋Šฅ๋ ฅ ์ƒ๊ด€๊ด€๊ณ„: r = -0.52 (p โ‰ค 0.001)
  • ๋ณ‘๋ ฌ ์ฝ”ํผ์Šค๊ฐ€ ์œ ์‚ฌํ•œ ์ถ”์„ธ๋ฅผ ๊ฒ€์ฆํ•ฉ๋‹ˆ๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ

ํ”„๋กฌํ”„ํŠธ ํŠน์„ฑ๊ณผ LLM ํ–‰๋™

  • ํ™”์šฉ๋ก ์  ํŠน์„ฑ: ์ •์ค‘ํ•จ, ๊ฐ์ • ์ž๊ทน์ด ์„ฑ๋Šฅ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ
  • ์‚ฌํšŒ์–ธ์–ดํ•™์  ํŠน์„ฑ: ์–ธ์–ด ๋ณ€ํ˜•์ด ๋ฌธํ™”์  ์ •๋ ฌ ๋ฐ ๊ฐ์ •์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ
  • ๋ฐฉ์–ธ ์—ฐ๊ตฌ: LLMs์—์„œ AAL ๋“ฑ ๋ฐฉ์–ธ์˜ ํŽธํ–ฅ ๋ฐ ์„ฑ๋Šฅ ์ฐจ์ด

๊ณ ์ •๊ด€๋…๊ณผ LLMs

  • ์ƒ์„ฑ ํŽธํ–ฅ: ๋ชจ๋ธ ์ถœ๋ ฅ์˜ ๊ณ ์ •๊ด€๋… ๋ฐ ์‚ฌํšŒ์  ํŽธํ–ฅ
  • ๊ณ ์ •๊ด€๋… ๋‚ด์šฉ: SCM ๋“ฑ์˜ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์‚ฌ์šฉํ•œ LLM ๊ณ ์ •๊ด€๋… ๋ถ„์„
  • ์‚ฌํšŒ์  ํƒœ๋„ ๋ฐ˜์˜: ์‚ฌํšŒ์  ํŽธํ–ฅ์˜ ๋ฐ˜์˜์œผ๋กœ์„œ์˜ LLMs

๊ฒฐ๋ก  ๋ฐ ๋…ผ์˜

์ฃผ์š” ๊ฒฐ๋ก 

  1. ๋ฐฉ๋ฒ• ์œ ํšจ์„ฑ: ์„ ํ˜• ํ”„๋กœ๋ธŒ๋Š” ์ง์ ‘ ํ”„๋กฌํ”„ํŠธ๋ณด๋‹ค LLM ์ธ์ƒ์„ ๋” ์•ˆ์ •์ ์œผ๋กœ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค.
  2. ํ–‰๋™ ์˜ˆ์ธก๋ ฅ: ์ธ๊ณต์  ์ธ์ƒ์€ ์‘๋‹ต ํ’ˆ์งˆ ๋ฐ ์–ธ์–ด ์‚ฌ์šฉ ํŒจํ„ด์„ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ํŽธํ–ฅ ์‹๋ณ„: ํŠน์ • ๋ฐฉ์–ธ ๋ฐ ์ง‘๋‹จ์— ๋Œ€ํ•œ ํŽธํ–ฅ์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋ฐœ๊ฒฌํ–ˆ์Šต๋‹ˆ๋‹ค.
  4. ๋”ฐ๋œปํ•จ ์šฐ์œ„: LLMs์€ ์ธ๊ฐ„๊ณผ ์œ ์‚ฌํ•œ ๋”ฐ๋œปํ•จ ์šฐ์„  ํšจ๊ณผ๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.

ํ•œ๊ณ„

  1. ๋ฒ”์œ„ ์ œํ•œ: ์˜์–ด ๋Œ€ํ™”์˜ ์ฒซ ๋ผ์šด๋“œ ๋ฉ”์‹œ์ง€์—๋งŒ ์ดˆ์ ์„ ๋งž์ถฅ๋‹ˆ๋‹ค.
  2. ๋ชจ๋ธ ๊ทœ๋ชจ: 8B ๋งค๊ฐœ๋ณ€์ˆ˜ ์ดํ•˜์˜ ์˜คํ”ˆ์†Œ์Šค ๋ชจ๋ธ๋กœ ์ œํ•œ๋ฉ๋‹ˆ๋‹ค.
  3. ์ด๋ก ์  ํ”„๋ ˆ์ž„์›Œํฌ: SCM๋งŒ ์‚ฌ์šฉํ•˜๋ฉฐ ๋‹ค๋ฅธ ๊ณ ์ •๊ด€๋… ๋ชจ๋ธ์„ ํƒ์ƒ‰ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.
  4. ๋ฌธํ™”์  ์ฐจ์ด: ์ธ์ƒ ํ˜•์„ฑ์˜ ๋ฌธํ™” ๊ฐ„ ์ฐจ์ด๋ฅผ ๊ณ ๋ คํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

์œค๋ฆฌ์  ๊ณ ๋ ค์‚ฌํ•ญ

  1. ์˜์ธํ™” ์œ„ํ—˜: LLMs์˜ ๊ณผ๋„ํ•œ ์˜์ธํ™”๋ฅผ ํ”ผํ•˜๊ธฐ ์œ„ํ•ด ์ฃผ์˜๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
  2. ํŽธํ–ฅ ์ฆํญ: ์‹๋ณ„๋œ ํŽธํ–ฅ์ด ์†Œ์™ธ๋œ ์ง‘๋‹จ์— ํ•ด๋ฅผ ๋ผ์น  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ์‘์šฉ ๊ฒฝ๊ณ„: ์–ด๋–ค ์ƒํ™ฉ์—์„œ ์ฐจ๋ณ„ํ™”๋œ ํ–‰๋™์ด ํ•ฉ๋ฆฌ์ ์ธ์ง€ ๋ช…ํ™•ํžˆ ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

ํ–ฅํ›„ ๋ฐฉํ–ฅ

  1. ๋‹ค์ค‘ ๋ผ์šด๋“œ ๋Œ€ํ™”: ๋Œ€ํ™” ๊ณผ์ •์—์„œ ์ธ์ƒ์˜ ์ง„ํ™”๋ฅผ ์—ฐ๊ตฌํ•ฉ๋‹ˆ๋‹ค.
  2. ๋ฌธํ™” ๊ฐ„ ์—ฐ๊ตฌ: ๋‹ค์–‘ํ•œ ๋ฌธํ™” ๋ฐฐ๊ฒฝ์—์„œ์˜ ์ธ์ƒ ํ˜•์„ฑ์„ ํƒ์ƒ‰ํ•ฉ๋‹ˆ๋‹ค.
  3. ์™„ํ™” ์ „๋žต: ํ•ด๋กœ์šด ํŽธํ–ฅ์„ ์ค„์ด๊ธฐ ์œ„ํ•œ ๊ธฐ์ˆ ์  ๋ฐฉ๋ฒ•์„ ๊ฐœ๋ฐœํ•ฉ๋‹ˆ๋‹ค.
  4. ์ด๋ก ์  ํ™•์žฅ: ๋” ๋ณต์žกํ•œ ์ธ์ƒ ํ˜•์„ฑ ๋ชจ๋ธ์„ ์ ์šฉํ•ฉ๋‹ˆ๋‹ค.

์‹ฌ์ธต ํ‰๊ฐ€

์žฅ์ 

  1. ๋†’์€ ํ˜์‹ ์„ฑ: ์‹ฌ๋ฆฌํ•™ ์ธ์ƒ ์ด๋ก ์„ LLM ๋ถ„์„์— ์ฒ˜์Œ์œผ๋กœ ์ฒด๊ณ„์ ์œผ๋กœ ์ ์šฉํ•ฉ๋‹ˆ๋‹ค.
  2. ์—„๋ฐ€ํ•œ ๋ฐฉ๋ฒ•๋ก : ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ, ํ”„๋กœ๋ธŒ ๊ธฐ์ˆ  ๋ฐ ์ธ๊ฐ„ ํ‰๊ฐ€๋ฅผ ๊ฒฐํ•ฉํ•ฉ๋‹ˆ๋‹ค.
  3. ๋†’์€ ์‹ค์šฉ ๊ฐ€์น˜: LLM ํŽธํ–ฅ์„ ์ดํ•ดํ•˜๊ณ  ์™„ํ™”ํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ๋„๊ตฌ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
  4. ์ถฉ๋ถ„ํ•œ ์‹คํ—˜: ๋‹ค์ค‘ ๋ชจ๋ธ, ๋‹ค์ค‘ ์ž‘์—…์˜ ํฌ๊ด„์  ๊ฒ€์ฆ
  5. ์‚ฌํšŒ์  ์˜์˜: ์ค‘์š”ํ•œ ๊ณต์ •์„ฑ ๋ฌธ์ œ๋ฅผ ๋…ธ์ถœํ•ฉ๋‹ˆ๋‹ค.

๋ถ€์กฑํ•œ ์ 

  1. ์ด๋ก ์  ํ•œ๊ณ„: SCM์ด ๋ชจ๋“  ๊ด€๋ จ ์ธ์ƒ ์ฐจ์›์„ ํฌ์ฐฉํ•˜์ง€ ๋ชปํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  2. ๋ฐ์ดํ„ฐ ํŽธํ–ฅ: ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ๊ฐ€ ์‹ค์ œ ์‚ฌ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ์™„์ „ํžˆ ๋ฐ˜์˜ํ•˜์ง€ ๋ชปํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ์ธ๊ณผ ๊ด€๊ณ„: ์ธ์ƒ๊ณผ ํ–‰๋™ ๊ฐ„์˜ ๊ด€๊ณ„์— ํ˜ผ๋™ ๋ณ€์ˆ˜๊ฐ€ ์žˆ์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  4. ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ: ๋” ํฐ ๋ชจ๋ธ ๋ฐ ๋‹ค์–‘ํ•œ ํ›ˆ๋ จ ํŒจ๋Ÿฌ๋‹ค์ž„์—์„œ์˜ ๊ฒฐ๊ณผ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์ด ๋ถˆ๋ช…ํ™•ํ•ฉ๋‹ˆ๋‹ค.

์˜ํ–ฅ๋ ฅ

  1. ํ•™์ˆ ์  ๊ธฐ์—ฌ: LLM ํŽธํ–ฅ ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์ด๋ก ์  ํ”„๋ ˆ์ž„์›Œํฌ ๋ฐ ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
  2. ์‹ค๋ฌด์  ๊ฐ€์น˜: ๋ชจ๋ธ ํ‰๊ฐ€ ๋ฐ ํŽธํ–ฅ ๊ฐ์ง€์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. ์ •์ฑ…์  ์˜์˜: AI ๊ณต์ •์„ฑ ์ •์ฑ… ์ˆ˜๋ฆฝ์— ๊ณผํ•™์  ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
  4. ํ•™์ œ ๊ฐ„ ์˜ํ–ฅ: ์‹ฌ๋ฆฌํ•™, ์‚ฌํšŒ์–ธ์–ดํ•™ ๋ฐ AI ์•ˆ์ „ ๋ถ„์•ผ๋ฅผ ์—ฐ๊ฒฐํ•ฉ๋‹ˆ๋‹ค.

์ ์šฉ ์‹œ๋‚˜๋ฆฌ์˜ค

  1. ๋ชจ๋ธ ํ‰๊ฐ€: ๋ชจ๋ธ ๊ฐœ๋ฐœ ๊ณผ์ •์—์„œ ์ž ์žฌ์  ํŽธํ–ฅ์„ ๊ฐ์ง€ํ•ฉ๋‹ˆ๋‹ค.
  2. ์‘์šฉ ๊ฐ์‹œ: ๋ฐฐํฌ๋œ ๋ชจ๋ธ์˜ ๊ณต์ •์„ฑ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.
  3. ์—ฐ๊ตฌ ๋„๊ตฌ: ๊ด€๋ จ ๋ถ„์•ผ ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ ๋ถ„์„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
  4. ๊ต์œก ๋ชฉ์ : AI ์‹œ์Šคํ…œ์˜ ์‚ฌํšŒ์  ์˜ํ–ฅ์„ ์ดํ•ดํ•˜๋Š” ๋ฐ ๋„์›€์„ ์ค๋‹ˆ๋‹ค.

์ฐธ๊ณ ๋ฌธํ—Œ

๋ณธ ๋…ผ๋ฌธ์€ ์‹ฌ๋ฆฌํ•™, ์‚ฌํšŒ์–ธ์–ดํ•™ ๋ฐ ๊ณ„์‚ฐ ์–ธ์–ดํ•™ ๋“ฑ ์—ฌ๋Ÿฌ ๋ถ„์•ผ์˜ ์ค‘์š”ํ•œ ์—ฐ๊ตฌ๋ฅผ ์ฐธ๊ณ ํ–ˆ์œผ๋ฉฐ, ํŠนํžˆ:

  • Fiske et al. (2002)์˜ ๊ณ ์ •๊ด€๋… ๋‚ด์šฉ ๋ชจ๋ธ
  • Blodgett et al. (2016)์˜ ๋ฐฉ์–ธ ์—ฐ๊ตฌ ๋ฐ์ดํ„ฐ์…‹
  • LLM ํŽธํ–ฅ ๋ฐ ๊ณต์ •์„ฑ์— ๊ด€ํ•œ ์ตœ๊ทผ ์—ฐ๊ตฌ

์ „์ฒด ํ‰๊ฐ€: ์ด๊ฒƒ์€ ๋ฐฉ๋ฒ•๋ก ์  ํ˜์‹ , ์‹คํ—˜ ์„ค๊ณ„ ๋ฐ ์‚ฌํšŒ์  ์˜์˜ ์ธก๋ฉด์—์„œ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•˜๋Š” ๊ณ ํ’ˆ์งˆ ์—ฐ๊ตฌ ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค. "์ธ๊ณต์  ์ธ์ƒ" ๊ฐœ๋…์„ ๋„์ž…ํ•จ์œผ๋กœ์จ LLM ํ–‰๋™์„ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ๊ด€์ ์„ ์ œ๊ณตํ•˜๋ฉฐ, AI ๊ณต์ •์„ฑ ์—ฐ๊ตฌ๋ฅผ ์ถ”์ง„ํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•œ ๊ฐ€์น˜๋ฅผ ๊ฐ€์ง‘๋‹ˆ๋‹ค.