3 minute read

Meta info.
  • Authors: Timothy Obiso, Kenneth Lai, Abhijnan Nath, Nikhil Krishnaswamy, James Pustejovsky
  • Paper: https://arxiv.org/pdf/2506.10934
  • Affiliation: Brandeis University, Colorado State University
  • Published: June 12, 2025

TL; DR

๋Œ€ํ™”์—์„œ belief์€ ํ†ต์ƒ ์—ฐ๊ตฌ๋“ค์˜ ๊ฐ€์ •์ฒ˜๋Ÿผ '๋งค๋„๋Ÿฝ๊ฒŒ' ์—…๋ฐ์ดํŠธ ๋˜์ง€ ์•Š์œผ๋ฏ€๋กœ, ์ƒˆ๋กœ์šด ์ •๋ณด์— ๋Œ€ํ•œ ์ˆ˜์šฉ ์ €ํ•ญ(epistemic friction)์„ ์ •๋Ÿ‰ํ™”/๋ฒกํ„ฐํ™”ํ•˜์—ฌ ๋ชจ๋ธ๋งํ•˜๋Š” belief ๋ณ€ํ™” ๋ชจ๋ธ๋ง ์ œ์•ˆ

image 1 image

Background

  • ๊ธฐ์กด LLM ์‹œ์Šคํ…œ/๋Œ€ํ™”์ด๋ก ์˜ ์ „์ œ๋Š”, โ€˜๋ฐœํ™”๊ฐ€ ์‹ ๋ขฐ๋˜๋ฉด ์ •๋ณด๋Š”ย ์ฆ‰์‹œ belief์— ๋ฐ˜์˜๋œ๋‹คโ€™ ์ฆ‰, ๋งˆ์ฐฐ์ด ์—†๋‹ค๊ณ  ๊ฐ€์ •
    • ์‚ฌ๋žŒ๋“ค์€ ๋Œ€ํ™” ์ค‘ common ground์„ ๊ฐ€์ •, ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋Œ€ํ™”๊ฐ€ ์ด๋ฃจ์–ด์ง„๋‹ค๊ณ  ๊ฐ€์ • (Grice, 1975; Stalnaker, 2002; Clark & Wilkes-Gibbs, 1986 ๋“ฑ๋“ฑ)
    • DEL ์—ญ์‹œ ์‹ ๋… ๋ณ€ํ™”๋ฅผ smoothํ•˜๋‹ค๊ณ  ๊ฐ€์ •
  • ๊ทธ๋Ÿฌ๋‚˜ ์‹ค์ œ ์ธ๊ฐ„ ๋Œ€ํ™”์—์„œ๋Š” ์˜คํ•ด/์˜์‹ฌ/์ถฉ๋Œ/๋ฌด์‹œ ๋“ฑ ์ €ํ•ญ/๋งˆ์ฐฐ์ด ์ผ๋ฐ˜์  > ์ •๋ณด ํก์ˆ˜๊ฐ€ โ€˜๋งˆ์ฐฐ ์—†์ดโ€™ ์ด๋ฃจ์–ด์ง€๋Š” ๊ฒƒ์€ ์•„๋‹˜

Problem States

๋Œ€ํ™” ์† ์‹ ๋… ๋ณ€ํ™”๊ฐ€ ํ•ญ์ƒ โ€˜๋งˆ์ฐฐ ์—†์ดโ€™ ํ†ตํ•ฉ๋œ๋‹ค๊ณ  ๊ฐ€์ •ํ•˜๋Š” ๊ธฐ์กด ๋ชจ๋ธ์€ ํ˜„์‹ค์˜ ๋น„์ •๋ ฌ์„ฑ, ์ €ํ•ญ, ์ถฉ๋Œ์„ ๊ฐ„๊ณผํ•˜๊ณ  ์žˆ์œผ๋ฏ€๋กœ, ์ด์— ๋”ฐ๋ผ ์ž˜๋ชป๋œ ์ •๋ ฌ/์ €ํ•ญ์ด ๋Œ€ํ™”์™€ ์ธ๊ฐ„-AI ์ƒํ˜ธ์ž‘์šฉ์„ ์–ด๋–ป๊ฒŒ ํ˜•์„ฑํ•˜๋Š”์ง€ ๋ฌด์‹œํ•œ๋‹ค > ์‹ ๋… ์—…๋ฐ์ดํŠธ์—์„œ ์ €ํ•ญ์„ ์ •๋Ÿ‰ํ™”ํ•  ์ˆ˜ ์žˆ์„๊นŒ?

Suggestions

์ˆ˜์šฉ ์ €ํ•ญ์„ DEL๊ณผ ๋ฒกํ„ฐ ์—ฐ์‚ฐ์œผ๋กœ ๋ชจ๋ธ๋งํ•˜์ž == DEF(Dynamic Epistemic Friction) ์ œ์•ˆ

  • DEL: ์ •๋ณด ์—…๋ฐ์ดํŠธ์™€ ์‹ ๋… ๋ณ€ํ™” ๊ณผ์ •์„ ํ˜•์‹์ ์œผ๋กœ ๋‹ค๋ฃจ๋Š” ๋…ผ๋ฆฌ ์ฒด๊ณ„
    • ์„ธ๊ณ„(W), ์ ‘๊ทผ ๊ฐ€๋Šฅ์„ฑ(Ra), ์ฐธ๊ฐ’ ํ• ๋‹น(V)์œผ๋กœ ๊ตฌ์„ฑ๋œ ๋ชจ๋ธ M
    • ์–ด๋–ค ์‚ฌ๋žŒ์ด ฯ†๋ฅผ ๋ฏฟ๋Š”๋‹ค๊ณ  ํ•  ๋•Œ, ๊ทธ๊ฐ€ ๊ฐ€๋Šฅํ•œ ํ˜„์‹ค๋กœ ์—ฌ๊ธฐ๋Š” ๋ชจ๋“  ์„ธ๊ณ„(w โˆˆ W)์—์„œ๋Š” ฯ†๊ฐ€ ์ฐธ์ด์–ด์•ผ ํ•จ (i.e., ฯ† holds in all Ra-accessible worlds)
  • epistemic friction: โ€œ์ •๋ณด ์ˆ˜์šฉ ์‹œ ์ƒ๊ธฐ๋Š” ์‹ฌ๋ฆฌ์ /๋…ผ๋ฆฌ์  ์ €ํ•ญ(resistance)โ€
    • ๋งˆ์ฐฐ์ด ํด์ˆ˜๋ก ๋Œ€ํ™” ์ƒ๋Œ€๊ฐ€ ์ •๋ณด๋ฅผ ๋ฐ›์•„๋“ค์ด๊ธฐ ์–ด๋ ต๊ณ , ์„ค๋“/์„ค๊ณ„ํ•  ๋•Œ ๋” ๋งŽ์€ ์ฆ๊ฑฐ/์ •๊ตํ•œ ํ‘œํ˜„์ด ํ•„์š”
    • Formalization: F(ฯ†, B, E) โˆ 1 -ย alignment(ฯ†, B, E), where ฯ† = proposition, B = current belief state, E = evidence,
    • ๋งˆ์ฐฐ(F) = 1-์ •๋ ฌ(alignment) ๋กœ ์ •์˜ํ•˜์—ฌ, ์ •๋ ฌ์ด ๋œ๋์„์ˆ˜๋ก ๋งˆ์ฐฐ์€ ์ปค์ง„๋‹ค.
      • alignmentย = ๊ธฐ์กด ์‹ ๋…๊ณผ ฯ†+E ๊ฐ„์˜ ์œ ์‚ฌ๋„
      • Friction: ์ƒˆ๋กœ์šด ์ฃผ์žฅ ฯ†๊ฐ€ ์—์ด์ „ํŠธ์˜ ๊ธฐ์กด ์‹ ๋… B_a์™€ ์ถฉ๋Œํ•  ๋•Œ ๋ฐœ์ƒํ•˜๋Š” ์‹ ๋…์˜ ์ €ํ•ญ
    • ๊ธฐ์กด DEL์—์„œ๋Š”, Bโ€™_a = B_a โˆช {ฯ†} ํ•˜๋ฉด ๋์ธ๋ฐ
    • ์—ฐ๊ตฌ์—์„œ์˜ ์ œ์•ˆ์€ ์ด ฯ†๊ฐ€ ๊ธฐ์กด ์‹ ๋…์—์„œ ์—ฐ์—ญ๋˜์ง€ ์•Š์œผ๋ฉด friction์œผ๋กœ ์ฒ˜๋ฆฌํ•ด์•ผ ํ•œ๋‹ค๋Š” ์ฃผ์žฅ.
      • ๋ฒกํ„ฐ ๊ธฐ๋ฐ˜ (HRR)์œผ๋กœ ์ ‘๊ทผํ•˜์—ฌ belief์™€ ฯ†+E ๋ฒกํ„ฐ๊ฐ€ orthogonalํ•˜๋ฉด frictionย ๐Ÿ”ผ
      • ์ฆ‰ ์ƒˆ๋กœ์šด ์ •๋ณด ฯ†๊ฐ€ ๊ธฐ์กด ์‹ ๋… ์ง‘ํ•ฉ์œผ๋กœ๋ถ€ํ„ฐ ๋‹จ์ˆœ ๋ง๋ถ™์ด๋Š” ๊ฒƒ์ด ์•„๋‹Œ revision์„ ์š”๊ตฌํ•  ๊ฒฝ์šฐ ์ด๋Š” ๋งˆ์ฐฐ๋กœ ํ•ด์„
  • Vectorized Belief & Update:ย ์‹ ๋…์„ vector๋กœ ๋‘๊ณ , alignment๋ฅผ cos sim์œผ๋กœ ๊ณ„์‚ฐ, ฮฒ, ฮฑ๋Š” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ
    • ์ •๋ ฌ๋œ ์ •๋ณด๋Š” ๊ฐ•ํ™”๋˜๊ณ , ์ถฉ๋Œ๋˜๋Š” ์ •๋ณด๋Š” ๋ถ€๋ถ„ ์ˆ˜์šฉ > belief๋ฅผ ๋œ ์›€์ง์ž„
      • ์ •๋ ฌ๋œ ์ •๋ณด๋Š” ฮฑยทcos ์œ ์‚ฌ๋„๋งŒํผ belief๋ฅผ ์ฆํญ
      • ์ถฉ๋Œ๋œ ์ •๋ณด๋Š” ์ œํ•œ๋œ ๋ฒ”์œ„ ๋‚ด์—์„œ๋งŒ ๋ฐ˜์˜ (min(ฮฒ, โ€ฆ))
    • Formalization: Bโ€™_a = B_a + min(ฮฒ, ฮฑ ร— sign) ร— CosSim(B_a, ฯ†_b) โŠ™ ฯ†_bย Eq.17
      • alignment(ฯ†, B, E) = CosSim(v_B, v_ฯ† + v_E)
  • Friction Equilibrium: ๋Œ€ํ™”๋ฅผ ํ†ตํ•ด ์ฐธ๊ฐ€์ž์˜ belief๊ฐ€ ์ ์ง„์ ์œผ๋กœ ์กฐ์ •
    • ๋งค turn๋งˆ๋‹ค
      1. ๊ฐ proposition ฯ†_i์— ๋Œ€ํ•ด friction F_i(a)๋ฅผ ๊ณ„์‚ฐ
      2. ๋งˆ์ฐฐ์ด ํฐ ์ฃผ์žฅ๋“ค๋งŒ ์„ ํƒย (high-friction subset H)
      3. ๊ฐ€์žฅ ๋งˆ์ฐฐ์ด ํฐ ๊ฒƒ๋ถ€ํ„ฐย ์ฆ๊ฑฐ ์ถ”๊ฐ€ or ์ฃผ์žฅ ์ˆ˜์ •
      4. ์—…๋ฐ์ดํŠธ ํ›„ ์ „์ฒด ๋งˆ์ฐฐ์„ ๋‹ค์‹œ ์ธก์ • > ๊ท ํ˜•(Equilibrium)์— ๋„๋‹ฌํ•  ๋•Œ๊นŒ์ง€ ๋ฐ˜๋ณต
        • Equilibrium:ย ์ „์ฒด ๋งˆ์ฐฐ ํ‰๊ท ์ด ๊ธฐ์ค€๊ฐ’ ์ดํ•˜๊ฐ€ ๋  ๋•Œโ€™ย ๋„๋‹ฌ

Effects

DEF๋Š” ํ˜„์‹ค์  belief ์ถ”๋ก ์ด ๊ฐ€๋Šฅํ•˜๊ณ , friction ์กฐ์ ˆ์ด ์„ฑ๋Šฅ์— ๋งค์šฐ ์ค‘์š”ํ•˜๋‹ค

  • task:ย Weights Task Dataset (3์ธ ํ˜‘์—… ๊ณผ์ œ, ๋ธ”๋ก ๋ฌด๊ฒŒ ์ถ”๋ก )
    • DEF ๋ชจ๋ธ์ด ์ฐธ๊ฐ€์ž์˜ ์ตœ์ข… belief state ์˜ˆ์ธก ๊ฐ€๋Šฅํ•œ๊ฐ€?
    • ๊ฐ€์žฅ ๋ฐœํ™” ์ ์€ ์ฐธ๊ฐ€์ž 1๋ช… ์„ ํƒ > belief ๋ฒกํ„ฐ ์ถ”์ 
    • ๋ฐœํ™”๋งˆ๋‹ค vectorized proposition์œผ๋กœ ๋ณ€ํ™˜ํ•ดย Eq.17์— ๋”ฐ๋ผ ์—…๋ฐ์ดํŠธ
    • ์ตœ์ข… belief ๋ฒกํ„ฐ์™€ ์ •๋‹ต([10,10,20,30,50]) ๋น„๊ต > RMSE ์ธก์ •
    • ฮฑ=5, ฮฒ=2๋กœ friction ๊ฐ•๋„ ์กฐ์ ˆ (๊ฐ€์žฅ ์„ฑ๋Šฅ ์ข‹์•˜๋‹ค๊ณ )
  • Results:
    • tab2ย k๊ฐ€ ํด์ˆ˜๋ก(๋Œ€ํ™”๋ฅผ ๊ธธ๊ฒŒ ์ค€๋‹ค๊ณ ) ๋ฐ˜๋“œ์‹œ ์„ฑ๋Šฅ์ด ์ข‹์•„์ง€๋Š” ๊ฑด ์•„๋‹˜ (์ฆ‰ long context๊ฐ€ ํ•ญ์ƒ ์ด๋กญ์ง„ ์•Š์Œ)
      • ๋Œ€๋ถ€๋ถ„ ๊ทธ๋ฃน์—์„œ 2~3g ์˜ค์ฐจ ์ˆ˜์ค€์œผ๋กœ ์ •ํ™•๋„ ๋†’์Œ
      • yellow block์€ ๋Šฆ๊ฒŒ ์–ธ๊ธ‰๋˜์–ด ๋งˆ์ฐฐ ํผ > ์˜ˆ์ธก ์–ด๋ ค์›€: ๋ฐ˜๋ณต๋œ ๋ฐœํ™”, ์ฆ๊ฑฐ๊ฐ€ ๋งŽ์„์ˆ˜๋ก friction์ด ์ค„์–ด๋“ฆ์œผ๋กœ, ๋ฐ˜๋Œ€๋กœย ์ ๊ฒŒ ๋“ฑ์žฅํ•˜๊ฑฐ๋‚˜ ๋งฅ๋ฝ ์—†์ด ์–ธ๊ธ‰๋˜๋ฉดย ๊ธฐ์กด belief์™€ ์ž˜ ์ •๋ ฌ๋˜์ง€ ์•Š์Œ >ย alignmentย ๐Ÿ”ฝย = frictionย ๐Ÿ”ผ
    • fig2ย ๋งˆ์ฐฐ์„ ์กฐ์ ˆํ•˜์ง€ ์•Š์œผ๋ฉดย belief๊ฐ€ ์™œ๊ณก๋จ: DEF๋Š” ๋งˆ์ฐฐ ๊ณ„์ˆ˜(ฮฑ, ฮฒ)๋ฅผ ์ž˜ ์กฐ์ ˆํ•ด์•ผ ์„ฑ๋Šฅ์ด ๋†’์•„์ง„๋‹ค

Personal note. ๊นŠ์ด ๊ณ ๋ฏผํ•ด๋ณด์ง€ ์•Š์€ setup์ด๋ผ + ๋…ผ๋ฆฌ๋ช…์ œ๋ฅผ ๋‹ค๋ฃฌ ์—ฐ๊ตฌ(HRR)๋ผ ์กฐ๊ธˆ ๊นŒ๋‹ค๋กญ๊ฒŒ ๋А๊ปด์งˆ ๋ป” ํ–ˆ๋Š”๋ฐ, ๊ทธ๋ ‡๊ธฐ ๋•Œ๋ฌธ์— notation๋งŒ ์ž˜ ์žก๊ณ  ๊ฐ€๋ฉด ์ดํ•ดํ•˜๊ธฐ ์–ด๋ ค์šธ ์ •๋„๋Š” ์•„๋‹™๋‹ˆ๋‹ค. ์ œ์•ˆํ•˜๊ณ  ์žˆ๋Š” โ€˜์ˆ˜์šฉ ๋งˆ์ฐฐโ€™์ด๋ผ๋Š” ๊ฐœ๋…๊ณผ ๋ฌธ์ œ๋Š” ์œ ์šฉํ•œ ๊ฒƒ ๊ฐ™์€๋ฐ, ํ˜„์‹ค์—์„œ ฯ†๋ฅผ ์–ด๋–ป๊ฒŒ ์ •์˜ํ•  ์ˆ˜ ์žˆ์„์ง€, ํ˜น์€ ์ด vector๊ฐ€ ์–ผ๋งˆ๋‚˜ ๊ฐ•๊ฑดํ•œ์ง€์— ๋Œ€ํ•œ ๋…ผ์˜๊ฐ€ ํ•„์š”ํ•  ๊ฒƒ ๊ฐ™์Šต๋‹ˆ๋‹ค. ๊ทธ์— ์•ž์„œ์„œ ์‹ ๋…์€ ์–ด๋–ป๊ฒŒ ์ •์˜ํ• ์ง€ (ํ˜„์‹ค์˜ ๋Œ€ํ™”์—์„œ๋Š” ์‹ ๋…์ด ํ•ญ์ƒ ๋“œ๋Ÿฌ๋‚˜๊ฑฐ๋‚˜ ์ •์˜๋˜๋Š” ๊ฒƒ์€ ์•„๋‹ˆ๋ฏ€๋กœ..) ๋„ ๋ฌผ๋ก  ํ•œ๊ณ„๋กœ ๋ณด์ž…๋‹ˆ๋‹ค. ์‹คํ—˜์ ์œผ๋กœ ์ถฉ๋ถ„ํ•˜์ง€ ์•Š๊ณ (target task๊ฐ€ WTD ํ•œ๊ฐ€์ง€) ๋น„๊ต ๋Œ€์ƒ๋„ ๋งˆ๋œฉ์ฐฎ์ง€๋งŒ, ์•„์ง ๊ฐœ๋…/์ด๋ก  ์ œ์•ˆ์— ๊ฐ€๊น๊ธฐ๋„ ํ•˜๊ณ , ์ฒซ๊ฑธ์Œ์œผ๋กœ์„œ๋Š” ๋ˆˆ์—ฌ๊ฒจ๋ณผ๋งŒํ•œ ์‹œ์ž‘์ด ๋  ๊ฒƒ ๊ฐ™์Šต๋‹ˆ๋‹ค.