
Meta info.

TL;DR

Even when LRMs appear to think, at high complexity they often fail or reason inefficiently (that is, they reason less), so their truly generalizable reasoning ability falls short.


Background

  • CoT and self-verification are the main techniques behind LLM progress on reasoning-intensive tasks
  • DeepSeek-R1 and other models reputed to reason well are generally evaluated on math benchmarks such as MATH500/AIME
    • Little consideration of data contamination
    • Little consideration of fine-grained control over complexity
    • No analysis of intermediate reasoning traces

Problem Statement

As complexity increases, how well do LRMs reason, and how far does that reasoning generalize?

  • RQ1 Does increasing 'thinking' correlate with problem-solving performance?
  • RQ2 Do LRMs genuinely reason, or do they pattern-match?
  • RQ3 Across varying(?) levels of complexity, what happens inside the reasoning traces (what mechanisms are at work)?

Suggestions

  • Goes beyond math benchmarks and proposes puzzle-based controlled experiments: Tower of Hanoi, Checker Jumping, River Crossing, Blocks World
    • The puzzles have explicit logical rules
    • Complexity is adjustable, so scaling analysis is possible
    • Simulator-based evaluation can verify not only the final answer but also the reasoning trace: overcomes the limits of existing pass@k evaluation
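
As a rough illustration of what simulator-based checking could look like, here is a minimal sketch for Tower of Hanoi. The function name `validate_hanoi`, the `(from_peg, to_peg)` move format, and the 0-indexed pegs are my own assumptions for illustration, not the paper's actual evaluation code.

```python
from typing import List, Tuple

# Hypothetical move format: (from_peg, to_peg), pegs indexed 0..2.
Move = Tuple[int, int]


def validate_hanoi(n_disks: int, moves: List[Move]) -> Tuple[bool, int]:
    """Replay a move list on a simulated board.

    Returns (solved, first_illegal_index). first_illegal_index is -1 when
    every move was legal; solved is True only if all disks end on peg 2.
    """
    # Peg 0 starts with disks n..1 (largest at the bottom, top of stack last).
    pegs = [list(range(n_disks, 0, -1)), [], []]
    for i, (src, dst) in enumerate(moves):
        in_range = src in (0, 1, 2) and dst in (0, 1, 2)
        if not in_range or src == dst or not pegs[src]:
            return False, i
        disk = pegs[src][-1]
        # A larger disk may never be placed on a smaller one.
        if pegs[dst] and pegs[dst][-1] < disk:
            return False, i
        pegs[dst].append(pegs[src].pop())
    solved = pegs[2] == list(range(n_disks, 0, -1))
    return solved, -1


if __name__ == "__main__":
    print(validate_hanoi(2, [(0, 1), (0, 2), (1, 2)]))
    print(validate_hanoi(2, [(0, 1), (0, 1)]))
```

Because the checker replays every move, it can report where a trace first goes wrong rather than only whether the final answer matches, and `n_disks` acts as the complexity knob.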

Effects

  • Fig 4, Fig 5 Three performance regimes identified for each puzzle: analysis of the intermediate reasoning traces shows where the correct answer emerges
    • Low complexity: vanilla LLM > LRM
      • LRMs often overthink, continuing to reason even after the answer has appeared early on
    • Medium complexity: the longer the LRM's reasoning (the longer the CoT path), the better the performance
    • High complexity: reasoning collapse occurs
      • With or without reasoning, the models fail all the same: accuracy 0%
      • LRMs merely collapse a little later
  • Fig 6 Reasoning models such as Claude-3.7-Thinking, DeepSeek-R1, and o3-mini show accuracy declining as complexity grows
    • Scaling anomaly: as complexity rises, the number of reasoning tokens decreases > the model gives up on reasoning (reasoning collapse)
      • LRMs stop thinking even when token budget remains
      • The authors argue that LRMs have a (structural) limitation that prevents them from scaling
  • Fig 7 Reasoning trace analysis of Claude-3.7-Thinking
    • Overthinking on simple problems
    • Answers tend to appear in the middle-to-late part of the trace = self-correction takes effect gradually
    • On hard problems it fails regardless
  • Fig 8 Even when given the gold algorithm, LRMs are not reliable (= they still fail to solve the puzzle)
    • Read as a fundamental failure of symbolic manipulation or consistency
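
To give a sense of what following a gold algorithm demands, here is the standard recursive Tower of Hanoi solution as a minimal sketch. Choosing Tower of Hanoi as the example, the name `solve_hanoi`, and the `(from_peg, to_peg)` move format are my own assumptions; this is not necessarily the exact algorithm text given to the models in the paper.

```python
from typing import List, Tuple

# Hypothetical move format: (from_peg, to_peg), pegs indexed 0..2.
Move = Tuple[int, int]


def solve_hanoi(n: int, src: int = 0, aux: int = 1, dst: int = 2) -> List[Move]:
    """Return the optimal move list for n disks from src to dst."""
    if n == 0:
        return []
    # Move n-1 disks out of the way, move the largest, then move n-1 back on top.
    return (
        solve_hanoi(n - 1, src, dst, aux)
        + [(src, dst)]
        + solve_hanoi(n - 1, aux, src, dst)
    )


if __name__ == "__main__":
    # The optimal solution has 2**n - 1 moves, so each added disk roughly
    # doubles the length of the sequence that must be executed without error.
    for n in range(1, 11):
        assert len(solve_hanoi(n)) == 2**n - 1
        print(n, len(solve_hanoi(n)))
```

The algorithm itself is a few lines, but executing it means emitting an exponentially long move sequence without a single slip, which fits the reading that the failure lies in consistent step-by-step execution rather than in finding the strategy.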

Personal note. That LLMs cannot really reason is a simple and fairly predictable conclusion. Still, while most researchers agree that LLMs do not seem to truly think, we have yet to seriously define what thinking is in the first place, and therefore what real reasoning is (can it even be defined?). In that respect, I think a lot more thought is still needed.

One point worth thinking about in this study's setup is that the complexity the authors define still looks closer to computational ability(?). In other words, it is much like LLMs being unable to add or multiply. Put differently, whether being good at computation-style reasoning makes an LLM smart seems to me a separate question.

In any case, the interpretability research that Anthropic and others are focusing on seems worth continuing to follow. Also, the dataset does not appear to have been released, but although I have not checked rigorously, it looks possible to build it by following the appendix.