less than 1 minute read

Meta info.

TL; DR

LLM์˜ decoding์„ greedy decoding์—์„œ top-k decoding์œผ๋กœ ๋ฐ”๊พธ๋ฉด prompt ์—†์ด๋„ CoT reasoning ์œ ๋„ ๊ฐ€๋Šฅ

Untitled

Untitled

Untitled

Untitled

Suggestions

  • ๋ช…์‹œ์ ์€ prompting ์—†์ด ๋””์ฝ”๋”ฉ๋งŒ ์ข€ ์กฐ์ž‘ํ•ด์ฃผ๋ฉด CoT ๋น„์Šทํ•˜๊ฒŒ ํ•  ์ˆ˜ ์žˆ๋‹ค. (๋ฌผ๋ก  ๋””์ฝ”๋”ฉ์— ์ถ”๊ฐ€ ๊ณ„์‚ฐ ๋น„์šฉ ์žˆ์Œ!)
  • Greedy Decoding ๋Œ€์‹ ์— top-k์˜ ํ›„์ˆœ์œ„ ํ† ํฐ๋“ค์„ ์กฐ์‚ฌํ–ˆ์„ ๋•Œ ์ด ์‹œํ€€์Šค์— CoT๋ฅผ ์ง„ํ–‰ํ•˜๋Š” ๊ฒฝ๋กœ๊ฐ€ ๋…น์•„์ง„ ๊ฒฝ์šฐ๊ฐ€ ๋งŽ์•˜๋‹ค๊ณ .
  • ์ด๋Ÿฐ path๋ฅผ ํƒˆ ๊ฒฝ์šฐ (๋…ผ๋ฌธ์—์„œ๋Š”ย CoT-decoding์ด๋ผ๊ณ  ๋ช…๋ช…) ๋ชจ๋ธ ์‹ ๋ขฐ๋„๋„ ๋†’์•„์ง€๋Š” ๊ฒฝํ–ฅ.

Personal note. LLM(์—ฌ๊ธฐ์„œ๋Š” PaLM-2, Mistral-7B)์—๊ฒŒ ์งˆ๋ฌธ์„ ๋„ฃ์€ ๋‹ค์Œ ๊ฐ€์žฅ ๋จผ์ € ๋‚˜์˜ฌ ์ˆ˜ ์žˆ๋Š” ์ƒ์œ„ k๊ฐœ ํ† ํฐ๋“ค๋กœ๋ถ€ํ„ฐ ๋‹ต๋ณ€์„ ์ƒ์„ฑํ–ˆ์„ ๋•Œ, ๊ฐ€์žฅ ํ™•๋ฅ ์ด ๋†’์€ ํ† ํฐ์ด ์•„๋‹Œ ๋‹ค๋ฅธ ํ† ํฐ์œผ๋กœ๋ถ€ํ„ฐ CoT์Šค๋Ÿฌ์šด & ํ›จ์”ฌ confidentํ•œ ๋‹ต๋ณ€์ด ๋‚˜์˜ฌ ์ˆ˜ ์žˆ๋‹ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค.

๋‹ค๋งŒ ๋ชจ๋ธ์ด ์–ด๋А ์ •๋„ ์ปค์•ผ(PaLM-2 Large) ์ข€ ๋ณผ๋งŒํ•œ ๋‹ต๋ณ€์„ ์ถœ๋ ฅํ•˜๊ณ , few-shot์ด๋‚˜ instruction finetuning์„ ๊ฑฐ์นœ ๋ชจ๋ธ๋ณด๋‹ค ์„ฑ๋Šฅ์ด ๋’ค์ฒ˜์ง€๋ฉฐ, ๋‹ค๋ฅธ ๋‹ต๋ณ€ ํƒ์ƒ‰์— ์ถ”๊ฐ€์ ์ธ cost๊ฐ€ ๋“œ๋Š” ๋“ฑ ์—ฌ๋Ÿฌ ๋‹จ์ ์ด ์žˆ์–ด ์•„์ง๊นŒ์ง„ ๊ฐ€๋Šฅ์„ฑ๋งŒ ๋ณด์—ฌ์ค€ ๋…ผ๋ฌธ์œผ๋กœ ๋ณด์ž…๋‹ˆ๋‹ค.