less than 1 minute read

Meta info.

TL; DR

๋ณ„๋„ ํ•™์Šต์ด๋‚˜ ํŠœ๋‹ ์—†์ด ํ•œ ์Œ์˜ pretrained LLM์œผ๋กœ ๊ฐ„๋‹จํžˆ ๊ณ„์‚ฐ๋งŒ ํ•˜๋ฉด machine generated text๋ฅผ ํƒ์ง€ํ•ด๋‚ด๋Š” ๋ฐฉ๋ฒ•๋ก  Binoculars ์ œ์•ˆ. ์ƒ์„ฑ๋œ sample 90% ์ด์ƒ ํƒ์ง€(pic1)

Untitled

Untitled 1

Untitled 2

Untitled 3

Suggestions

  • cross-perplexity: ๊ฐ„๋‹จํ•˜๊ฒŒ M1์˜ probability distribution์— M2์˜ log PPL(pic2)์„ element-wise products. ์ฆ‰, M1์˜ ์˜ˆ์ธก์ด M2์— ์˜ํ•ด ์–ด๋–ป๊ฒŒ ํŒ๋‹จ๋˜๋Š”์ง€(how surprising) weighting ํ•˜๋Š” ๋ฐฉ์‹. (pic3)
    • e.g. M1์ด ์–ด๋А token์„ ๋†’์€ ํ™•๋ฅ ๋กœ ์˜ˆ์ธกํ–ˆ์ง€๋งŒ, M2๋Š” ๋‚ฎ์€ PPL์„ ์ฃผ๋ฉด(log PPL ์€ ์ปค์ง€๋Š”), ๊ฒฐ๋ก ์ ์œผ๋กœ cross-perplexity ๊ฐ’์€ ๋†’์•„์ง€๊ณ , ์ด๋Š” ๊ณง M2 ์ž…์žฅ์—์„œ๋Š” M1์˜ ์˜ˆ์ธก์ด โ€œsurprisingโ€
  • Binoculars score (B): perplexity๋ฅผ crosss-perplexity nomalizationํ•œ ๋ฒ„์ „. (pic4)