less than 1 minute read

Meta info.
  • Authors: Shangjian Yin, Peijie Huang, Yuhong Xu, Haojing Huang, Jiatian Che
  • Paper: https://arxiv.org/pdf/2403.04481
  • Affiliation: SCAU, Tsinghua Univ.
  • Published: March 7, 2024

TL; DR

SLU(Spoken Language Understanding)์— ๋Œ€ํ•œ LLM ํ™œ์šฉ ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ LM-MixATIS, LM-MixSNIPS ๋ฒค์น˜๋งˆํฌ ๋ฐ metric ์ œ์•ˆ

Untitled

Untitled

Problem States

  1. LLM์ด multi-intent SLU์— ๋Œ€ํ•ด ๊ธฐ์กด์˜ SOTA๋ชจ๋ธ๊ณผ ๋น„๊ตํ•œ ์„ฑ๋Šฅ์€ ์–ด๋– ํ•œ๊ฐ€?
    • multi-intent SLU: multi-intent detection + (entity) slot-filling
  2. LLM ๊ทœ๋ชจ๊ฐ€ ์„ฑ๋Šฅ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š”๊ฐ€?
  3. LLM์„ ํ™œ์šฉํ•œ SLU์— ์ ํ•ฉํ•œ metric์€?

Effects

  • Experiment
    • model: QLoRA-tuned LLM ์‚ฌ์šฉ, task-specific fine-tuning ํ•„์š”ํ•˜๋‹ค๊ณ  ์–ธ๊ธ‰
    • datasets: LM-MixX(sub-intent instruction ํฌํ•จ) ์ ์šฉ
      • sub-intent instructions: ๋ฐœํ™”์—์„œ ๊ฐ intent๋ณ„๋กœ ํ•ด๋‹นํ•˜๋Š” entity ์ถ”์ถœ >slot-filling ์œผ๋กœ ์—ฐ๊ฒฐ
    • prompt: pic1 ์ฐธ๊ณ 
    • metric: Accuracy for ID, F1 for SF, Overall Accuracy + ESA, CSA ์ œ์•ˆ
      • ESA: entity๋ฅผ ์–ผ๋งˆ๋‚˜ ๋งž์ท„๋Š”์ง€
      • CSA: Accuracy for ID * ESA
  • Results
    • LM-MixX ๋ฐ์ดํ„ฐ์…‹๊ณผ ์ œ์•ˆ๋œ ํ”„๋กฌํ”„ํŠธ๋ผ๋ฉด LLM ์„ฑ๋Šฅ ์ถฉ๋ถ„ํžˆ supervised-SOTA ์ด์ƒ์œผ๋กœ ์ข‹๋‹ค
    • ๋‹ค๋งŒ ๋ชจ๋ธ ํฌ๊ธฐ๊ฐ€ ํฌ๋‹ค๊ณ  ๋ฐ˜๋“œ์‹œ ์„ฑ๋Šฅ์ด ์ข‹์€๊ฒƒ์€ ์•„๋‹ˆ๋‹ค (MixATIS)
    • ESA, CSA๋Š” ๋” ์—„๊ฒฉํ•˜๊ฒŒ SF๋ฅผ ํ‰๊ฐ€ํ•˜๋ฏ€๋กœ ์ „๋ฐ˜์ ์ธ SLU์— ๋” ๊ฐœ์„ ๋œ metric์ด๋‹ค