Self-Discover: Large Language Models Self-Compose Reasoning Structures

March 5, 2024 less than 1 minute read

Meta info.

Authors: Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng
Paper: https://arxiv.org/pdf/2402.03620.pdf
Affiliation: Google DeepMind, USC
Published: February 6, 2024

TL; DR

델이 여러 reasoning techniques(CoT, critical thinking, ...) 중에서 하나를 스스로 선택하여 task별로 적합한 추론 전략을 구성하도록 하는 프레임워크 제안. BBH에서 단순 CoT보다 성능이 좋고 CoT Self-consistency보다도 추론 연산이 10~40x 덜 든다고. sLLM에서 더 잘된다고 언급.

Untitled

Suggestions

stage1: task level에서 추론 구조 선택. 전체 key-value format 생성
- 3가지 meta-prompt
  - select: proper framework (CoT, critical thinking, …)
  - adapt: rephrase for specific task
  - implement: actionable to fill the values
stage2: value-filling 포맷으로 instance level solving

Personal note. 사전학습이나 label을 주지 않는 점이 orca 학습방식과 차이인 듯 합니다.(Orca는 작은 모델 instruction tuning할 때 어떤 instruction을 줄 지 모델이 선택하게 하는 거였고, self-discover는 비슷한 방식을 icl에서 접근.)