2 minute read

Meta info.
  • Authors: Pengcheng Jiang, Jiacheng Lin, Zhiyi Shi, Zifeng Wang, Luxi He, Yichen Wu, Ming Zhong, Peiyang Song, Qizheng Zhang, Heng Wang, Xueqiang Xu, Hanwen Xu, Pengrui Han, Dylan Zhang, Jiashuo Sun, Chaoqi Yang, Kun Qian, Tian Wang, Changran Hu, Manling Li, Quanzheng Li, Hao Peng, Sheng Wang, Jingbo Shang, Chao Zhang, Jiaxuan You, Liyuan Liu, Pan Lu, Yu Zhang, Heng Ji, Yejin Choi, Dawn Song, Jimeng Sun, Jiawei Han
  • Paper: https://arxiv.org/pdf/2512.16301
  • Affiliation: Caltech, Georgia Tech, Harvard Univ., Northwestern Univ., Princeton, Stanford Univ., TAMU, UC Berkeley, UIUC, UW, Unity, University of California San Diego
  • Published: December 18, 2025

TL; DR

agentic AI ์—ฐ๊ตฌ์—์„œ adaptation์ด๋ผ๋Š” ๊ฐœ๋…์ด ํ˜ผ์šฉ๋˜์–ด์™”๊ณ , ์ฒด๊ณ„์ ์ธ ์‹œ์Šคํ…œ ์ˆ˜์ค€ ์„ค๊ณ„ ๋ฐ ๋น„๊ต๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๊ธฐ ์œ„ํ•ด adaptation ๋Œ€์ƒ(agent vs tool)๊ณผ adaptation์„ ์œ ๋„ํ•˜๋Š” ์‹ ํ˜ธ๋ฅผ ๊ตฌ๋ถ„ํ•˜๋Š” ๋ถ„๋ฅ˜ ์ฒด๊ณ„ ์ œ์•ˆ

image.png

image.png

image.png

image.png

image.png

image.png

image.png

image.png

image.png

Background

Agentic AI์—์„œ adaptation์ด๋ผ๋Š” ํ‘œํ˜„์˜ ๋‚จ๋ฐœ

  • LLM fine-tuning, prompt update, memory ์ˆ˜์ •, retriever ๋ณ€๊ฒฝ, search ์ „๋žต ๊ฐœ์„ , sub-agent ํ•™์Šต, โ€ฆ
  • adaptation์˜ ๋Œ€์ƒ๋„ ๋‹ค๋ฅด๊ณ  signal๋„ ๋‹ค๋ฅด๊ณ , system ๋ฆฌ์Šคํฌ, ๋น„์šฉ ๋ชจ๋‘ ๋‹ค๋ฆ„ โ†’ ๋„ˆ๋ฌด ๋‹ค๋ฅธ ๊ฒƒ๋“ค์„ ๊ฐ™์€ ํ‘œํ˜„ ์•„๋ž˜์„œ ๋น„๊ตํ•˜๊ณ  ์žˆ์Œ

Problem States

Adaptation์€ (1) ๋ฌด์—‡(agent vs. tool)์„ ๋ฐ”๊พธ๋Š”๊ฐ€ (2) ์–ด๋–ค ์‹ ํ˜ธ(execution vs. output)๋กœ ๋ฐ”๊พธ๋Š”๊ฐ€์— ์˜ํ•ด ๊ตฌ๋ถ„ํ•ด์•ผ ํ•œ๋‹ค.

Suggestions

2by2 quadrant; ์–ด๋–ค ์—ฐ๊ตฌ๊ฐ€ ์–ด๋–ค adaption์„ ํƒ€๊นƒํ•˜๋Š”๊ฐ€?

  • ์ ์‘๋Œ€์ƒ : Agent / Tool
  • ์‹ ํ˜ธ:
    • A1 (agent์— ๋Œ€ํ•ด) Tool execution : ํ˜„์‹ค ํ”ผ๋“œ๋ฐฑ ๊ธฐ๋ฐ˜ agent ํ•™์Šต
    • A2 (agent์— ๋Œ€ํ•ด) Agent output : ์ž๊ธฐ ์ถœ๋ ฅ ๊ธฐ๋ฐ˜ agent ํ•™์Šต
    • T1 (tool์— ๋Œ€ํ•ด) Agent-agonistic : agent์™€ ๋ฌด๊ด€ํ•œ ๋„๊ตฌ ๊ฐœ์„ 
    • T2 (tool์— ๋Œ€ํ•ด) Agent supervised : agent ํ–‰๋™์„ supervision ์‹ ํ˜ธ๋กœ ์“ฐ๋Š” ๋„๊ตฌ adaptation

Effects

  • ์ด ํ”„๋ ˆ์ž„์„ ์ ์šฉํ•˜๋ฉด ๊ธฐ์กด ์—ฐ๊ตฌ์˜ ๋น„๊ต๊ฐ€ ๋ฌด์˜๋ฏธํ•ด์ง
    • e.g.
      • ์ด agent๋Š” tool-use๊ฐ€ ๋›ฐ์–ด๋‚˜๋‹ค โ†’ agent adaption์ธ๊ฐ€? tool adaptation์ธ๊ฐ€?
      • memory๋ฅผ ์—…๋ฐ์ดํŠธํ•˜๋‹ˆ ์„ฑ๋Šฅ์ด ์˜ค๋ฅธ๋‹ค โ†’ agent๊ฐ€ ๋ฐ”๋€๊ฑด๊ฐ€, tool(memory module)์ด ๋ฐ”๋€๊ฑด๊ฐ€?
      • execution feedback์œผ๋กœ ํ•™์Šตํ–ˆ๋‹ค โ†’ agent๋ฅผ? retriever(search module)๋ฅผ?
  • ์—ฐ๊ตฌ ๋ฐฉํ–ฅ ์ œ์•ˆ
    • Hybrid / co-adaptation: agent 1๊ฐœ๊ฐ€ ๋ชจ๋“  adaptation์„ ๋ถ€๋‹ดํ•˜๊ฑฐ๋‚˜ (๋น„์šฉ, ๋ถˆ์•ˆ์ •์„ฑ, โ€ฆ) tool-only adaptation(๋‚ฎ์€ ํ‘œํ˜„๋ ฅ)๋ณด๋‹ค๋Š” agent์™€ tool์˜ ์—ญํ•  ๋ถ€๋‹ด์„ ์ฃผ์š” ์„ค๊ณ„๋ณ€์ˆ˜๋กœ ์„ค์ •ํ—ค์•ผ ํ•œ๋‹ค.
    • T2 (Agent-supervised Tool Adaptation) : memory update, retriever tuning, search sub-agent, planner refinement์€ ์ƒ๋Œ€์ ์œผ๋กœ ์ €๋ ดํ•˜๊ณ  online/continual์— ์ ํ•ฉํ•˜๋ฉด์„œ ์ƒ๋Œ€์ ์œผ๋กœ Safety ํ†ต์ œ๋„ ์‰ฌ์›€ โ†’ ์‹ค์ œ ์‹œ์Šคํ…œ์€ T2๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ์ปค์งˆ ๊ฒƒ

Personal note. ์„œ๋ฒ ์ดํŽ˜์ดํผ๋ผ ๋‚ด์šฉ์„ ์žฌ๊ตฌ์„ฑํ•˜๋Š” ๊ฒƒ์€ ํฐ ์˜๋ฏธ๋Š” ์—†์„ ๊ฒƒ ๊ฐ™์•„์„œ, ์ œ๊ฐ€ agent/tool-use ๊ด€๋ จ ์—ฐ๊ตฌ ์ง„ํ–‰ํ•˜๋ฉด์„œ ์ƒ๊ฐํ•ด๋ณผ๋งŒํ–ˆ๋˜ ์ง€์  ์งง๊ฒŒ ์ •๋ฆฌํ•ด๋ด…๋‹ˆ๋‹ค. ์ €์ž๋“ค์€ ์‹ค์งˆ์ ์œผ๋กœ T2,๊ทธ๋Ÿฌ๋‹ˆ๊นŒ agent๋Š” ๊ณ ์ •ํ•˜๊ณ  agent์˜ output์œผ๋กœ tool์„ adaptationํ•ด์•ผํ•œ๋‹ค๋Š” ์ž…์žฅ์„ ๋ฐ€์–ด์ฃผ๊ณ  ์žˆ๋‹ค๊ณ  ๋А๊ผˆ๊ณ , ์ € ์—ญ์‹œ ์ €์ž๋“ค๊ณผ ๊ฐ™์€ ์ƒ๊ฐ์œผ๋กœ memory๋ฅผ ์—ฐ๊ตฌํ•˜๊ณ  ์žˆ๊ธฐ๋Š” ํ•˜์ง€๋งŒ, memory๋ฅผ tool๋กœ ๋ฌถ์–ด๋„ ๋ ์ง€(T2) ๋Š” ์กฐ๊ธˆ ๊ณ ๋ฏผํ•ด๋ณผ ์—ฌ์ง€๊ฐ€ ์žˆ๋Š” ๊ฒƒ ๊ฐ™์•„์š”.

  • ์‹œ์Šคํ…œ์—์„œ adaptation์˜ ์ฑ…์ž„์€ ์–ด๋””์— ์žˆ๋Š”๊ฐ€? agent? tool? memory?
  • adaptation์ด online์ธ๊ฐ€, offline์ธ๊ฐ€? ๋น„์šฉ์ด๋‚˜ ์•ˆ์ •์„ฑ์€ ๊ฐ๋‹นํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€?
  • rollback์˜ ๊ฐ€๋Šฅ์„ฑ์ด ์žˆ๋Š”๊ฐ€?
    • agent adaptation์€ ์ƒ๋Œ€์ ์œผ๋กœ ์–ด๋ ต๊ณ , tool adaptation์€ ์‰ฌ์šธ ๋“ฏ

      Comment. ๋ญ”๊ฐ€ ์—ฌ๋Ÿฌ๊ฐ€์ง€๊ฐ€ ๋‚จ๋ฐœ๋˜๊ณ  ์ œ๋Œ€๋กœ ์ •์˜๋„ ๋˜์ง€ ์•Š์•„ ๊ด€๋ จ ์—ฐ๊ตฌ์˜ ์ •๋ฆฌ๊ฐ€ ์–ด๋ ค์› ๋˜ ๋ถ€๋ถ„๋“ค์ด ์ด๋Ÿฌํ•œ ๋ฅ˜์˜ ํŽ˜์ดํผ๋“ค์ด ๋‚˜์˜ค๋ฉด์„œ ์–ด๋А์ •๋„ ์ •๋ฆฌ๊ฐ€ ๋˜์–ด๊ฐ€๋Š” ๊ฒƒ ๊ฐ™์Šต๋‹ˆ๋‹ค. ์ •๋ฆฌ๊ฐ€ ๋˜์–ด๊ฐ„๋‹ค๋Š” ๊ฒƒ์€ ํ•„๋“œ๊ฐ€ ๊ณ ์ธ๋ฌผ์ด ๋˜์–ด๊ฐ„๋‹ค๋Š” ์˜๋ฏธ์ด๊ธฐ๋„ ํ•ด์„œ ๋” ๊ณ ์ฐฉํ™”๋˜๊ธฐ ์ „์— ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ๋ฅผ ํ•ด๋ณด๋Š”๊ฒŒ ์ค‘์š”ํ•  ๊ฒƒ ๊ฐ™์Šต๋‹ˆ๋‹ค.