Yejin Yoon

Yejin Yoon

Ph.D Candidate at Hanyang University.

Posts 2025

← All years 2024 →

Dec 2025 5 posts

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models

alignment-learning personalization

Adaptation of Agentic AI

agent function-calling memory

Budget-Aware Tool-Use Enables Effective Agent Scaling

agent function-calling

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

function-calling

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory

self-improvement memory

Nov 2025 3 posts

General Agentic Memory via Deep Research

memory multi-agent

Flipping the Dialogue: Training and Evaluating User Language Models

dialogue-system multi-turn

HaluMem: Evaluating Hallucinations in Memory Systems of Agents

Oct 2025 3 posts

Reasoning with Sampling: Your Base Model is Smarter Than You Think

reinforcement-learning post-training reasoning

LightMem: Lightweight and Efficient Memory-Augmented Generation

dialogue-system long-context rag +1

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

function-calling

Sep 2025 5 posts

DiaTool-DPO: Direct Preference Optimization for Controlling Conversation Flow in Tool-Augmented LLMs

dialogue-system function-calling

Facilitating Multi-Turn Function Calling for LLMs via Compositional Instruction Tuning

dialogue-system function-calling

Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity

dialogue-system

MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs

benchmark evaluation long-context +1

GraphRAG-R1: Graph Retrieval-Augmented Generation with Process-Constrained Reinforcement Learning

odqa ppo rag +1

Aug 2025 4 posts

SSRL: Self-Search Reinforcement Learning

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

ensemble llm-as-a-judge reinforcement-learning

TO CHAT OR TASK: a Multi-turn Dialogue Generation Framework for Task-Oriented Dialogue Systems

benchmark dialogue-system

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

Jul 2025 4 posts

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

llm-as-a-judge self-improvement

Exploring Persona Sentiment Sensitivity in Personalized Dialogue Generation

dialogue-system

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

reinforcement-learning

MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents

benchmark memory

Jun 2025 5 posts

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

diffusion long-context

Dynamic Epistemic Friction in Dialogue

dialogue-system

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

lrm self-improvement reasoning

CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions

dialogue-system

May 2025 2 posts

ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents

agent prompting reasoning

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

long-context rag prompt-compression

Apr 2025 4 posts

TTRL: Test-Time Reinforcement Learning

reinforcement-learning test-time-scaling

Reasoning Models Can Be Effective Without Thinking

Concise Reasoning via Reinforcement Learning

reinforcement-learning reasoning

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

agent self-improvement multi-agent

Mar 2025 8 posts

Reasoning to Learn from Latent Thoughts

Scaling Laws of Synthetic Data for Language Models

llm-as-a-judge scaling-laws self-improvement +1

A-MEM: Agentic Memory for LLM Agents

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

benchmark multi-modality reasoning

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning

pbrl reinforcement-learning

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

multi-linguality

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

benchmark multi-agent

Chain of Draft: Thinking Faster by Writing Less

Feb 2025 4 posts

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

LIMO - Less is More for Reasoning

The Differences Between Direct Alignment Algorithms are a Blur

alignment-learning

Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge

llm-as-a-judge self-improvement

Jan 2025 5 posts

The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

self-improvement

The GAN is dead; long live the GAN! A Modern GAN Baseline

Slow Perception: Let's Perceive Geometric Figures Step-by-step

lvlm multi-modality