Yejin Yoon

Yejin Yoon

Ph.D Candidate at Hanyang University.

Posts 2024

← All years 2023 →

Dec 2024 7 posts

Alignment Faking in Large Language Models

alignment-learning

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

self-improvement reasoning

The FACTS Grounding Leaderboard: Benchmarking LLMs’ Ability to Ground Responses to Long-Form Input

factuality knowledge-conflicts

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

knowledge-editing unlearning

LLM Evaluators Recognize and Favor Their Own Generations

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Reverse Thinking Makes LLMs Stronger Reasoners

Nov 2024 5 posts

Counterfactual Generation from Language Models

knowledge-editing language-modeling

Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation

code petl self-improvement

Questioning the Survey Responses of Large Language Models

CRAB: Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Detecting Training Data of Large Language Models via Expectation Maximization

Oct 2024 7 posts

Direct Multi-Turn Preference Optimization for Language Agents

alignment-learning dialogue-system optimization

Inference Scaling for Long-Context Retrieval Augmented Generation

long-context rag

Real-time Fake News from Adversarial Feedback

factuality knowledge-conflicts time-sensitive

LC-LLM RAG: Long-Context LLMs Meet RAG

language-modeling long-context rag

MoEE: Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

language-modeling moe representation-learning

Differential Transformer

attention language-modeling long-context +2

Selective Attention Improves Transformer

attention language-modeling petl +1

Sep 2024 8 posts

Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement

data-selection sft self-improvement

Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding

dst dialogue-system

Knowing When to Ask - Bridging Large Language Models and Data

knowledge-graph rag tableqa +1

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

language-modeling activation

Configurable Foundation Models: Building LLMs from a Modular Perspective

ensemble petl rag +3

Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models

knowledge-conflicts rag hallucination

Safety Layers of Aligned Large Language Models: The Key to LLM Security

safety interpretability

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Aug 2024 12 posts

Planning Like Human: A Dual-process Framework for Dialogue Planning

alignment-learning dialogue-system optimization +1

To Code, or Not To Code? Exploring Impact of Code in Pre-training

The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

interpretability

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

language-modeling hallucination

Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

knowledge-conflicts language-modeling multi-linguality +1

Adaptive Retrieval-Augmented Generation for Conversational Systems

dialogue-system icl peft +2

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

language-modeling rag

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

domain-adaptation evaluation rag

Word Translation Without Parallel Data

multi-linguality representation-learning translate

Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost

Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation

adaptor dialogue-system petl +1

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

sae interpretability

Jul 2024 6 posts

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

sae activation interpretability

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

dialogue-system industry language-modeling +1

Enhancing HNSW Index for Real-Time Updates: Addressing Unreachable Points and Performance Degradation

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

benchmark evaluation long-context

RouteLLM: Learning to Route LLMs with Preference Data

ensemble industry

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

long-context rag

Jun 2024 5 posts

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

language-modeling

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

prompting self-improvement

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

transformers interpretability

May 2024 2 posts

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

language-modeling

Better & Faster Large Language Models via Multi-token Prediction

language-modeling

Apr 2024 5 posts

Retrieval Head Mechanistically Explains Long-Context Factuality

Chinchilla Scaling: A replication attempt

language-modeling

Scaling Laws for Reward Model Overoptimization

alignment-learning reinforcement-learning

Label Supervised LLaMA Finetuning

ReALM: Reference Resolution As Language Modeling

dialogue-system multi-modality

Mar 2024 6 posts

Social Learning: Towards Collaborative Learning with Large Language Models

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

prompt-compression

RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems

Is Cosine-Similarity of Embeddings Really About Similarity?

representation-learning

Do Large Language Model Understand Multi-Intent Spoken Language ?

benchmark domain-adaptation icl +2

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Feb 2024 12 posts

Benchmarking Large Language Models in Retrieval-Augmented Generation

evaluation language-modeling rag

Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance

language-modeling prompting

Generative Representational Instruction Tuning

language-modeling peft rag +1

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Chain-of-Thought Reasoning Without Prompting

language-modeling

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

code evaluation

Specialized Language Models with Cheap Inference from Limited Domain Data

domain-adaptation language-modeling peft +2

The boundary of neural network trainability is fractal

Orion-14B: Open-source Multilingual Large Language Models

icl language-modeling multi-linguality

The Power of Noise: Redefining Retrieval for RAG Systems

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

ai-detection language-modeling

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

benchmark evaluation rag

Jan 2024 18 posts

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Corrective Retrieval Augmented Generation

Knowledge Fusion of Large Language Models

ensemble fusion weight-merging

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

factuality knowledge-editing language-modeling

DocLLM: A layout-aware generative language model for multimodal document understanding

domain-adaptation factuality industry +1

Self-Rewarding Language Models

dpo reinforcement-learning self-improvement

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

ChatQA: Building GPT-4 Level Conversational QA Models

dialogue-system

Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers

benchmark factuality language-modeling +1

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

dialogue-system evaluation language-modeling +1

Blending is All You Need

ensemble language-modeling

LLaMA Pro: Progressive LLaMA with Block Expansion

domain-adaptation language-modeling

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

reinforcement-learning self-improvement self-learning

Improving Text Embeddings with Large Language Models

representation-learning

Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models

Making Large Language Models A Better Foundation For Dense Retrieval