Dongyoung Kim

Dongyoung Kim

Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment

Dongyoung Kim, Kimin Lee ,Jinwoo Shin, Jaehyung Kim

International Conference on Learning Representations (ICLR) 2025, Oral Presentation (207/11672=1.77%)

Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration

Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo

Conference on Neural Information Processing Systems (NeurIPS), 2023

Debiasing Online Preference Learning via Preference Feature Preservation

Dongyoung Kim, Jinsung Yoon,Jinwoo Shin, Jaehyung Kim

Annual Meeting of the Association for Computational Linguistics (ACL) 2025 (Findings)

Learning to Correct for QA Reasoning with Black-box LLMs

Jaehyung Kim, Dongyoung Kim, Yiming Yang

Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 (Long, Main)

Training-free LLM Verification via Recycling Few-shot Examples

Dongseok Lee, Jimyung Hong, Dongyoung Kim, Jaehyung Kim

International Conference on Machine Learning (ICML) 2025 Workshop ES-FoMo-III

Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment

Dongyoung Kim, Kimin Lee ,Jinwoo Shin, Jaehyung Kim

International Conference on Learning Representations (ICLR) 2025, Oral Presentation (207/11672=1.77%)

Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration

Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo

Conference on Neural Information Processing Systems (NeurIPS), 2023

Debiasing Online Preference Learning via Preference Feature Preservation

Dongyoung Kim, Jinsung Yoon,Jinwoo Shin, Jaehyung Kim

Annual Meeting of the Association for Computational Linguistics (ACL) 2025 (Findings)

Learning to Correct for QA Reasoning with Black-box LLMs

Jaehyung Kim, Dongyoung Kim, Yiming Yang

Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 (Long, Main)

Training-free LLM Verification via Recycling Few-shot Examples

Dongseok Lee, Jimyung Hong, Dongyoung Kim, Jaehyung Kim

International Conference on Machine Learning (ICML) 2025 Workshop ES-FoMo-III