Dongyoung Kim
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
Dongyoung Kim, Kimin Lee ,Jinwoo Shin, Jaehyung Kim
International Conference on Learning Representations (ICLR) 2025, Oral Presentation (207/11672=1.77%)
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo
Conference on Neural Information Processing Systems (NeurIPS), 2023
Debiasing Online Preference Learning via Preference Feature Preservation
Dongyoung Kim, Jinsung Yoon,Jinwoo Shin, Jaehyung Kim
Annual Meeting of the Association for Computational Linguistics (ACL) 2025 (Findings)
Learning to Correct for QA Reasoning with Black-box LLMs
Jaehyung Kim, Dongyoung Kim, Yiming Yang
Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 (Long, Main)
Training-free LLM Verification via Recycling Few-shot Examples
Dongseok Lee, Jimyung Hong, Dongyoung Kim, Jaehyung Kim
International Conference on Machine Learning (ICML) 2025 Workshop ES-FoMo-III
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
Dongyoung Kim, Kimin Lee ,Jinwoo Shin, Jaehyung Kim
International Conference on Learning Representations (ICLR) 2025, Oral Presentation (207/11672=1.77%)
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo
Conference on Neural Information Processing Systems (NeurIPS), 2023
Debiasing Online Preference Learning via Preference Feature Preservation
Dongyoung Kim, Jinsung Yoon,Jinwoo Shin, Jaehyung Kim
Annual Meeting of the Association for Computational Linguistics (ACL) 2025 (Findings)
Learning to Correct for QA Reasoning with Black-box LLMs
Jaehyung Kim, Dongyoung Kim, Yiming Yang
Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 (Long, Main)
Training-free LLM Verification via Recycling Few-shot Examples
Dongseok Lee, Jimyung Hong, Dongyoung Kim, Jaehyung Kim
International Conference on Machine Learning (ICML) 2025 Workshop ES-FoMo-III