My name is

Jiwoo Hong

I am a graduate student at KAIST AI, specializing in AI and NLP under the supervision of Professor James Thorne. I graduated with the highest honors (summa cum laude) with a Bachelor’s degree in Statistics and Industrial Engineering from Sungkyunkwan University. My research interests focus on generalizability in post-training, including RLHF, RLVR, and reward modeling. Please visit ‘Publications’ to see my recent work!

News:

  • May 2025: I am joining the Amazon Rufus team as an Applied Scientist Intern in Palo Alto, CA!
  • May 2025: Two papers accepted to ICML 2025! See you in Vancouver🇨🇦
  • Feb 2025: I joined Naver Cloud as an NLP research scientist intern!
  • Jan 2025: One paper accepted to the NAACL 2025 main track! See you in Albuquerque🇺🇸

Experience

Applied Scientist Intern - Amazon Rufus
Summer 2025 (Incoming)
I am an incoming applied scientist intern on the Amazon Rufus team in Palo Alto, CA. My research at Amazon Rufus will focus on the intersection of multi-objective optimization and reinforcement learning for language models.
AI Research Intern - Naver Cloud
Feb 2025 - Present
I am currently working as an NLP research intern on the post-training team at Naver Cloud, focusing on RLHF and RLVR for language models.

Publications

ORPO: Monolithic Preference Optimization without Reference Model
Alignment RLHF
Jiwoo Hong, Noah Lee, James Thorne
On the Robustness of Reward Models in Language Model Alignment
RLHF Reward Modeling Generalizability
Jiwoo Hong, Noah Lee, Eunki Kim, Guijin Son, Woojin Chung, Shao Tang, Aman Gupta, James Thorne
AlphaPO - Reward shape matters for LLM alignment
Alignment RLHF
Aman Gupta, Shao Tang, Qingquan Song, Sirou Zhu, Jiwoo Hong, and 8 more authors
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning
LLM Reasoning Generalizability
Guijin Son, Jiwoo Hong, Hyunwoo Ko, James Thorne
Cross-lingual Transfer of Reward Models in Multilingual Alignment
RLHF Reward Models Generalizability
Jiwoo Hong*, Noah Lee*, Rodrigo Martínez-Castaño, César Rodríguez, James Thorne
Stable Language Model Pre-training by Reducing Embedding Variability
Pre-training Interpretability
Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se Young Yun
Disentangling Structure and Style: Political Bias Detection in News by Inducing Document Hierarchy
NLP Application Interpretability
Jiwoo Hong, Yejin Cho, Jiyoung Han, Jaemin Jung, James Thorne
Evaluating the Consistency of LLM Evaluators
Evaluation RLHF
Noah Lee*, Jiwoo Hong*, James Thorne
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Alignment Diffusion
Jiwoo Hong*, Sayak Paul*, Noah Lee, Kashif Rasul, James Thorne, Jongheon Jeong

Contact

Inquiries about my work or potential research collaborations are always welcome!