Jiwoo Hong
Master's Student at KAIST GSAI
Language Model Alignment
1 post
Reference-free Monolithic Odds Ratio Preference Optimization (ORPO)
Mar 3, 2024
Trending Tags
🌟Publications
🔥NLP
EMNLP2023
Journalism AI
Multi-Agent Reinforcement Learning
🔥Alignment
🔥LLM
🔥Reinforcement Learning