This week’s Arxiv Dive we get into Direct Preferen...
# 03-ai-events
s
This week’s Arxiv Dive we get into Direct Preference Optimization: Your Language Model is Secretly a Reward Model. Hope to see some of y’all there Friday! https://lu.ma/oxenbookclub