Scott Howard
09/11/2023, 8:49 PMTraining language models to follow instructions with human feedback
(paper here: https://arxiv.org/pdf/2203.02155.pdf.)
@Greg Schoeninger leads a 30m review, convo, Q&A on each week’s reading. If you’d like to join us for any of our upcoming Friday discussions - sign up here: https://lu.ma/oxenbookclub