Hey everyone, does anyone have any contacts or know folks that would be well suited in discussing the below topic?
Looking to understand how critical labeling and clean/structured data is to reinforcement learning from human feedback (RLHF) and AI more broadly. Ideally, folks with experience at hyperscalers / large AI players (Google, Deepmind, OpenAI, Meta, etc)