Reward models for LMs are fundamentally broken
By panthertrax · 2026-06-24 · 1 points · 0 comments
https://twitter.com/vijaytarian/status/2069438063345115187
By panthertrax · 1 points · 0 comments · on Hacker News, read on BetterNews.
Open the full discussion on BetterNews