Show HN: AST-guard, a gradient-immune structural guard against RL reward hacking
By thinking-nick · 2026-06-29 · 3 points · 0 comments
https://github.com/Nick-is-building/ast-guard