Discussion about this post

User's avatar
Steeven's avatar

> Exclusivity affects pricing significantly. Environments and tasks can be sold exclusively to one customer or non-exclusively to multiple labs.

I do wonder how they maintain this exclusivity. If two frontier labs ask for a slack clone from the same vendor, presumably the vendor would give them the same product. Maybe with minor edits

Neural Foundry's avatar

Really solid overview of where the RL environment market is heading. The $1B Anthropic number is insane but makes sense if you think about compute waste from bad task quality. What jumped out is the reward hacking iteration problem, even with models getting better at not gaming graders, it still takes multiple rounds to get the grading robust. The 2-3% minimum pass rate treshhold is a useful heuristic I hadn't seen quantified before, below that and you risk the task being infeasible which wastes everything downstream.

No posts

Ready for more?