Discussion about this post

Neural Foundry

Impressive stuff, seeing this 12 percentage point jump in a single iteration. The held-out set performance really stands out though: models typically struggle more on unseen problem types, but here it actually went the other way. I ran into a similar thing when benchmarking solvers last year; the numerical shortcuts that Pantone mentioned are kind of inevitable without explicitly constraining proof methods.
