
How is Claude doing on the benchmark that the market is based on? Maybe not so well? Idk. Just because Claude is good for real-world use doesn't mean it's winning the benchmark, and the benchmark is all that matters for the Polymarket market.

