I like Kimi too, but they definitely have some benchmark contamination: the blog...

vessenes 7 months ago | parent | context | favorite | on: Kimi K2 Thinking, a SOTA open-source trillion-para...

I like Kimi too, but they definitely have some benchmark contamination: the blog post shows a substantial comparative drop in swebench verified vs open tests. I throw no shade - releasing these open weights is a service to humanity; really amazing.