On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside significantly larger models; it outpaces DeepSeek-V3.2, which scores 70.2%, ...
Comprehensive REST API with webhooks gives recruitment agencies full operational control from lead generation through ...
Tired of using numerous productivity apps? See how McStumble combines numerous tools in one easy-to-use website for free.