Tom Fenton moves from local AI concepts to hands-on tools for matching LLMs to hardware, running local chatbots with Ollama and benchmarking AI performance.
GLM-5.2, Z.ai’s open-weight model, has reached 39% F1 on Semgrep’s IDOR benchmark, beating Anthropic’s Claude Code coding assistant in the prompt-only lane. Claude Code scored 37% F1 with Opus 4.6 and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results