Best local coding helper so far
Qwen3 Coder Next
Qwen3 Coder Next is the safest coding recommendation from the evidence we have published. It has passed small coding checks, but we still need larger project tests.
View evidence
SiliconBench local model testing
See which local models are best for coding, writing, speed, reliability, and everyday use, with the test evidence underneath.
These are practical local-AI results from our own Mac testing. They show what worked in the tests listed here, not a universal ranking of every model.
Last data update: 9 May 2026, 17:50
Current recommendations
Each card gives the practical answer first, then links to the evidence that supports it. Where evidence is thin, the page says so instead of guessing.
Qwen3 Coder Next
Qwen3 Coder Next is the safest coding recommendation from the evidence we have published. It has passed small coding checks, but we still need larger project tests.
View evidence
Gemma 4 26B
Gemma 4 26B is currently the strongest writing and summarising candidate in these published tests.
View evidence
Qwen3 Coder Next
Qwen3 Coder Next was the quickest model in the new three-task mini pass.
View evidence
Qwen3 Coder Next
Qwen3 Coder Next has the strongest repeated availability evidence in these published tests.
View evidence
Qwen3.6 35B
Qwen3.6 35B gave the cleanest all-round result in the new quick practical pass.
View evidence
Tracked models
The page focuses on what each model is useful for. Technical setup details are kept out of the way.
Our current local coding pick. It passed the new three-task quick practical pass and remains the fastest measured lane on this page, but it still needs bigger real-project coding tests.
A useful everyday local model, but the new quick pass exposed a real presentation issue: it can show its thinking process instead of giving a clean plain-English answer.
A larger local model that passed three quick practical checks cleanly: plain-English advice, exact one-line output, and a small coding helper. Needs broader testing before promotion.
A very large local model that gave useful advice and code in the quick pass, but failed a format-following check by copying the placeholder. Promising, not clean yet.
The best non-Qwen all-rounder candidate in this evidence set so far. It did well on several writing and general tasks, but still struggled with some format-following and structured-output tests.
This entry is not a local result. Ollama was available, but this GLM listing is cloud-only, so it stays out of local recommendations.
A small local model that passed quick writing and coding checks but failed an exact-format check. Useful as small-model coverage, not a recommendation yet.
Now genuinely testable locally through a chat-style test setup. It did useful work on reasoning, privacy advice, fake-question handling, structured answers, and small code, but it was too bullish on a missing-data business decision and still has exact wording/language caveats. It also used about 152 GB of memory.
These families are listed as future coverage gaps. They are not local winners on this page yet.
An auxiliary local general-purpose lane that passed three quick practical checks and answered short tasks quickly. It still needs larger real-work tests.
Now running locally and tested beyond a simple quick check. It passed the new three-task practical pass, but it still carries earlier caveats on stricter file-making and formatting checks.
A local Ollama model that answered a simple one-sentence explanation task. This fills one Ollama coverage gap, but it needs the same practical checks as the other models before comparison.
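The exact-format and format-following checks mentioned in the cards above could be scored along the lines below. This is only a minimal sketch under stated assumptions, not the harness behind these results: the function name `check_exact_line`, the `<ANSWER>` placeholder token, and the result labels are all hypothetical.

```python
def check_exact_line(output: str, expected: str, placeholder: str = "<ANSWER>") -> str:
    """Classify a model reply against an exact one-line target.

    Returns "pass", "copied_placeholder", "extra_text", or "mismatch".
    The placeholder token and labels are illustrative assumptions.
    """
    text = output.strip()
    if text == expected:
        return "pass"
    # The reply echoed the template token instead of filling it in.
    if placeholder in text:
        return "copied_placeholder"
    # The right line is present but surrounded by other lines,
    # for example visible reasoning before the answer.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if expected in lines:
        return "extra_text"
    return "mismatch"


if __name__ == "__main__":
    # Hypothetical replies; none of these are recorded model outputs.
    replies = [
        "STATUS: ok",
        "STATUS: <ANSWER>",
        "Let me think about this.\nSTATUS: ok",
    ]
    for reply in replies:
        print(repr(reply), "->", check_exact_line(reply, "STATUS: ok"))
```

Keeping "copied the placeholder" and "correct line with extra text" as separate outcomes mirrors the two kinds of failure the cards describe, so a single pass/fail flag does not hide which one occurred.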
Evidence behind the verdict
Evidence is grouped by plain questions: can we use it, did it answer quickly, what jobs did it handle, and where did it fail?
[Evidence records: 34 timestamped entries dated between 5 May 2026, 10:53 and 9 May 2026, 17:45. Entry details are not included in this text.]
How these results are judged
Update path