Digest - Tuesday, April 21, 2026

Show HN: Mediator.ai – Using Nash bargaining and LLMs to systematize fairness

🟠 HackerNews by sanity ▲ 6

technical tools buildable # showcase

Showcase of a working LLM application combining Nash bargaining theory with LLM comparisons for negotiation mediation. Includes technical explanation and working implementation with clear methodology.

Show HN: Ctx – a /resume that works across Claude Code and Codex

🟠 HackerNews by dchu17 ▲ 3

technical tools coding buildable # showcase

Detailed showcase of a working tool (ctx) for Claude Code and Codex with installation instructions, feature list, and technical architecture. Provides actionable implementation details and demonstrates clear use cases.

Claude Cowork can now build live artifacts

🔴 r/ClaudeAI by /u/ClaudeOfficial

technical tools

Official announcement from Claude team about new Live Artifacts feature with specific capabilities and availability details.

Gemma 4 26B-A4B GGUF Benchmarks

🔴 r/LocalLLaMA by /u/danielhanchen

research_verified models research tools # resource

Comprehensive benchmarking study with methodology, quantitative results (KL Divergence metrics), multiple comparison tables, and reproducible methodology. Includes GitHub repo and HuggingFace dataset links.

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

🔴 r/LocalLLaMA by /u/evoura

research_verified models research coding # resource

Comprehensive benchmark comparing 21 local LLMs with standardized testing (164 coding problems, HumanEval+), detailed methodology, performance table, and hardware specs. Includes GitHub repo and Medium article.

Qwen3.5-27B, Qwen3.5-122B, and Qwen3.6-35B on 4x RTX 3090 — MoEs struggle with strict global rules

🔴 r/LocalLLaMA by /u/DehydratedWater_

research_verified models research tools # resource

Extensive empirical study comparing three Qwen models with 20+ live agentic sessions each, detailed vLLM metrics, multiple performance tables, specific hardware config, and quantitative analysis of rule-following behavior.

Fine-tuned Qwen3 SLMs (0.6-8B) beat frontier LLMs on narrow tasks

🔴 r/LocalLLaMA by /u/Jolly-Gazelle-6060

research_verified research models coding # resource

Systematic benchmark study with open-sourced code, data, eval scripts, detailed methodology notes, and reproducible results across multiple models and datasets.