Digest - Monday, March 30, 2026

Show HN: /slot-machine development (CC vs. Codex; CE vs. superpowers)

🟠 HackerNews by pejmanjohn ▲ 5

technical tools coding buildable # showcase

Presents a concrete opensource tool for parallel code generation with agent review and comparison. Includes GitHub link, clear methodology, and practical tradeoffs (tokens vs quality). Actionable for implementation.

Show HN: Pglens – 27 read-only PostgreSQL tools for AI agents via MCP

🟠 HackerNews by jeeybee ▲ 5 💬 2

technical tools coding # showcase

This is a PostgreSQL MCP (Model Context Protocol) tool for AI agents, directly relevant to Claude/LLM tooling. While the selftext is empty, the title clearly indicates a functional tool/integration that extends AI agent capabilities.

I gave an AI SSH access to my production infrastructure – 3 months later

🟠 HackerNews by fawraw ▲ 4 💬 1

technical research_verified # showcase

This is a case study about giving an AI system SSH access to production infrastructure, directly relevant to Claude/LLM applications and real-world deployment scenarios. The title suggests a documented experience/experiment with AI agents in infrastructure contexts.

TurboQuant on MLX: 4.6x KV cache compression with custom Metal kernels (Qwen 32B at 98% FP16 speed)

🔴 r/LocalLLaMA by /u/dirtyhand3

technical tools coding research buildable # showcase

Working implementation of TurboQuant for MLX with specific performance metrics (4.6x compression, 0.98x FP16 speed), detailed optimization journey, and links to code and PR.

Built a simple PyTorch flash-attention alternative for AMD GPUs that don't have it

🔴 r/LocalLLaMA by /u/Lowkey_LokiSN

technical coding models buildable troubleshooting # showcase

Detailed technical implementation of a memory-efficient attention mechanism for AMD GPUs with comprehensive benchmarks, code details, and reproducible results. Includes specific optimization techniques, fallback strategies, and performance data across multiple use cases.

Why the 1M context window burns through limits faster and what to do about it

🔴 r/ClaudeCode by /u/lucifer605

technical tools meta-tooling # tutorial

Comprehensive technical deep-dive into token caching mechanisms, TTL behavior, and cost optimization. Includes specific metrics, experimental data from vLLM testing, and actionable strategies with detailed explanations of API behavior.

ZINC — LLM inference engine written in Zig, running 35B models on $550 AMD GPUs

🔴 r/LocalLLaMA by /u/Mammoth_Radish2

technical coding tools hardware # showcase

Detailed technical project announcement with functional codebase, architecture decisions, performance metrics (7.1 tok/s), and public GitHub repo. Actionable implementation details about Zig/Vulkan stack.