Detailed technical post about quantization with specific parameters, methodology explanation, and links to resources. Provides actionable information for model optimization.
Detailed technical post about quantization with specific parameters, methodology explanation, and links to resources. Provides actionable information for model optimization.
Comprehensive technical documentation with quantified results, open-source models, training data, and methodology writeup. Fully reproducible with links to all artifacts.
Detailed performance benchmarks with specific hardware, quantization methods, and measurable improvements. Includes interactive charts and methodological transparency. Updates clarify bug context.
Detailed case study demonstrating real-world implementation of Claude Code for business operations including bot automation, React app, browser control, and document generation with specific examples.
Updated CLAUDE.md system prompt backed by research citations (Self-Attention Limits Working Memory, Chroma Context Rot, Lost in the Middle TACL 2024). Provides research-grounded patterns for multi-session Claude Code work with templates and GitHub repo.
Comprehensive technical writeup of oMLX inference server with detailed feature list, architecture decisions (paged SSD caching, continuous batching), API compatibility specifications, and open-source release. Provides actionable tool for local LLM deployment on Apple Silicon.
Full-featured open-source desktop email client (Velo) built entirely with Claude. Includes comprehensive technical architecture (Tauri, React, Rust, SQLite), feature list, GitHub repository, and honest documentation of AI-assisted development methodology.