Where randomness meets reason
AMD 780M iGPU + 64GB DDR5 runs Gemma 4 28B at 19.5 tok/s. Setup guide, benchmarks, and a cost breakdown vs. the Mac mini for local LLM inference under $600.
omlx: a macOS-native LLM server for Apple Silicon with SSD KV caching that cuts cold-start prefill from 90s to under 5s. Complete RAG customer-support chatbot tutorial included.