Posts for: #local-ai

Running a 35B MoE Model on a 16GB Consumer GPU

2026-05-27

A GPU chip as an ancient amphitheater with 256 niches, only 8 glowing amber. Wood grain transforms into circuit traces beneath it.

How to serve Qwen3.6-35B-A3B, a Mixture-of-Experts model with 3B active parameters, on an RTX 5070 Ti using llama.cpp. Full config, performance numbers, and the flags that make it fit.

The $59 Voice Recorder That Beats a $159 AI Note-Taker

2026-04-30

#local-ai #audio-transcription #privacy #apple-silicon #dji #whisperkit #personal-systems

A technology preview: using a DJI Mic 2 transmitter and free local transcription on Apple Silicon instead of a Plaud NotePin subscription. Better audio, word-level timestamps, no cloud, and a fraction of the cost.
