Deep dives into on-device AI, iOS engineering, and the infrastructure behind voice-first personal computing.
How Audria detects when a conversation shifts topics in real time — the math behind semantic drift detection, embedding-based segmentation, and why traditional approaches fail for voice-first AI.
We built a complete voice-first AI system that runs its entire pipeline locally on an iPhone — speech recognition, language model inference on the Neural Engine, constrained decoding, and a knowledge graph memory system. All in airplane mode.