<span class="vcard">/u/zinyando</span>
/u/zinyando

Shipped Izwi v0.1.0-alpha-12 (faster ASR + smarter TTS)

Between 0.1.0-alpha-11 and 0.1.0-alpha-12, we shipped: Long-form ASR with automatic chunking + overlap stitching Faster ASR streaming and less unnecessary transcoding on uploads MLX Parakeet support New 4-bit model variants (Parakeet, LFM2.5, Qw…

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support

Quick update on Izwi (local audio inference engine) – we've shipped some major features: What's New: Speaker Diarization – Automatically identify and separate multiple speakers using Sortformer models. Perfect for meeting transcripts. Forced Al…

Izwi v0.1.0-alpha is out: new desktop app for local audio inference

We just shipped Izwi Desktop + the first v0.1.0-alpha releases. Izwi is a local-first audio inference stack (TTS, ASR, model management) with: CLI (izwi) OpenAI-style local API Web UI New desktop app (Tauri) Alpha installers are now available for: …

I built a way to test Qwen3-TTS and Qwen3-ASR locally on your laptop

Supports Qwen3-TTS models (0.6B-1.7B) and ASR models. Docker + native deployment options. Key features: 🎭 Voice cloning with reference audio 🎨 Custom voice design from text descriptions ⚡ MLX + Metal GPU acceleration for M1/M2/M3 🎨 Modern React …