machine learning machine learning deployment Local-First AI Inference: A Cloud Architecture Pattern for Cost-Effective Document Processing – infoq.com Google Inc. May 11, 2026 May 11, 2026 Local-First AI Inference: A Cloud Architecture Pattern for Cost-Effective Document Processing infoq.com