A video detailing the high-level design is here:
https://www.youtube.com/watch?v=bE2kRmXMF0I
My short- and long-term memory designs, vocal daisy chaining, and my Docker Compose stack can all be found here! https://github.com/RoyalCities/RC-Home-Assistant-Low-VRAM
I've also done extensive testing to ensure it fits on most semi-recent graphics cards :)