I tested local models on 100+ real RAG tasks. Here are the best 1B model picks
TL;DR — Best model by real-life file QA tasks (Tested on 16GB Macbook Air M2) The idea of this test is to really understand how models perform in privacy-concerned real-life tasks*, instead of utilizing traditional benchmarks to measure general AI cap…