<span class="vcard">/u/sgt102</span>
/u/sgt102

Multi-query benchmarking

Hello, Another team has suggested that a customer problem could be solved simply by putting the target text and a bunch of queries into a single prompt and then collecting the results. Is anyone aware of a benchmark that shows how good LLMs are at an…