Mini RAG Benchmark

Hi @haesleinhuepf @kaabl @marabuuu ,

I created some sort of 'mini benchmark' for two different RAG approaches for the Chatbot.
20 queries, relating to our training material, were processed by both approaches in order to find the 10 best matching slides in our training material.
You can find the results PDF [here](https://github.com/NFDI4BIOIMAGE/SlideInsight/blob/5852e8b8430237bad09c2e1c7a016ed8b5a80208/RAG/benchmark_results.pdf). 

It would be really nice, if you could take a look at the 20 examples and decide for each one, which results (i.e. which slides) match best to the corresponding query. So for each query, we should decide if approach A or B led to a better result over all.
By doing that, we can decide which approach (or if any of the two approaches) is best to use for the multimodal RAG Chatbot.

Thanks and have a great day,
Lea

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mini RAG Benchmark #61

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Mini RAG Benchmark #61

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions