Mini RAG Benchmark #61

@lea-33

Hi @haesleinhuepf @kaabl @marabuuu ,

I put together a small benchmark comparing two RAG approaches for the chatbot.
Both approaches processed the same 20 queries about our training material, and each returned its 10 best-matching slides.
You can find the results PDF here.

It would be great if you could look at the 20 examples and decide, for each query, whether approach A or approach B returned the better-matching slides overall.
That way, we can decide which of the two approaches (if either) to use for the multimodal RAG chatbot.
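
Once everyone has voted, the per-query decisions could be combined with a simple majority count. The snippet below is only a sketch of that idea: the function name `tally_votes`, the query IDs, and the example ballots are all made up for illustration and are not real results from the PDF.

```python
from collections import Counter


def tally_votes(votes):
    """Combine reviewer ballots into one decision per query.

    votes: dict mapping a query id to a list of ballots, one per reviewer.
    Each ballot is "A" or "B"; any other value (e.g. "neither") counts for
    neither approach.
    Returns (per_query, overall): the winner per query ("A", "B" or "tie")
    and a Counter of how many queries each outcome won.
    """
    per_query = {}
    for query_id, ballots in votes.items():
        counts = Counter(ballots)
        if counts["A"] > counts["B"]:
            per_query[query_id] = "A"
        elif counts["B"] > counts["A"]:
            per_query[query_id] = "B"
        else:
            per_query[query_id] = "tie"
    overall = Counter(per_query.values())
    return per_query, overall


# Hypothetical ballots for three queries (illustration only, not real data).
example_votes = {
    "query_01": ["A", "A", "B"],
    "query_02": ["B", "B", "B"],
    "query_03": ["A", "B", "neither"],
}

per_query, overall = tally_votes(example_votes)
for query_id, winner in per_query.items():
    print(query_id, winner)
print(dict(overall))
```

If many queries end in a tie, or reviewers often vote "neither", that would suggest neither approach is clearly better for the chatbot.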

Thanks and have a great day,
Lea
