zlacker

[return to "Better RAG Results with Reciprocal Rank Fusion and Hybrid Search"]
1. thefou+G71[view] [source] 2024-05-30 21:48:14
>>johnjw+(OP)
I also found pure RAG with vector search to not work. I was creating a bot that could find answers to questions about things by looking at Slack discussions.

At first, I downloaded entire channels, loaded them into a vector DB, and did RAG. The results sucked. Vector searches don't understand things very well, and in this world, specific keywords and error messages are very searchable.

Instead, I take the user's query, ask an LLM (Claude / Bedrock) to find keywords, then search Slack using the API, get results, and use an LLM to filter for discussions that are relevant, then summarize them all in a response.

This is slow, of course, so it's very multi-threaded. A typical response will be within 30 seconds.

◧◩
2. qetern+632[view] [source] 2024-05-31 08:24:30
>>thefou+G71
Are you doing this for a product or for internal usage?
[go to top]