List: RAG | Curated by Tak Yu Chan (Franky)

Jul 18, 2024
15 stories
RAG
Daniel Tunkelang
AI-Powered Search: Embedding-Based Retrieval and Retrieval-Augmented Generation (RAG)
Replacing traditional search with AI-powered search means embedding-based retrieval and possibly retrieval-augmented generation (RAG).
Apr 8, 2024
3
In TDS Archive by Matthew Gunton
Diving Deep into AutoGen and Agentic Frameworks
This blog post will go into the details of the “AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation” paper.
Jun 28, 2024
In Generative AI by zhaozhiming
Advanced RAG Retrieval Strategy: Query Rewriting
Introduction to several query rewriting strategies in advanced RAG retrieval
May 17, 2024
5
Justin Muller
Prompt Decomposition
How adding more calls to an LLM can unlock scale and increase accuracy while lowering both cost and latency.
Jun 17, 2024
10
In AI Advances by zhaozhiming
Advanced RAG Retrieval Strategy: Embedded Tables
One of the most challenging aspects of RAG (Retrieval Augmented Generation) applications is how to handle the content of complex documents…
May 27, 2024
5
Dan Cleary
How To Give Your Chatbot More Memory
I was lucky enough to recently speak and attend the first AI Engineer Summit in San Francisco this past month. There, OpenAI developer…
Oct 20, 2023
4
Crayon Consulting
Semantic Kernel: Integrating Conversational AI into Enterprise Apps using DotNet, Python, and Java
Are you a DotNet, Python, or Java developer? Are you looking for a tool to help you incorporate conversational AI into your apps? Have you…
Feb 7, 2024
1
In AI Planet by Plaban Nayak
Setting up Query Pipeline For Advanced RAG Workflow using LlamaIndex
What is QueryPipelines?
Feb 4, 2024
1
In Towards AI by IVAN ILIN
Advanced RAG Techniques: an Illustrated Overview
A comprehensive study of the advanced retrieval augmented generation techniques and algorithms, systemising various approaches. The article…
Dec 17, 2023
38
Kelvin Lu
Fine-Tuning Embedding Model with PEFT and LoRA
In our previous discussion, we explored the evaluation of embedding models and the potential benefits of hosting these models to achieve…
Aug 1, 2023
2
Kelvin Lu
Hosting A Text Embedding Model That is Better, Cheaper, and Faster Than OpenAI’s Solution
With a little bit of technical effort we can get a better text embedding model that is superior to the OpenAI solution.
Jul 23, 2023
5
In Data Science at Microsoft by James Nguyen
Forget RAG: Embrace agent design for a more intelligent grounded ChatGPT!
The Retrieval Augmented Generation (RAG) design pattern has been commonly used to develop a grounded ChatGPT in a specific data domain…
Nov 18, 2023
21
Kelvin Lu
Disadvantages of RAG
This is the first part of the RAG analysis:
Aug 25, 2023
9
Kelvin Lu
Compare PDF Question Answering Systems Build with OpenAI and Google VertexAI
This post showed how to build a RAG with LangChain on both OpenAI and Google LLMs, and how the two solutions perform differently.
Jun 12, 2023
3
Kelvin Lu
What We Need to Know Before Adopting a Vector Database
To continue with our journey toward applicable Generative AI, I would like to discuss some of the challenges of applying vector databases…
Aug 15, 2023
4