Daniel TunkelangAI-Powered Search: Embedding-Based Retrieval and Retrieval-Augmented Generation (RAG)Replacing traditional search with AI-powered search means embedding-based retrieval and possibly retrieval-augmented generation (RAG).Apr 8, 20243Apr 8, 20243
InTDS ArchivebyMatthew GuntonDiving Deep into AutoGen and Agentic FrameworksThis blog post will go into the details of the “AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation” paperJun 28, 2024Jun 28, 2024
InGenerative AIbyzhaozhimingAdvanced RAG Retrieval Strategy: Query RewritingIntroduction to several query rewriting strategies in advanced RAG retrievalMay 17, 20245May 17, 20245
Justin MullerPrompt DecompositionHow adding more calls to an LLM can unlock scale and increase accuracy while lowering both cost and latency.Jun 17, 202410Jun 17, 202410
InAI AdvancesbyzhaozhimingAdvanced RAG Retrieval Strategy: Embedded TablesOne of the most challenging aspects of RAG (Retrieval Augmented Generation) applications is how to handle the content of complex documents…May 27, 20245May 27, 20245
Dan ClearyHow To Give Your Chatbot More MemoryI was lucky enough to recently speak and attend the first AI Engineer Summit in San Francisco this past month. There, OpenAI developer…Oct 20, 20234Oct 20, 20234
Crayon ConsultingSemantic Kernel: Integrating Conversational AI into Enterprise Apps using DotNet, Python, and JavaAre you a DotNet, Python, or Java developer? Are you looking for a tool to help you incorporate conversational AI into your apps? Have you…Feb 7, 20241Feb 7, 20241
InAI PlanetbyPlaban NayakSetting up Query Pipeline For Advanced RAG Workflow using LlamaIndexWhat is QueryPipelines?Feb 4, 20241Feb 4, 20241
InTowards AIbyIVAN ILINAdvanced RAG Techniques: an Illustrated OverviewA comprehensive study of the advanced retrieval augmented generation techniques and algorithms, systemising various approaches. The article…Dec 17, 202338Dec 17, 202338
Kelvin LuFine-Tuning Embedding Model with PEFT and LoRAIn our previous discussion, we explored the evaluation of embedding models and the potential benefits of hosting these models to achieve…Aug 1, 20232Aug 1, 20232
Kelvin LuHosting A Text Embedding Model That is Better, Cheaper, and Faster Than OpenAI’s SolutionWith a little bit of technical effort we can get a better text embedding model that is superior to the OpenAI solution.Jul 23, 20235Jul 23, 20235
InData Science at MicrosoftbyJames NguyenForget RAG: Embrace agent design for a more intelligent grounded ChatGPT!The Retrieval Augmented Generation (RAG) design pattern has been commonly used to develop a grounded ChatGPT in a specific data domain…Nov 18, 202321Nov 18, 202321
Kelvin LuCompare PDF Question Answering Systems Build with OpenAI and Google VertexAIThis post showed how to build a RAG with LangChain on both OpenAI and Google LLMs, and how the two solutions perform differently.Jun 12, 20233Jun 12, 20233
Kelvin LuWhat We Need to Know Before Adopting a Vector DatabaseTo continue with our journey toward applicable Generative AI, I would like to discuss some of the challenges of applying vector databases…Aug 15, 20234Aug 15, 20234