π From PDF Overload to AI Clarity: Building an AI RAG Assistant
Introduction
If youβve ever tried to dig a single obscure fact out of a massive technical manual, youβll know the frustration π©: you know itβs in there somewhere, you just canβt remember the exact wording, cmdlet name, or property that will get you there.
For me, this pain point came from Office 365 for IT Pros β a constantly updated, encyclopaedic PDF covering Microsoft cloud administration. Itβs a superb resourceβ¦ but not exactly quick to search when you canβt remember the magic keyword.
Often I know exactly what I want to achieve β say, add copies of sent emails to the senderβs mailbox when using a shared mailbox β but I canβt quite recall the right cmdlet or property to Ctrl+F my way to the answer.
Thatβs when I thought: what if I could take this PDF (and others in my archive), drop them into a centralised app, and use AI as the conductor and translator πΌπ€ to retrieve the exact piece of information I need β just by asking naturally in plain English.
This project also doubled as a test bed for Claude Code, which Iβd been using since recently completing a GenAI Bootcamp π.
I wanted to see how it fared when building something from scratch in an IDE, rather than in a chat window.
π In this post, Iβll give a very high level overview of the four iterations (v1βv4) - what worked, what failed, and what I learned along the way.
