Question 1

What is RAG and how is it different from fine-tuning?

Accepted Answer

RAG (Retrieval-Augmented Generation) retrieves relevant documents from a knowledge base at query time and passes them to the language model as context before generating an answer. The model answers the question based on the retrieved documents, not solely from its training data. Fine-tuning adjusts the model's weights by training it on your data, updating what the model 'knows.' The practical differences are important. RAG keeps your data in a retrieval system you control: documents are indexed, not baked into model weights. When your documents change, you update the index. When a document is removed or superseded, it is removed from the retrieval system and the model stops citing it. Fine-tuning bakes knowledge into the model: updating it requires retraining, and there is no clear source citation. For most enterprise use cases -- customer support, policy lookup, product information, legal review assistance -- RAG is the right approach because the knowledge changes frequently and source attribution matters. Fine-tuning is more appropriate for changing the model's reasoning style, output format, or task-specific behaviour.

Question 2

What types of documents and data sources does RAG work with?

Accepted Answer

RAG works with any content that can be indexed: PDFs, Word documents, PowerPoint presentations, web pages, Confluence and Notion pages, Zendesk or Intercom knowledge base articles, plain text files, and structured databases. The ingestion pipeline extracts content from the source format, chunks it into segments appropriate for retrieval, generates embeddings, and stores them in a vector database. For structured data (databases, spreadsheets, CSVs), we use hybrid retrieval approaches: semantic search over unstructured content combined with structured query generation for database records. The retrieval quality depends on document quality and structure. Well-organised, clearly written documents retrieve better than dense, poorly structured ones. We assess your document library during scoping and identify any document quality or organisation issues that will affect retrieval before we commit to retrieval quality targets.

Question 3

What is retrieval quality and how do you measure it?

Accepted Answer

Retrieval quality measures how often the retrieval system finds the right documents for a given query. A RAG system can generate fluent, confident-sounding answers from retrieved documents and still be wrong if it retrieved the wrong documents. Retrieval quality has two dimensions: recall (does the system retrieve the documents that contain the answer?) and precision (does the system avoid retrieving irrelevant documents that confuse the answer?). We evaluate retrieval quality using a test set of questions and expected source documents, measuring recall and precision at different retrieval depths. We iterate on chunking strategy, embedding model, and retrieval configuration until retrieval quality meets a defined threshold for your use case before building the answer generation layer on top of it. Retrieval quality is the foundation. Everything else depends on it.

Question 4

How do you handle access controls -- not every user should see every document?

Accepted Answer

Document-level access controls are a first-class design requirement in enterprise RAG. The retrieval system must only surface documents the querying user has permission to read. We implement access control in the retrieval layer using metadata filtering: each document in the vector index is tagged with its access group metadata, and at query time the retrieval query is filtered to only return documents the current user's permissions allow. For RAG systems connected to existing document repositories (SharePoint, Confluence, Google Drive), we use the source system's permission model: a document the user cannot read in SharePoint is not indexed for retrieval by that user. Access control design is defined during scoping and tested before deployment. A RAG system that leaks confidential documents to users who should not see them is a more serious problem than one that retrieves the wrong document.

Your enterprise data is not in the model. RAG puts it there.

When the answer needs to be your answer, not the model's best guess

What we build

Document ingestion pipelines

Customer support RAG

Internal knowledge assistant

Vector database and retrieval infrastructure

RAG for legal and compliance

Product documentation RAG

What question does your team ask repeatedly that has an answer in a document nobody can find?