Question 1

What does generative AI consulting cover?

Accepted Answer

Generative AI consulting covers the strategic and architectural decisions that determine whether a generative AI project succeeds or fails -- use case selection, model choice, architecture design (RAG vs. fine-tuning vs. prompt engineering), evaluation framework, cost modelling, and production requirements. It is the work that prevents teams from building impressive demos that fall apart in production, or spending development budget on use cases that don't justify the investment.

Question 2

How do I know which generative AI use cases are worth pursuing?

Accepted Answer

Worth pursuing: use cases with high-volume, repetitive text generation (document drafting, email composition, support response suggestion) where current manual effort is measurable. Use cases where AI-generated content can be reviewed before use (draft, not final output). Use cases where the cost of wrong answers is acceptable and reviewable. Not worth pursuing: use cases where accuracy is 100% required and AI errors have serious consequences without review. Use cases where the underlying data does not support the use case. Use cases where simpler rule-based systems would work.

Question 3

When should I use RAG vs. fine-tuning vs. prompt engineering?

Accepted Answer

Prompt engineering (system prompts, few-shot examples): try this first for any use case. It requires no training data, deploys immediately, and works well for a wider range of tasks than expected. RAG (retrieval-augmented generation): when you need the model to answer questions about your specific documents, knowledge base, or product data that the base model does not know. Fine-tuning: when you need consistent output format or style that prompt engineering cannot reliably achieve, and you have hundreds to thousands of high-quality examples. Most production use cases use RAG for knowledge grounding and prompt engineering for format control.

Question 4

How do I evaluate whether a generative AI system is production-ready?

Accepted Answer

Production readiness for generative AI requires: an evaluation framework (automated tests on representative inputs with pass/fail criteria, not just manual review), latency and cost benchmarks under expected load, hallucination detection for high-stakes outputs, graceful degradation when the model returns low-confidence or out-of-scope responses, and a feedback loop for capturing failures in production. Systems that pass demos but lack evaluation frameworks are not production-ready.

Question 5

How long does a generative AI consulting engagement take?

Accepted Answer

A focused use case assessment for a single application takes 1--2 weeks. A broader generative AI strategy engagement covering multiple use cases, architecture design, model selection, and build roadmap takes 3--6 weeks. For teams with an AI system already in development, a production readiness review takes 1--2 weeks and typically surfaces 5--10 specific issues to address before launch.

Question 6

What does generative AI consulting cost?

Accepted Answer

A focused use case assessment for a single application runs $6,000--$15,000. A broader AI strategy engagement with multiple use cases and architecture design runs $15,000--$40,000. A production readiness review for an existing AI system runs $8,000--$20,000. All engagements are fixed-price with a defined scope and deliverable.

Generative AI Consulting Services

Most generative AI projects fail on the production side

What we cover

Use case assessment and prioritisation

Model selection and evaluation

RAG and knowledge system design

Agent and multi-agent architecture

Production readiness review

AI governance and evaluation framework

Tell us what you are trying to build or evaluate.