AI in production: what actually works in 2026 | Cartwheel Galaxy

AI in production is different from AI in demos. Here's what we've actually shipped and what we've watched fail.

What works today

Document processing. LLMs extract structured data from PDFs, contracts, invoices, and forms reliably. We've built document workflows that replace hours of manual data entry.

Intelligent search. Vector databases plus LLMs let users ask natural language questions about their own documents, codebases, or knowledge bases. This is production-ready.

Automated workflows. LLMs orchestrate multi-step tasks: classify an incoming email, extract relevant fields, update a CRM, draft a response. With good guardrails, this works.

Natural language interfaces. Turn unstructured user input into structured actions. Good for internal tools where users shouldn't need to learn UI.

Code assistance. Not replacing engineers, but augmenting them meaningfully for boilerplate, refactoring, and documentation.

What doesn't work yet

Fully autonomous agents for complex tasks. Multi-hour tasks with dozens of decisions still break in weird ways. Great for demos. Fragile in production.

Replacing domain expertise. LLMs fail at tasks where a human expert combines deep context, judgment, and accountability. Don't ship AI-as-the-decision-maker for critical paths.

Deterministic calculations. Ask an LLM to do arithmetic. Use a calculator instead. Route deterministic work away from the LLM.

Production patterns that work

Human in the loop for anything that affects money, safety, or legal standing
Guardrails that catch LLM outputs before they reach production systems
Fallbacks for when the LLM is slow, down, or wrong
Evaluation with real test cases, not just vibes
Cost monitoring because token costs add up fast at scale

The honest take

AI lets you build things that weren't possible 3 years ago. It's not magic. It requires the same engineering discipline as any production system — monitoring, testing, error handling, cost control.

The teams shipping useful AI today are the ones who treat it like infrastructure, not like a party trick.