Engineering
Engineering··7 min read

AI in production: what actually works in 2026

Document processing, intelligent search, workflow automation — these work today. Fully autonomous agents — still fragile. An honest assessment.

AI in production is different from AI in demos. Here's what we've actually shipped and what we've watched fail.

What works today

Document processing. LLMs extract structured data from PDFs, contracts, invoices, and forms reliably. We've built document workflows that replace hours of manual data entry.

Intelligent search. Vector databases plus LLMs let users ask natural language questions about their own documents, codebases, or knowledge bases. This is production-ready.

Automated workflows. LLMs orchestrate multi-step tasks: classify an incoming email, extract relevant fields, update a CRM, draft a response. With good guardrails, this works.

Natural language interfaces. Turn unstructured user input into structured actions. Good for internal tools where users shouldn't need to learn UI.

Code assistance. Not replacing engineers, but augmenting them meaningfully for boilerplate, refactoring, and documentation.

What doesn't work yet

Fully autonomous agents for complex tasks. Multi-hour tasks with dozens of decisions still break in weird ways. Great for demos. Fragile in production.

Replacing domain expertise. LLMs fail at tasks where a human expert combines deep context, judgment, and accountability. Don't ship AI-as-the-decision-maker for critical paths.

Deterministic calculations. Ask an LLM to do arithmetic. Use a calculator instead. Route deterministic work away from the LLM.

Production patterns that work

  • Human in the loop for anything that affects money, safety, or legal standing
  • Guardrails that catch LLM outputs before they reach production systems
  • Fallbacks for when the LLM is slow, down, or wrong
  • Evaluation with real test cases, not just vibes
  • Cost monitoring because token costs add up fast at scale

The honest take

AI lets you build things that weren't possible 3 years ago. It's not magic. It requires the same engineering discipline as any production system — monitoring, testing, error handling, cost control.

The teams shipping useful AI today are the ones who treat it like infrastructure, not like a party trick.

Start a project

Cartwheel Galaxy · Custom Software Platforms