production AI (agentic workflows, RAG, evals/guardrails) across multi-cloud. Land secure, cost-efficient LLM platforms for product and internal automation teams.
Responsibilities:
Design reference architectures for inference/fine-tuning; build retrieval (chunking, embeddings, vector DB), safety, and eval pipelines.
Create golden paths (Terraform/IaC), SLOs for latency/cost/quality, and model routing across OSS/commercial providers.
Stand up observability (hallucination, grounding, toxicity) and incident runbooks; govern data residency/compliance.
Coach platform & app squads; publish playbooks and migration guides.
You bring
10–12+ yrs platform/cloud; 3+ yrs shipping GenAI in production at scale.
Depth in GPUs/accelerators, model gateways, vector DBs; strong security/compliance chops.
Executive-level communication.
Comp (guide): $220k–$290k base + bonus/equity; relocation optional.