Own end-to-end architecture for GenAI apps (RAG, agents, fine-tuning) on GCC-hosted cloud. Translate use cases into secure, performant designs across model hosting, vector DBs, data pipelines, and observability.
Key responsibilities
Design reference architectures for LLM inference/fine-tuning (managed endpoints, GPUs, autoscaling, token/cost controls).
Deliver data-residency-compliant stacks (object stores, vector DBs, feature stores) and govern security (KMS, VPC, zero-trust).
Build retrieval (RAG) pipelines: chunking, embeddings, guardrails, eval frameworks; set SLOs for latency/cost/quality.
Benchmark models (open/commercial), select providers, and negotiate GPU/compute reservations with FinOps discipline.
Create IaC blueprints (Terraform) and golden paths for app teams; hand over to platform/SRE for production.
Set up monitoring and evals (hallucination, toxicity, grounding) and author incident runbooks.
Candidate profile
8–12+ yrs cloud architecture; 2–3+ yrs hands-on GenAI/LLM delivery.
Depth in one major cloud + GPU/accelerator stacks; strong with vector DBs, RAG patterns, model gateways.
Proven track record of security, compliance, and cost controls in regulated MENA contexts.
Compensation
Tax-free base + bonus; housing/transport allowances; family medical; visa/relocation.