Engineering · Remote · Full time

Solution Architect, Cluster Design

Translate customer AI workloads into precise, production-ready GPU cluster designs.

Role

You are the technical bridge between Sesterce's infrastructure capabilities and the compute requirements of enterprise and hyperscale customers — translating AI workloads into precise cluster designs that infrastructure, network, and ops teams execute against.

What you will do

Engage with customers to understand workload profiles (LLM pre-training, fine-tuning, inference) and translate them into cluster sizing and topology recommendations
Produce cluster design documents covering GPU type (H200, B200, GB300, Blackwell NVL), node count, InfiniBand topology, storage, and interconnect
Define and validate performance benchmarks (FLOPS utilization, MFU, all-reduce bandwidth) before customer commitment
Collaborate with the Sesterce OS team to align the control plane (Slurm + Kubernetes) with the designed topology and scheduling requirements
Support commercial negotiations with BoM estimates, lead times, and technical risk assessments

What we are looking for

5+ years in HPC or AI infrastructure, with direct involvement in GPU cluster design or technical pre-sales
Hands-on familiarity with NVIDIA GPU architectures (Hopper, Blackwell), NVLink, NVSwitch, and multi-rail InfiniBand
Ability to interpret AI workload performance profiles (model parallelism, pipeline vs. tensor parallelism) and map them to hardware
Strong written communication — your design docs must be unambiguous and executable by engineering teams
Background at or with cloud providers, HPC centers, or AI infrastructure vendors is a strong plus