Role
You are the technical bridge between Sesterce's infrastructure capabilities and the compute requirements of enterprise and hyperscale customers — translating AI workloads into precise cluster designs that infrastructure, network, and ops teams execute against.
What you will do
- Engage with customers to understand workload profiles (LLM pre-training, fine-tuning, inference) and translate them into cluster sizing and topology recommendations
- Produce cluster design documents covering GPU type (H200, B200, GB300, Blackwell NVL), node count, InfiniBand topology, storage, and interconnect
- Define and validate performance benchmarks (FLOPS utilization, MFU, all-reduce bandwidth) before customer commitment
- Collaborate with the Sesterce OS team to align control plane (Slurm + Kubernetes) with designed topology and scheduling requirements
- Support commercial negotiations with BoM estimates, lead times, and technical risk assessments
What we are looking for
- 5+ years in HPC or AI infrastructure, with direct involvement in GPU cluster design or technical pre-sales
- Hands-on familiarity with NVIDIA GPU architectures (Hopper, Blackwell), NVLink, NVSwitch, and multi-rail InfiniBand
- Ability to interpret AI workload performance profiles (model parallelism, pipeline vs. tensor parallelism) and map them to hardware
- Strong written communication — your design docs must be unambiguous and executable by engineering teams
- Background at or with cloud providers, HPC centers, or AI infrastructure vendors is a strong plus