Careers

Solution Architect Cluster Design

Translate customer AI workloads into precise, production-ready GPU cluster designs.

Role

You are the technical bridge between Sesterce's infrastructure capabilities and the compute requirements of enterprise and hyperscale customers — translating AI workloads into precise cluster designs that infrastructure, network, and ops teams execute against.

What you will do

  • Engage with customers to understand workload profiles (LLM pre-training, fine-tuning, inference) and translate them into cluster sizing and topology recommendations
  • Produce cluster design documents covering GPU type (H200, B200, GB300, Blackwell NVL), node count, InfiniBand topology, storage, and interconnect
  • Define and validate performance benchmarks (FLOPS utilization, MFU, all-reduce bandwidth) before customer commitment
  • Collaborate with the Sesterce OS team to align control plane (Slurm + Kubernetes) with designed topology and scheduling requirements
  • Support commercial negotiations with BoM estimates, lead times, and technical risk assessments

What we are looking for

  • 5+ years in HPC or AI infrastructure, with direct involvement in GPU cluster design or technical pre-sales
  • Hands-on familiarity with NVIDIA GPU architectures (Hopper, Blackwell), NVLink, NVSwitch, and multi-rail InfiniBand
  • Ability to interpret AI workload performance profiles (model parallelism, pipeline vs. tensor parallelism) and map them to hardware
  • Strong written communication — your design docs must be unambiguous and executable by engineering teams
  • Background at or with cloud providers, HPC centers, or AI infrastructure vendors is a strong plus