Vacancy Description
Are you looking for an exciting new opportunity?
Join a stealth-mode hyperscale data center startup building a next‑generation AI and cloud platform designed for startups and advanced research, powered by thousands of H100, H200, and B200 GPUs available on demand. Their platform supports everything from rapid experimentation to full‑scale model training and inference, with flexible orchestration via Slurm, Kubernetes, or direct SSH access. This is a rare opportunity to work at the intersection of hyperscale infrastructure and AI, shaping the operational backbone of one of the largest GPU clusters in private deployment.
If you want to build and operate infrastructure for frontier AI workloads, automate systems at petascale, and be part of a founding engineering team, this is the place to do it.
Responsibilities
- Design, deploy, and maintain large-scale GPU clusters (H100/H200/B200) for training and inference workloads.
- Build automat...
Ready to Apply?
अभी आवेदन करें
Submit your application for Senior Site Reliability Engineer (SRE) - AI Infrastructure at Hamilton Barnes Associates Limited
Apply for this Position