Power the infrastructure behind enterprise AI. At Animbus, you’ll design and manage the high-density GPU environments and Kubernetes platforms that keep mission-critical AI in production — reliably, securely, and at scale.
Animbus powers enterprise AI with managed infrastructure built for mission-critical performance. We enable organizations to move beyond experimentation and confidently scale AI into production through secure, high-performance, and fully managed computing environments.
Our platform integrates high-density GPU-driven NeoCloud infrastructure for training and inference, unified AI workload orchestration across hybrid and multi-cloud ecosystems, SRE-led operational excellence, enterprise-grade security, and transparent cost-efficient pricing. At Animbus, we remove the complexity of building and managing AI infrastructure — so enterprises can innovate faster, scale smarter, and focus on outcomes that matter.
7+
Years Experience Required
100%
Remote-Job
GPU
High-Density Infrastructure
Multi
Cloud Coverage
THE ROLE
What You’ll Be Doing
You’ll be the backbone of our AI infrastructure — building, automating, and maintaining the platforms that enterprise AI runs on.
• Architect and manage high-density GPU clusters for AI training and inference • Design & deploy intelligent proxy layers for secure AI workload routing, API traffic control, & multi-tenant isolation • Build and enhance Kubernetes-based AI platforms with advanced networking controls • Implement GPU-aware scheduling, workload isolation, and performance optimization • Develop Infrastructure-as-Code for compute, networking, and proxy deployments • Configure secure ingress/egress controls, API gateways, and service mesh architectures • Implement observability across GPU performance, proxy traffic, and distributed systems • Collaborate with AI teams to optimize inference latency and training throughput
REQUIREMENTS
Skills & Experience
Must-Have skills
• 7+ years of experience in Platform Engineering, DevOps, or Cloud Infrastructure. • strong expertise in Kubernetes, container networking, and cluster architecture.
• Experience with proxy technologies (NGINX, Envoy, HAProxy, or similar). • Deep understanding of Linux systems, networking (TCP/IP, DNS, TLS), and load balancing. • Experience with Infrastructure-as-Code (Terraform, Helm, Ansible, etc.). • Strong scripting/programming skills (Python, Go, Bash, etc.). • Exerience with AWS, Azure, or GCP.
Nice to have
• Experience with service mesh technologies (Istio, Linkerd, etc.) • Familiarity with AI frameworks (PyTorch, TensorFlow, Hugging Face)
• Exposure to MLOps pipelines and AI workload orchestration tools
• Experience with GPU performance tuning and capacity planning. • Knowledge of Zero Trust architecture and API security best practices. • Cloud or Kubernetes certifications.
JOIN US
READY TO BUILD WHAT MATTERS
If you’re passionate about AI infrastructure and driving innovation at enterprise scale, we want to hear from you. Send your profile to hello@animbus.ai
and take the first step towards a rewarding career with Animbus. Can’t find a perfect match? Feel free to submit your resume for future consideration.