Careers-AI Platform Engineer-GPU & Intelligent Proxy Infrastructure

Animbus Logo


Book A Meeting


Contact us

AI Platform Engineer ​

(GPU & Intelligent Proxy Infrastructure)​

Power the infrastructure behind enterprise AI. At Animbus, you’ll design and manage the high-density GPU environments and Kubernetes platforms that keep mission-critical AI in production — reliably, securely, and at scale.


APPLY NOW

WHO WE ARE

POWERING ENTERPRISE AI

Animbus powers enterprise AI with managed infrastructure built for mission-critical performance. We enable organizations to move beyond experimentation and confidently scale AI into production through secure, high-performance, and fully managed computing environments.

Our platform integrates high-density GPU-driven NeoCloud infrastructure for training and inference, unified AI workload orchestration across hybrid and multi-cloud ecosystems, SRE-led operational excellence, enterprise-grade security, and transparent cost-efficient pricing. At Animbus, we remove the complexity of building and managing AI infrastructure — so enterprises can innovate faster, scale smarter, and focus on outcomes that matter.

7+

Years Experience Required

100%

Remote-Job

GPU

High-Density Infrastructure

Multi

Cloud Coverage

THE ROLE

What You’ll Be Doing

You’ll be the backbone of our AI infrastructure — building, automating, and maintaining the platforms that enterprise AI runs on.

• Architect and manage high-density GPU clusters for AI training and inference
• Design & deploy intelligent proxy layers for secure AI workload routing, API traffic control, & multi-tenant isolation
• Build and enhance Kubernetes-based AI platforms with advanced networking controls
• Implement GPU-aware scheduling, workload isolation, and performance optimization
• Develop Infrastructure-as-Code for compute, networking, and proxy deployments
• Configure secure ingress/egress controls, API gateways, and service mesh architectures
• Implement observability across GPU performance, proxy traffic, and distributed systems
• Collaborate with AI teams to optimize inference latency and training throughput

REQUIREMENTS

Skills & Experience

Must-Have skills

• 7+ years of experience in Platform Engineering, DevOps, or Cloud Infrastructure.
• strong expertise in Kubernetes, container networking, and cluster architecture.

• Hands-on experience managing GPU environments (NVIDIA ecosystem preferred).

• Experience with proxy technologies (NGINX, Envoy, HAProxy, or similar).
• Deep understanding of Linux systems, networking (TCP/IP, DNS, TLS), and load balancing.
• Experience with Infrastructure-as-Code (Terraform, Helm, Ansible, etc.).
• Strong scripting/programming skills (Python, Go, Bash, etc.).
• Exerience with AWS, Azure, or GCP.

Nice to have

• Experience with service mesh technologies (Istio, Linkerd, etc.)
• Familiarity with AI frameworks (PyTorch, TensorFlow, Hugging Face)

• Exposure to MLOps pipelines and AI workload orchestration tools

• Experience with GPU performance tuning and capacity planning.
• Knowledge of Zero Trust architecture and API security best practices.
• Cloud or Kubernetes certifications.

JOIN US

READY TO BUILD WHAT MATTERS

If you’re passionate about AI infrastructure and driving innovation at enterprise scale, we want to hear from you. Send your profile to hello@animbus.ai
and take the first step towards a rewarding career with Animbus. Can’t find a perfect match? Feel free to submit your resume for future consideration.


Apply now ➡


View all roles

Sitemap

Home

Technology

Solutions

Why Animbus

About

Technology

Overview

Animbus Intelligent Cloud

Global L2 Fabric

Unified Control Plane

Managed SRE & Compliance

Privacy Policy

Privacy Policy

Join Us!

Careers


WE ARE HIRING!

© Copyright Animbus. All Rights Reserved.