Hiring MLOPS ENGINEER (AI Infrastructure)

April 17, 2026

Careers

Hiring MLOps Engineer at Trivita AI to manage AI infrastructure, deploy LLMs, optimize GPU usage, and build scalable MLOps pipelines.

Deploy and manage LLM serving systems (vLLM, TGI, Triton) and optimize inference performance using techniques such as PagedAttention and quantization
Build and manage GPU infrastructure, including NVIDIA MIG configuration for efficient resource utilization
Design and develop CI/CD pipelines for machine learning, including automated testing, deployment, and model versioning
Implement monitoring systems for GPU health, latency, throughput, and model drift
Design auto-scaling mechanisms for AI workloads on Kubernetes
Collaborate with Backend Developers and AI Researchers to integrate models into production systems

Strong expertise in Docker and Kubernetes, including GPU operator and resource scheduling
Deep understanding of NVIDIA GPU architecture and MIG
Proficiency in Python and shell scripting
Experience with ML frameworks such as PyTorch or TensorFlow
Experience with MLOps tools (Kubeflow, MLflow, Weights & Biases)
Experience with cloud platforms (AWS, GCP)
Familiarity with backend languages (Java or Go) is a plus