Santa Clara, CAOnsite$200,000 - $350,0001 months ago
full-timeseniorcustom
About the Role
NVIDIA is looking for a Senior AI Infrastructure Engineer to work on our AI training and inference platform. You will develop high-performance computing solutions that enable customers to train and deploy the world's largest AI models.
This role involves working on CUDA optimization, distributed training frameworks, and model serving infrastructure. You will push the boundaries of what's possible with GPU computing for AI workloads.
The ideal candidate has deep expertise in GPU programming, distributed systems, and ML infrastructure.
Requirements
- 5+ years of systems engineering experience
- Expert-level CUDA and C++ skills
- Experience with distributed training (NCCL, RDMA)
- Strong Python skills
- Experience with large-scale ML training
- Understanding of GPU architectures and memory management
Required Skills
C++CUDAPythonDistributed TrainingKubernetes
About NVIDIA AI
AI computing company. GPUs, CUDA, and AI infrastructure.