Senior DevOps / ML Infrastructure Engineer - AI Lab
Secure Global Money Transfers with Cutting-Edge Technology. Join our mission to protect cross-border transactions, helping customers send money safely worldwide. As a Senior DevOps / ML Infrastructure Engineer in our AI Lab, you'll maintain and scale our infrastructure while enabling seamless ML model integration into production workflows. You'll work alongside our Senior MLOps Architect to build a comprehensive ML platform that serves multiple teams across the organization.
What You'll Do:
- Manage multiple orchestration platforms: Kubernetes in AWS (CloudFormation) and on-prem Kubernetes clusters-
- Maintain Apache Flink infrastructure (managed in AWS or self-hosted in on-prem Kubernetes)
- Handle production support, incident response, and on-call rotations
- Perform regular patching activities and security vulnerability remediation
- Support and maintain workflow engine infrastructure
- Improve observability by utilizing Prometheus, Grafana, Splunk, Slack alerts, etc.
MLOps & Platform Development:
- Collaborate with Senior MLOps Architect to build and maintain ML infrastructure
- Set up and configure MLflow for experiment tracking and model registry
- Build automated MLOps pipelines for model training, experimentation, and deployment (Champion-Challenger, shadow mode)
- Support feature calculation pipelines and ETL processes
- Enable model serving infrastructure for Python-based ML services