Smartsheet07.03.2026
Senior AI/ML Ops Engineer (Hybrid in Bangalore)
Bangalore
Обязанности
- 01Designing, developing and overseeing the strategy and architecture of scalable and reliable AI/ML Ops platforms and pipelines
- 02Packaging and deploying AI/ML services to production with reproducibility and interpretability
- 03Designing and implementing automated CI/CD pipelines to accelerate model deployment
- 04Provisioning and optimizing infrastructure for training and serving using Docker, Kubernetes or serverless platforms
- 05Implementing post‑deployment monitoring for model performance, data drift and latency
- 06Automating retraining and data pipeline workflows to maintain model accuracy
- 07Managing deployment of foundation models, fine‑tuning workflows and Retrieval‑Augmented Generation stacks
- 08Optimizing GPU/CPU utilization to minimize cloud costs while ensuring low‑latency inference
- 09Collaborating with data scientists, data engineers and software engineers to bridge development and production
- 10Managing versioning for data, code and models using tools such as MLflow
- 11Implementing data security measures and ensuring compliance with governance policies
- 12Evaluating emerging data technologies and driving innovation in data infrastructure
- 13Diagnosing and resolving complex data‑related issues to ensure platform stability
- 14Performing other duties as assigned
Требования
- 01Experience with enterprise SaaS solutions requiring high availability and scalability
- 02Hands‑on experience building and maintaining AI/ML Ops platforms at scale
- 03Strong background in system design and AI/ML frameworks handling large structured and unstructured datasets
- 04Deep experience with Databricks Lakehouse ecosystem and AI/ML workflows on Databricks, MLflow, Mosaic AI, Unity Catalog, Vector Search and Knowledge Graphs
- 05Knowledge of AI/ML pipeline integration frameworks such as LangChain and LangGraph
- 06Proficiency with at least one major cloud provider (AWS, Azure or GCP), preferably AWS hosted data platforms
- 07Programming proficiency in Python and SQL
- 08Experience with modern software engineering practices: Kubernetes, CI/CD, Infrastructure‑as‑Code (Terraform preferred), observability and alerting
- 09Ability to optimize solution costs and design for cost efficiency
- 10Legal eligibility to work in India on an ongoing basis