Smartsheet, 07.03.2026

Senior AI/ML Ops Engineer (Hybrid in Bangalore)

Bangalore

Responsibilities

  • Designing, developing and overseeing the strategy and architecture of scalable and reliable AI/ML Ops platforms and pipelines
  • Packaging and deploying AI/ML services to production with reproducibility and interpretability
  • Designing and implementing automated CI/CD pipelines to accelerate model deployment
  • Provisioning and optimizing infrastructure for training and serving using Docker, Kubernetes or serverless platforms
  • Implementing post‑deployment monitoring for model performance, data drift and latency
  • Automating retraining and data pipeline workflows to maintain model accuracy
  • Managing deployment of foundation models, fine‑tuning workflows and Retrieval‑Augmented Generation stacks
  • Optimizing GPU/CPU utilization to minimize cloud costs while ensuring low‑latency inference
  • Collaborating with data scientists, data engineers and software engineers to bridge development and production
  • Managing versioning for data, code and models using tools such as MLflow
  • Implementing data security measures and ensuring compliance with governance policies
  • Evaluating emerging data technologies and driving innovation in data infrastructure
  • Diagnosing and resolving complex data‑related issues to ensure platform stability
  • Performing other duties as assigned

Requirements

  • Experience with enterprise SaaS solutions requiring high availability and scalability
  • Hands‑on experience building and maintaining AI/ML Ops platforms at scale
  • Strong background in system design and AI/ML frameworks handling large structured and unstructured datasets
  • Deep experience with the Databricks Lakehouse ecosystem and AI/ML workflows on Databricks, MLflow, Mosaic AI, Unity Catalog, Vector Search and Knowledge Graphs
  • Knowledge of AI/ML pipeline integration frameworks such as LangChain and LangGraph
  • Proficiency with at least one major cloud provider (AWS, Azure or GCP), preferably with AWS‑hosted data platforms
  • Programming proficiency in Python and SQL
  • Experience with modern software engineering practices: Kubernetes, CI/CD, Infrastructure‑as‑Code (Terraform preferred), observability and alerting
  • Ability to optimize solution costs and design for cost efficiency
  • Legal eligibility to work in India on an ongoing basis