Smartsheet, 06.03.2026
Senior AI/ML Ops Engineer-II (Hybrid in Bangalore)
Bangalore
Responsibilities
- Design, develop, and oversee the strategy and architecture of scalable, reliable AI/ML Ops platforms and pipelines
- Package and deploy AI/ML services to production, ensuring they are reproducible and interpretable
- Design and implement automated CI/CD pipelines to accelerate model deployment
- Provision and optimize infrastructure for training and serving, utilizing Docker, Kubernetes, or serverless platforms
- Implement post-deployment monitoring for model performance, data drift, and latency
- Automate retraining and data pipeline workflows to ensure models stay accurate over time
- Manage the deployment of foundation models, fine-tuning workflows, and Retrieval-Augmented Generation (RAG) stacks (Vector DBs, Knowledge Graph)
- Manage GPU/CPU utilization to minimize cloud costs while maintaining low-latency inference for users
- Work closely with data scientists, data engineers, and software engineers to bridge the gap between model development and production
- Manage versioning for data, code, and models using tools like MLflow
- Implement data security measures that ensure compliance with data governance policies and protect sensitive data
- Stay abreast of emerging data technologies and explore opportunities for innovation to improve the organisation’s data infrastructure
- Diagnose and resolve complex data-related issues, ensuring the stability and reliability of the data platform
Requirements
- Enterprise SaaS software solutions with high availability and scalability
- Solutions handling large-scale structured and unstructured data from varied data sources
- Building and maintaining AI/ML Ops platform systems, ensuring scalability, reliability, efficiency, and security
- Working with the Product engineering team to influence designs with data, AI, and analytics use cases in mind
- In-depth experience in system design and in AI/ML frameworks and tools involving petabytes of data with the Databricks Lakehouse ecosystem
- AI/MLOps workflows on Databricks, MLflow, Mosaic AI Agent Framework, Unity Catalog, Vector Search, Knowledge Graph
- Knowledge of AI/ML frameworks like LangChain and LangGraph for AI/ML Ops pipeline integration
- Hands-on experience with at least one major cloud provider (AWS, Azure, or GCP)
- Experience with an AWS-hosted data platform is preferable
- Programming languages like Python and SQL
- Modern software engineering practices such as Kubernetes, CI/CD, IaC tools (preferably Terraform), observability, monitoring, and alerting
- Solution cost optimisation and design-to-cost
- Legally eligible to work in India on an ongoing basis
Conditions
- Hybrid work in Bangalore
- Work with global teams on innovative AI/ML solutions
- Opportunity to grow beyond your role and explore diverse challenges