GitLab24.04.2026

Site Reliability Engineer, Cloud Cost Utilization

Remote

Обязанности

  • 01Design and maintain cloud resource tagging and labeling strategies across GCP and AWS to support accurate cost attribution
  • 02Develop tooling and pipelines to ingest, normalize, and report on cloud billing data using the FOCUS specification
  • 03Automate cost anomaly detection, forecasting, and alerting so engineering teams can respond quickly to changes in infrastructure spend
  • 04Contribute to GitLab's observability and monitoring stacks, including Prometheus, LGTM (Loki, Grafana, Tempo, and Mimir), and ELK, with a focus on surfacing cost efficiency signals
  • 05Partner with Finance and Engineering leadership to support cloud cost forecasting for planning and budget discussions
  • 06Act as a subject matter expert for cloud cost attribution, tagging strategy, and FOCUS adoption across GitLab Infrastructure
  • 07Collaborate with Finance and Compliance teams on audits, certifications, and financial reporting needs related to cloud infrastructure usage
  • 08Contribute to infrastructure-as-code efforts, including Terraform and Ansible, so cost controls and tagging requirements are built into provisioning workflows from the start

Требования

  • 01Hands-on experience with cloud cost management in GCP and/or AWS, including billing data, pricing models, and optimization approaches
  • 02Familiarity with, or interest in adopting, the FinOps FOCUS specification for multi-cloud cost analysis
  • 03Experience designing or implementing cloud resource tagging and labeling strategies and improving adoption across teams
  • 04Comfort working across technical and business functions, including Engineering, Finance, and other stakeholders
  • 05Experience with infrastructure as code, including Terraform and Ansible
  • 06Familiarity with observability tooling, including Grafana, and an understanding of how reliability and cost signals can be connected
  • 07Ability to explain technical cost data clearly to non-engineering audiences and support informed decision-making
  • 08A self-directed approach to work, with comfort operating in a fully remote and asynchronous environment

Условия

  • 01Flexible Paid Time Off
  • 02Team Member Resource Groups
  • 03Equity Compensation & Employee Stock Purchase Plan
  • 04Growth and Development Fund
  • 05Parental Leave
  • 06Remote work
  • 07Asynchronous work environment