Grafana Labs07.05.2026

Senior Software Engineer - Grafana Databases, Managed Services | Spain | Remote

Spain (Remote)

Обязанности

  • 01Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure
  • 02Diagnosing and eliminating cross-layer failure modes (e.g., object storage latency, noisy neighbors, control-plane bottlenecks, query performance regressions, etc.)
  • 03Designing safe upgrade and rollout strategies at scale
  • 04Improving observability, automation, and operational ergonomics
  • 05Partnering closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance
  • 06Working directly with distributed systems behavior, Kubernetes scheduling dynamics, storage engines, compression trade-offs, etc.
  • 07Serving as a primary escalation point and on-call for relevant incidents
  • 08Owning the relationship with all system vendors, including WarpStream Labs and others
  • 09Reviewing and defining SLOs for shared database infrastructure, proactively reducing error budgets
  • 10Improving the diagnosability of core streaming and database systems in production
  • 11Implementing solutions that ensure reliability, scalability, and performance of high-throughput, multi-cloud infrastructure
  • 12Developing fault-tolerant patterns that account for distributed system realities
  • 13Planning and executing safe upgrades and rollouts across dozens of production clusters
  • 14Collaborating with database and platform engineering leaders to influence architecture, roadmap priorities, and long-term strategy
  • 15Participating in PR review and contributing to design documents, automation, tooling, and code improvements that reduce operational risk
  • 16Sharing best practices and distributed systems knowledge with partner teams
  • 17Participating in incident response, from investigation through resolution and post-incident reviews (PIR)

Требования

  • 016+ years of engineering experience
  • 02Meaningful time in SRE, platform engineering, production engineering, infrastructure engineering, or distributed systems roles
  • 03Experience operating distributed systems in production (e.g., streaming systems, analytical databases, large-scale storage backends)
  • 04Independent attitude
  • 05Good communication skills

Условия

  • 01Remote opportunity
  • 02Applicants living in Spain time zones only
  • 03Remote-first company
  • 04Regular 1:1s with your manager
  • 05Close collaboration with teammates across regions
  • 06On-call component
  • 07Company-funded AI coding assistant usage budget
  • 08Access to frontier models (e.g., GPT-Codex 5/3, Claude Opus 4.6, Gemini 3 Pro)