Exa23.02.2026

Software Engineer, Infrastructure

Полная занятостьОфис

Обязанности

  • 01Build the Kubernetes orchestration on a $20M GPU cluster
  • 02Scale our AWS batchjob system to handle map reduce jobs over 10s of thousands of machines
  • 03Design GPU scheduling software so we max out our cluster utilization
  • 04Build observability into our production systems

Требования

  • 01You have experience designing and operating large-scale infrastructure - GPU clusters or large Kubernetes clusters or cloud batchjob systems
  • 02You bring an obsessive mindset — always thinking about reliability, observability, and optimization across the entire stack

Условия

  • 01This is an in-person opportunity in Singapore
  • 02We’re happy to sponsor international candidates
  • 03In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees