Crusoe09.04.2026

Systems Engineer II, Compute

Полная занятостьОфис

Обязанности

  • 01Design highly reliable and performant Linux applications used to manage virtualization stack across thousands of AI compute servers in multiple global datacenters
  • 02Integrate Crusoe applications with a wide variety of hardware and software AI chip-vendor stacks
  • 03Build solutions to optimize and monitor virtualized hardware (GPUs, Infiniband/ROCe NICs, Ephemeral Storage, etc.) in cutting-edge AI/HPC environments
  • 04Work side by side with Linux Kernel and Hypervisor teams to ensure Crusoe applications are seamlessly integrated with a variety of kernels and hypervisors
  • 05Analyze and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a focus on optimizing AI/ML workloads
  • 06Diagnose and resolve complex system issues across the virtualization stack (drivers, kernel, hypervisor, guest OS, and Crusoe applications)
  • 07Conduct thorough code reviews to ensure high software quality, reliability, and security within compute applications and virtualization stack
  • 08Collaborate with other engineering teams (hardware design, OS development, AI/ML application teams) for cohesive product development
  • 09Provide technical guidance and mentorship to junior engineers

Требования

  • 01Experience building applications on Linux kernels, specifically pertaining to virtualization, device drivers, memory management, and process scheduling
  • 02Solid understanding of hardware devices such as GPUs, CPUs, Infiniband and Ethernet NICs, Ephemeral Disks, and PCI Express
  • 03Strong grasp of distributed applications and highly-scalable systems design
  • 04Specific focus around communications protocols (gRPC, REST, TCP/IP), databases (Postgres, Redis), and systems design applications (Pub/Sub, Kafka)
  • 05Strong experience building software applications at higher (Golang, Java, Python) and lower (C, C++, Rust) levels
  • 06Keen eye for clean, maintainable code and a unit-test driven mindset
  • 07Excellent communication skills to collaborate across teams
  • 08Ability to rapidly adapt and learn new technologies
  • 09General knowledge of hypervisors, virtual machine lifecycles, and Linux KVM tooling
  • 10Understanding of Gitlab or GitHub CI/CD pipelines for bug-free code delivery

Условия

  • 01Competitive compensation
  • 02Restricted Stock Units
  • 03Paid time off & paid holidays
  • 04Comprehensive health, dental & vision insurance
  • 05Employer contributions to HSA account
  • 06Paid parental leave
  • 07Paid life insurance, short-term and long-term disability
  • 08Professional development & tuition reimbursement
  • 09Mental health & wellness support
  • 10Commuter benefits (parking & transit)
  • 11Cell phone stipend
  • 12401(k) Retirement plan