Crusoe09.04.2026
Systems Engineer II, Compute
Полная занятостьОфис
Обязанности
- 01Design highly reliable and performant Linux applications used to manage virtualization stack across thousands of AI compute servers in multiple global datacenters
- 02Integrate Crusoe applications with a wide variety of hardware and software AI chip-vendor stacks
- 03Build solutions to optimize and monitor virtualized hardware (GPUs, Infiniband/ROCe NICs, Ephemeral Storage, etc.) in cutting-edge AI/HPC environments
- 04Work side by side with Linux Kernel and Hypervisor teams to ensure Crusoe applications are seamlessly integrated with a variety of kernels and hypervisors
- 05Analyze and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a focus on optimizing AI/ML workloads
- 06Diagnose and resolve complex system issues across the virtualization stack (drivers, kernel, hypervisor, guest OS, and Crusoe applications)
- 07Conduct thorough code reviews to ensure high software quality, reliability, and security within compute applications and virtualization stack
- 08Collaborate with other engineering teams (hardware design, OS development, AI/ML application teams) for cohesive product development
- 09Provide technical guidance and mentorship to junior engineers
Требования
- 01Experience building applications on Linux kernels, specifically pertaining to virtualization, device drivers, memory management, and process scheduling
- 02Solid understanding of hardware devices such as GPUs, CPUs, Infiniband and Ethernet NICs, Ephemeral Disks, and PCI Express
- 03Strong grasp of distributed applications and highly-scalable systems design
- 04Specific focus around communications protocols (gRPC, REST, TCP/IP), databases (Postgres, Redis), and systems design applications (Pub/Sub, Kafka)
- 05Strong experience building software applications at higher (Golang, Java, Python) and lower (C, C++, Rust) levels
- 06Keen eye for clean, maintainable code and a unit-test driven mindset
- 07Excellent communication skills to collaborate across teams
- 08Ability to rapidly adapt and learn new technologies
- 09General knowledge of hypervisors, virtual machine lifecycles, and Linux KVM tooling
- 10Understanding of Gitlab or GitHub CI/CD pipelines for bug-free code delivery
Условия
- 01Competitive compensation
- 02Restricted Stock Units
- 03Paid time off & paid holidays
- 04Comprehensive health, dental & vision insurance
- 05Employer contributions to HSA account
- 06Paid parental leave
- 07Paid life insurance, short-term and long-term disability
- 08Professional development & tuition reimbursement
- 09Mental health & wellness support
- 10Commuter benefits (parking & transit)
- 11Cell phone stipend
- 12401(k) Retirement plan