xAI, 07.03.2026

Software Engineer - Kernels C++/CUDA

Palo Alto

Responsibilities

  • Design, build, and optimize massive GPU clusters for extreme-scale training and inference workloads
  • Develop and tune low-level CUDA kernels (GEMM, attention, etc.) using CUTLASS, Tensor Cores, and Nsight for maximum performance
  • Work on Linux kernel internals, scheduling, memory management, and resource isolation at cluster scale
  • Build custom container orchestration, virtualization layers (KVM, Firecracker, etc.), and distributed systems that go beyond standard Kubernetes
  • Profile, debug, and eliminate bottlenecks across the GPU memory hierarchy, networking fabric, filesystems, and multi-GPU operations
  • Create and maintain infrastructure-as-code, automation, and tools that keep the entire supercomputer reliable and efficient
  • Collaborate closely with AI research teams to deliver production-grade performance and scalability

Requirements

  • Deep low-level systems programming experience (C/C++ or Rust)
  • Experience building and operating high-performance, exabyte-scale storage systems
  • Strong experience with large-scale GPU clusters or distributed compute infrastructure at production scale
  • Hands-on work with GPU kernel optimization (CUTLASS, custom kernels, Nsight profiling)
  • Experience with Linux kernel internals, scheduling, virtualization, or large-scale orchestration
  • Track record of building or running high-performance infrastructure for AI workloads (training or inference platforms)
  • Ability to reason from first principles and optimize for both memory-bound and compute-bound scenarios

Compensation and Benefits

  • Base salary: $180,000 - $440,000 USD
  • Equity compensation
  • Comprehensive medical, vision, and dental coverage
  • Access to a 401(k) retirement plan
  • Short- and long-term disability insurance
  • Life insurance
  • Various other discounts and perks
  • Equal opportunity employer