Anthropic14.04.2026

Full-Stack Software Engineer, Reinforcement Learning

San Francisco

Обязанности

01Build and extend web platforms for RL environment creation, management, and quality review — including environment configuration, versioning, and validation workflows
02Develop vendor-facing interfaces and tooling that let external partners create, submit, and iterate on training environments with minimal friction
03Design and implement platforms for human data collection at scale, including labeling workflows, quality assurance systems, and feedback mechanisms that surface reward signal integrity issues early
04Build evaluation dashboards and observability UIs that give researchers real-time insight into environment quality, training run health, and reward hacking
05Create backend services and APIs that connect environment authoring tools, data collection systems, and RL training infrastructure
06Build and expand scalable code data generation pipelines, producing diverse programming tasks with robust reward signals across languages and difficulty levels
07Develop onboarding automation and documentation tooling so new vendors and internal users ramp up in hours, not weeks
08Partner closely with RL researchers, data operations, and vendor management to translate ambiguous requirements into well-scoped, well-designed products

01Strong software engineering fundamentals and real full-stack range — comfortable owning a surface from database schema to frontend
02Proficient in Python and a modern web stack (React, TypeScript, or similar)
03Track record of shipping systems that solved a hard problem, not just shipped on time
04Operate with high agency: identify what needs to be done and drive it forward without waiting for a ticket
05Thrive in a fast-moving environment where priorities shift and the next problem is often one nobody has solved before
06Care about Anthropic's mission to build safe, beneficial AI and want your work to contribute directly to it
07Communicate clearly with researchers, operations teams, and engineers, and can turn vague asks into well-scoped work