Anthropic10.04.2026

Anthropic Fellows Program, Reinforcement Learning

London

Обязанности

  • 01Conduct research and implement solutions in reinforcement learning algorithms
  • 02Build model‑based tools to improve AI training data quality
  • 03Create RL environments for capability and safety‑related tasks
  • 04Analyze and debug large‑scale model training processes
  • 05Collaborate with mentors and cross‑disciplinary teams to produce public research outputs

Требования

  • 01Strong technical background in computer science, mathematics, physics or related field
  • 02Fluent in Python programming
  • 03Ability to work full‑time for a 4‑month fellowship
  • 04Work authorization and residence in the US, UK, or Canada
  • 05Motivation for AI safety and empirical AI research

Условия

  • 014 months full‑time (40 h/week) with possible extension
  • 02Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD plus benefits
  • 03Funding for compute up to ~$15k per month
  • 04Access to shared workspaces in Berkeley, CA or London, UK; remote option available
  • 05Mentorship from Anthropic researchers and access to AI safety community