Anthropic09.03.2026

Safeguards Enforcement Analyst, Safety Evaluations

Remote-Friendly (Travel-Requ…

Обязанности

  • 01Support model launch readiness by running evaluations, monitoring and interpreting results, and surfacing regressions or unexpected behavior changes to relevant stakeholders
  • 02Partner closely with policy and domain experts throughout the evaluation lifecycle — from identifying risks and scoping the right evaluation approach, to coordinating creation of new evals and ensuring existing ones remain current with evolving policies, threat vectors, and model capabilities
  • 03Work with cross-functional stakeholders to help manage evaluation outcomes, including interpreting results and driving mitigations where needed
  • 04Think strategically about eval quality to build processes and eval paradigms that keep evaluations unsaturated, high-signal, and insightful as models improve
  • 05Build out processes and frameworks for creating product-specific evaluations as Anthropic's product surface area expands
  • 06Help design and scope tooling improvements that accommodate evolving eval needs and expand self-serve eval creation and iteration for non-technical users
  • 07Write and maintain rigorous documentation for evaluation creation, execution, and interpretation as the team builds out eval tooling and processes

Требования

  • 01Have experience in trust and safety, content operations, policy enforcement, or a related operational role at a technology company
  • 02Thrive in ambiguous, fast-moving environments — you're energized rather than frustrated when the path forward isn't clearly defined and you need to figure it out as you go
  • 03Have experience building processes, workflows, or programs from scratch (zero-to-one work), not just maintaining existing ones
  • 04Have strong program management instincts, naturally creating structure around complex, multi-stakeholder efforts by tracking timelines, dependencies, and deliverables to keep work on track
  • 05Are eager to expand your technical toolkit, including adopting internal tools and AI-assisted workflows (e.g., Claude Code) to accelerate your work
  • 06Can manage multiple concurrent workstreams across different domain areas without losing track of details — strong prioritization and context-switching are essential when deadlines and priorities shift quickly
  • 07Are a strong generalist comfortable moving fluidly across different types of work and switching contexts throughout the day
  • 08Are comfortable making judgment calls with incomplete information and escalating appropriately when needed
  • 09Communicate clearly and concisely, both in writing and cross-functionally
  • 10Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
  • 11Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Условия

  • 01The annual compensation range for this role is $230,000 — $270,000 USD
  • 02Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time
  • 03Visa sponsorship: We do sponsor visas