Anthropic, 09.03.2026
Safeguards Enforcement Analyst, Safety Evaluations
Remote-Friendly (Travel-Requ…
Responsibilities
- Support model launch readiness by running evaluations, monitoring and interpreting results, and surfacing regressions or unexpected behavior changes to relevant stakeholders
- Partner closely with policy and domain experts throughout the evaluation lifecycle, from identifying risks and scoping the right evaluation approach to coordinating creation of new evals and ensuring existing ones remain current with evolving policies, threat vectors, and model capabilities
- Work with cross-functional stakeholders to help manage evaluation outcomes, including interpreting results and driving mitigations where needed
- Think strategically about eval quality to build processes and eval paradigms that keep evaluations unsaturated, high-signal, and insightful as models improve
- Build out processes and frameworks for creating product-specific evaluations as Anthropic's product surface area expands
- Help design and scope tooling improvements that accommodate evolving eval needs and expand self-serve eval creation and iteration for non-technical users
- Write and maintain rigorous documentation for evaluation creation, execution, and interpretation as the team builds out eval tooling and processes
Requirements
- Have experience in trust and safety, content operations, policy enforcement, or a related operational role at a technology company
- Thrive in ambiguous, fast-moving environments: you're energized rather than frustrated when the path forward isn't clearly defined and you need to figure it out as you go
- Have experience building processes, workflows, or programs from scratch (zero-to-one work), not just maintaining existing ones
- Have strong program management instincts, naturally creating structure around complex, multi-stakeholder efforts by tracking timelines, dependencies, and deliverables to keep work on track
- Are eager to expand your technical toolkit, including adopting internal tools and AI-assisted workflows (e.g., Claude Code) to accelerate your work
- Can manage multiple concurrent workstreams across different domain areas without losing track of details; strong prioritization and context-switching are essential when deadlines and priorities shift quickly
- Are a strong generalist comfortable moving fluidly across different types of work and switching contexts throughout the day
- Are comfortable making judgment calls with incomplete information and escalating appropriately when needed
- Communicate clearly and concisely, both in writing and cross-functionally
- Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
- Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Conditions
- The annual compensation range for this role is $230,000 to $270,000 USD
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time
- Visa sponsorship: We do sponsor visas