Anthropic, 09.03.2026

Safeguards Enforcement Analyst, Safety Evaluations

Remote-Friendly (Travel-Requ…

Responsibilities

01 Support model launch readiness by running evaluations, monitoring and interpreting results, and surfacing regressions or unexpected behavior changes to relevant stakeholders
02 Partner closely with policy and domain experts throughout the evaluation lifecycle, from identifying risks and scoping the right evaluation approach to coordinating creation of new evals and ensuring existing ones remain current with evolving policies, threat vectors, and model capabilities
03 Work with cross-functional stakeholders to help manage evaluation outcomes, including interpreting results and driving mitigations where needed
04 Think strategically about eval quality to build processes and eval paradigms that keep evaluations unsaturated, high-signal, and insightful as models improve
05 Build out processes and frameworks for creating product-specific evaluations as Anthropic's product surface area expands
06 Help design and scope tooling improvements that accommodate evolving eval needs and expand self-serve eval creation and iteration for non-technical users
07 Write and maintain rigorous documentation for evaluation creation, execution, and interpretation as the team builds out eval tooling and processes

Requirements

01 Have experience in trust and safety, content operations, policy enforcement, or a related operational role at a technology company
02 Thrive in ambiguous, fast-moving environments: you're energized rather than frustrated when the path forward isn't clearly defined and you need to figure it out as you go
03 Have experience building processes, workflows, or programs from scratch (zero-to-one work), not just maintaining existing ones
04 Have strong program management instincts, naturally creating structure around complex, multi-stakeholder efforts by tracking timelines, dependencies, and deliverables to keep work on track
05 Are eager to expand your technical toolkit, including adopting internal tools and AI-assisted workflows (e.g., Claude Code) to accelerate your work
06 Can manage multiple concurrent workstreams across different domain areas without losing track of details; strong prioritization and context-switching are essential when deadlines and priorities shift quickly
07 Are a strong generalist comfortable moving fluidly across different types of work and switching contexts throughout the day
08 Are comfortable making judgment calls with incomplete information and escalating appropriately when needed
09 Communicate clearly and concisely, both in writing and cross-functionally
10 Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
11 Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Conditions

01 The annual compensation range for this role is $230,000 to $270,000 USD
02 Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time
03 Visa sponsorship: We do sponsor visas

Similar jobs

Safeguards Policy Analyst, Fraud & Scams

Research Engineer, Safeguards Labs

Software Engineer, Safeguards Foundations (Internal Tooling)

Security Labs Engineer

Engineering Manager, Agent Prompts & Evals

Research Lead, Training Insights