isecjobs.com

Director, AI Alignment and Interpretability (Remote)

USA VA Remote, United States R

USD 195K-290K Executive-level Full Time

Apply Save
Found 3d ago
Tasks
Perks/Benefits
Skills/Tech-stack

AI alignment | Activation Patching | Adversarial ML | Artificial Intelligence | Behavioral Testing | Capability Elicitation | Causal tracing | Circuit analysis | Feature Visualization | Language Models | Large Language Models | Machine Learning | Mechanistic Interpretability | Offensive security | Probing Classifiers | Vulnerability research

Education

Master of Science | PhD

Roles

AI | AI Research Director | Director | Research Director | Research Scientist | Scientist

Regions

North America

Countries

United States

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs