Director, AI Alignment and Interpretability (Remote)

USA VA Remote, United States R

USD 195K-290K Executive-level Full Time

@ C...

Found 3d ago

Tasks

Apply behavioral constraints and deployment guardrails
Conduct circuit analysis for security capabilities
Detect offensive misuse signals in model internals
Develop evaluation framework and benchmarks
Lead mechanistic explanations of model behavior
Own alignment and interpretability research agenda
Perform probing-classifier analysis of vulnerability representations
Publish original research
Recruit and develop research scientists
Run behavioral testing and capability elicitation
Set priorities for open problems
Set technical bar through personal contributions
Translate findings into training interventions
Use activation analysis for risk surfacing

Education

Master of Science | PhD

Regions

North America

Countries

United States

Related jobs

AI Security Engineer - Mid-Atlantic region (Remote in VA, MD, PA, NC, DE, NJ, or DC) USD 110K-176K

AI Foundry | API Integration | AWS Bedrock | AWS CloudFormation | AWS SageMaker

Corporate holidays | Dental insurance | Flexible time off | Home internet allowance | Medical insurance

Senior-level Full Time

Remote R

2d ago
AI Security Engineer USD 100K-150K

Access Control | Access Management | Adversarial ML | Application Security | Authorization

Senior-level Full Time

United States - Remote R

2d ago
Data Scientist (Remote) USD 120K-180K

Abuse Resistance | Agent safety | Agentic Planning | Data scaling | DeepSpeed

Employee networks | Great Place to Work certified | Office culture | Paid adoption leave | Paid parental leave

Mid-level Full Time

USA VA Remote, United States R

2d ago
AI Solutions Architect - Federal USD 170K-240K

AWS | Adversarial Machine Learning | Agentic AI | Air-gapped | Air-gapped environments

Annual workspace upgrades | Flexible time off | Fully remote | Home office stipend | Internet and phone stipend

Senior-level Full Time

Remote- US R

3d ago
AI Security Architect (REMOTE - United States) USD 140K-195K

AI Security | Artificial Intelligence | Azure | Azure Data | Azure Data Lake

Remote work environment

Senior-level Full Time

Franklin, TN R

3d ago
AI Security Engineer USD 100K-150K

Access Controls | Access Management | Adversarial Machine Learning | Application Security | Authorization

Senior-level Full Time

United States - Remote R

3d ago
AI Security Engineer USD 100K-150K

Access Management | Adversarial Machine Learning | Application Security | Cloud Security | Cryptography

Long term multi year engagement | Remote work | Visa transfer support for qualified candidates

Senior-level Full Time

United States - Remote R

3d ago
AI Security Engineer USD 100K-150K

Access Management | Adversarial Machine Learning | Application Security | Authorization | Cloud Security

Health benefits | Remote work | W2 employment

Senior-level Full Time

United States - Remote R

3d ago
Urgent Hiring: NLP Architect (Security Architect) | Hybrid Role | Local to Texas Preferred USD 124K-188K

AI Governance | AI platforms | Agent systems | Cloud AI | Cloud AI Platforms

1 week onsite every month | Hybrid work

Senior-level Contract Full Time

Houston, TX, United States R

4d ago
Director, Product Management, Customer Security Outcomes USD 199K-285K

Artificial Intelligence | Automation | B2B | Cybersecurity | Generative AI

Education reimbursement | Health plans | Parental leave options | Remote work | Retirement options

Executive-level Full Time

Remote - USA R

4d ago
Director of AI & Machine Learning USD 194K-272K

AI Governance | API Integration | Access Control | Audit Logging | Cloud Computing

401k plan | Company-Paid Holidays | Corporate discounts | Dental insurance | Health insurance

Executive-level Full Time

Remote (All), United States R

4d ago
AI Security Engineer USD 100K-150K

Access Management | Adversarial Machine Learning | Authorization | Cloud Security | Cryptography

Career growth | Equal opportunity employer | Remote work

Senior-level Full Time

United States - Remote R

4d ago
AI Security Engineer USD 100K-150K

Access Control | Access Management | Authorization | Cloud Security | Cryptography

Senior-level Full Time

United States - Remote R

4d ago
Staff Data Scientist USD 195K-265K

A/B Testing | AutoML | Convolutional Neural Network

Senior-level Full Time

Remote - USA R

5d ago
AI/ML Engineer II USD 159K-211K

API Design | AWS | Agent Orchestration | Agent systems | Azure

Health benefits | Onsite collaboration | Paid time off | Professional development

Mid-level Full Time

Remote, USA R

5d ago
AI/ML Engineer USD 150K-211K

AWS | Agent systems | Cloud platform | Data Pipelines | Docker

Onsite schedule | WFH Friday

Entry-level Full Time

Remote, USA R

5d ago
AI Security Engineer USD 100K-150K

Access Management | Adversarial Machine Learning | Application Security | Authorization | Cloud Security

Senior-level Full Time

United States - Remote R

6d ago
AI Security Engineer USD 100K-150K

Access Management | Adversarial Machine Learning | Application Security | Cloud Security | Cryptography

Senior-level Full Time

United States - Remote R

6d ago
Sr. Director, Analyst, CIO & AI Leader Group – Cybersecurity & Emerging Technologies, Enterprise Risk - Remote, US USD 172K-202K

Artificial Intelligence | Blockchain | CCPA | CIS Controls | Cloud Security

Flexible work environment | Mentoring and coaching | Professional development | Remote work | Travel up to 25 percent

Senior-level Full Time

Remote - Texas, United States R

6d ago
Sr. AI/LLM Threat Researcher, Agentic Systems - AI Detection and Response (Hybrid) USD 140K-215K

Agent Orchestration | Attention Mechanisms | Guardrails | Language Processing | Machine Learning

Employee networks | Employee volunteer opportunities | Paid adoption leave | Paid parental leave | Paid time off

Senior-level Full Time

Sunnyvale, United States R

6d ago
Cyber Data Scientist USD 119K-189K

Artificial Intelligence | Data Transformation | Data Visualization | Data virtualization | Database Management Systems

Senior-level Full Time

Remote (United States) R

9d ago
Senior AI Security Architect USD 117K-161K

AI RMF | Artificial Intelligence | Cloud Security | Cloud Security Architecture | Cloud infrastructure

Senior-level Full Time

Work at Home - Kentucky, United … R

10d ago
Staff AI Security Engineer USD 208K-251K

AI Security | Access Management | Adversarial Testing | Audit Logging | CI/CD

401k match | Child care support | Donation matching | FSA | Fertility care support

Senior-level Full Time

Seattle, WA (hybrid) R

10d ago
Sr. AI Scientist - AI Detection and Response (AIDR) (Hybrid) USD 140K-215K

AI Agents | AWS | Agentic AI | CUDA | Deep learning

Competitive vacation and holidays | Comprehensive wellness programs | Employee networks | Great Place to Work certified | Paid adoption leave

Senior-level Full Time

Austin, United States R

12d ago
Director, AI & Security Development USD 210K-214K

API Development | Advanced Analytics | Amazon Web Services | Artificial Intelligence | Automation

401k | Dental insurance | Disability insurance | Employee stock purchase plan | Enhanced Advocacy Services

Executive-level Full Time

Remote - USA, United States R

12d ago