AI Safety Specialist (AI Engineering)
San Francisco Bay Area, USA
USD 141K-202K (estimate) Mid-level Full Time
Tasks
- Assist with RLHF alignment pipelines
- Conduct adversarial testing for LLMs and multimodal agents
- Develop constitutional AI principles
- Identify edge cases and failure modes
- Implement guardrails and real-time filtering for autonomous tool use
- Perform automated red teaming
Perks/Benefits
- N/A
Skills/Tech-stack
Adversarial Machine Learning | Automated Red Teaming | Cybersecurity | Guardrails | Human Feedback | Jailbreak Taxonomies | LLM Safety | Machine Learning | Prompt Engineering | Real-Time Filtering | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback
Education
N/A