AI Safety Specialist (AI Engineering)
Tasks
- Assist with RLHF alignment pipelines
- Conduct adversarial testing on LLMs
- Develop constitutional AI principles
- Identify edge cases
- Implement guardrails and real-time filtering
Perks/Benefits
- N/A
Skills/Tech-stack
Adversarial Machine Learning | Automated Red Teaming | Cybersecurity | Human Feedback | Jailbreak Taxonomies | Language Models | Large Language Models | Learning from Human Feedback | Machine Learning | Multimodal Agents | Prompt Engineering | Real-Time Filtering | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback
Education
N/A