AI Safety Specialist (AI Engineering)
Tasks
- Assist with RLHF alignment pipelines
- Conduct adversarial testing on LLMs
- Develop constitutional AI principles
- Identify edge cases
- Implement guardrails and real-time filtering
Perks/Benefits
- N/A
Skills/Tech-stack
Adversarial Machine Learning | Automated Red Teaming | Cybersecurity | Jailbreak Taxonomies | Large Language Models | Machine Learning | Multimodal agents | Prompt engineering | Real-time filtering | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback
Education
N/A