AI Safety Specialist (AI Engineering)
Tasks
- Align AI behavior with ethical principles
- Conduct adversarial testing on LLMs
- Develop constitutional AI principles
- Implement guardrails and real time filtering
- Support RLHF alignment pipelines
Perks/Benefits
- N/A
Skills/Tech-stack
Adversarial Machine Learning | Automated Red Teaming | Cybersecurity | Ethical AI | Guardrails | Human Feedback | Jailbreak Taxonomy | Language Models | Large Language Models | Learning from Human Feedback | Machine Learning | Multimodal agents | Prompt engineering | Real Time | Real-time filtering | Red Teaming | Reinforcement Learning | Reinforcement Learning from Human Feedback | Time Filtering
Education
N/A
Roles
AI | AI Engineer | AI Safety Specialist | Engineer | Safety Specialist | Specialist
Related jobs
-
Digital & Intelligent Specialist (Risk Management) HKD 312K-586KAPI Orchestration | Agent Frameworks | Algorithms | Chain-of-Thought | Chain-of-Thought promptingMid-level Full TimeHong Kong3d ago
-
API Integration | Artificial Intelligence | IT Audit | IT Security | Knowledge BaseSenior-level Full TimeHong Kong17d ago
-
AI Security Engineer HKD 112K-162KAPI Integration | Agent Orchestration | Agent systems | Authentication Security | AutomationAnnual leave | Crypto visa card | Extended medical coverage for dependents | Hybrid or remote work | Medical insuranceSenior-level Full TimeHong Kong, Hong Kong SAR23d ago
-
800-53 | Blockchain Security | Code auditing | Cybersecurity | HIPSMid-level Full TimeHong Kong, Hong Kong SAR, Hong …28d ago
-
Application Security | Attack Simulation | Automation | Cloud Security | Cyber SecurityExecutive-level Full TimeHK-TWO ES 7/F, Hong Kong1mo ago
-
Mid-level Full TimeHong Kong, HK, HK1mo ago
-
Analyst - Information Security (Ref: 26000047) HKD 300K-300KAccess Management | Active Directory | Alibaba Cloud | Application Security | AzureCareer development | Training opportunitiesMid-level Full TimeHong Kong1mo ago