Large Model Application Algorithm Research Scientist - International Content Security Algorithm Research - Soaring Star Talent Program
Tasks
- Apply reinforcement learning to natural language tasks
- Design reward models for reinforcement learning
- Develop large language models
- Evaluate model reasoning performance
- Improve reasoning efficiency
- Monitor and mitigate content risks
- Train and stabilize reinforcement learning without supervised fine-tuning
Perks/Benefits
- N/A
Skills/Tech-stack
Chain-of-Thought | Data Compliance | Fine Tuning | Knowledge Distillation | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Monte Carlo | Monte-Carlo Tree Search | Natural Language | Natural Language Processing | Process-based Reward Model | Reinforcement Learning | Reward Model | Supervised Fine Tuning | Tree Search