Jobs requiring Reinforcement Learning
20 matching live roles 路 312 total open in this vertical
Senior Machine Learning Engineer - Model Evaluations, Public Sector
Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026
Machine Learning Research Scientist, Post-Training
Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026
Research Engineer, Performance RL
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
ML/Research Engineer, Safeguards
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer, Machine Learning (Reinforcement Learning)
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer, Knowledge Team
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
GenAI Strategic Projects Lead, Public Sector
Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026
Research Engineer, Pretraining
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Engineering Manager (AI Research & Model Training)
Negotiable
馃懁 Human Full-time
Perplexity 路 Posted Jun 17, 2026
Forward Deployed Engineer, GenAI
Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026
Senior Machine Learning Engineer, Public Sector
Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026
Researcher, Alignment Science
Negotiable
馃懁 Human Full-time
Openai 路 Posted Jun 17, 2026
Member of Technical Staff (AI Researcher)
Negotiable
馃懁 Human Full-time
Perplexity 路 Posted Jun 17, 2026
Staff Research Engineer, Discovery Team
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer/Research Scientist, RL/Reasoning
Negotiable
馃懁 Human Full-time
Openai 路 Posted Jun 17, 2026
Research Engineer/Research Scientist, Pre-training
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer, Machine Learning (Reinforcement Learning)
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer, Discovery
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Lead, Training Insights
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026
Research Engineer/Research Scientist, Audio
Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026