H
Humanoidvia Google Jobs
Reinforcement Learning (RL) Engineer, Manipulation
Boston, MAPosted 3mo ago
RoboticsMid Level#python#pytorch#jax#reinforcement-learning#ray
Not sure if you're a good fit?
Upload your resume and TixelJobs AI will compare it against Reinforcement Learning (RL) Engineer, Manipulation at Humanoid. Get a match score, missing keywords, and improvement tips before you apply.
Free preview · Your resume stays private
About the Role
Humanoid is the first AI and robotics company in the UK, creating advanced humanoid robots. They are seeking a Reinforcement Learning Engineer to train manipulation policies using reinforcement learning and collaborate on real-world applications.
Responsibilities
• Train language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world
• Construct challenging and diverse suites of manipulation tasks in simulation
• Partner with teleoperations to collect trajectories in simulation for behavior cloning
• Partner with testing and operations to establish real-world RL training pipelines
• Experiment with various ways of bringing policies trained in simulation to the real world
Skills
• 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it
• Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference
• Experience solving real problems using reinforcement learning with deep neural networks in any domain
• Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code
• You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply
• Experience with simulators for robotics (Isaac Sim, MuJoCo etc.)
• Experience in RL for robotics
• Experience building infrastructure for large-scale RL (e.g. using ray)
• Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions
• Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks
Benefits
• Participation in our Stock Option Plan
• Paid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave days
• Travel opportunities to our Vancouver and Boston offices
• Office perks: free breakfasts, lunches, snacks, and regular team events
• Freedom to influence the product and own key initiatives
• Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics
• Startup culture prioritising speed, transparency, and minimal bureaucracy
Company Overview
• Humanoid is the first AI and robotics company in the UK creating the world’s leading, commercially scalable, and safe humanoid robots It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is https://thehumanoid.ai/.
Responsibilities
• Train language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world
• Construct challenging and diverse suites of manipulation tasks in simulation
• Partner with teleoperations to collect trajectories in simulation for behavior cloning
• Partner with testing and operations to establish real-world RL training pipelines
• Experiment with various ways of bringing policies trained in simulation to the real world
Skills
• 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it
• Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference
• Experience solving real problems using reinforcement learning with deep neural networks in any domain
• Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code
• You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply
• Experience with simulators for robotics (Isaac Sim, MuJoCo etc.)
• Experience in RL for robotics
• Experience building infrastructure for large-scale RL (e.g. using ray)
• Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions
• Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks
Benefits
• Participation in our Stock Option Plan
• Paid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave days
• Travel opportunities to our Vancouver and Boston offices
• Office perks: free breakfasts, lunches, snacks, and regular team events
• Freedom to influence the product and own key initiatives
• Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics
• Startup culture prioritising speed, transparency, and minimal bureaucracy
Company Overview
• Humanoid is the first AI and robotics company in the UK creating the world’s leading, commercially scalable, and safe humanoid robots It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is https://thehumanoid.ai/.
Ready to apply?
This job is active. Apply now to get in early.