Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)
Not sure if you're a good fit?
Upload your resume and TixelJobs AI will compare it against Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d) at autonomous-teaming. Get a match score, missing keywords, and improvement tips before you apply.
Free preview · Your resume stays private
About the Role
Your mission
- Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems)
- Design and implement use-cases for DRL on edge devices
- Translate theory into scalable systems with support from our engineering teams
- Collaborate with simulation, autonomy and AI infrastructure teams
- Develop decision-making for intelligent behavior and architectures
Your profile
- Deep knowledge of RL theory: policy gradients, value iteration, Q-learning, etc.
- Experience with simulation-based learning and probabilistic models
- Python proficiency; strong math/stats foundation
- Publications at NeurIPS, ICLR, ICML, ICRA, IROS, etc. are a plus
- You think rigorously and build practically
Nice to have
- Experience of deploying AI models to real-life systems
Why us?
Join us to shape the future of AI-driven defense!
Do you feel that you fit the description, but don't think you fulfill all the criteria 100%? Apply to us anyway.
We look forward to receiving your detailed application via our online form.
The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.
About us
Who we are: Autonomous Teaming is a defence-tech start-up specializing in machine vision solutions. Driven by cutting-edge innovation, our team works on next-generation technologies designed to meet rapidly evolving security challenges.
What we do: We develop systems that enable computers and sensors to operate as coordinated teams, collaborating in real time to counter AI-powered asymmetric threats at scale — including drone swarms and other UXVs. Our mission is to build resilient, intelligent defence capabilities that perform reliably in the most demanding environments.
How we work: We value close, in-person collaboration as the foundation for building complex, high-impact technology, while maintaining flexibility aligned to role and team needs. Our culture is built on ownership, responsibility, and trust — with a shared commitment to growing and building together.
Where we are: Based in Munich, Berlin, and Toulouse, we are expanding rapidly across Europe with plans to open additional office hubs.
Ready to apply?
This job is active. Apply now to get in early.