Member of Technical Staff, AI - Reinforcement Systems

Microsoft
London
Help build the world's most advanced reinforcement learning systems at Microsoft AI.

We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you'll help advance state-of-the-art model capabilities by contributing to core systems, infrastructure, and research.

We are looking for distributed systems experts with a scientific mindset. The ideal candidate will be able to build complex systems from the ground up, discover and diagnose causes of suboptimal performance, and contribute to solving scientific and research challenges. Specifically, they should:
  • Excel in programming (especially parallel/concurrent), software engineering, and API design
  • Have experience in large-scale systems, preferably having built some components from scratch.
  • Thrive in a highly collaborative, fast-paced environment
  • Have a high degree of craftsmanship and pay close attention to details
  • Effectively manage multiple responsibilities and can adjust to shifting priorities
  • Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users
A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics, competitive programming, and related domains are a plus.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities
  • Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs
  • Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments.
  • Embody our culture of collaboration, innovation, and excellence.
Qualifications

Required Qualifications:
  • Bachelor's Degree in Computer Science, Software Engineering, Computer Engineering, Machine Learning, Mathematics, or related STEM fields and experience in coding in languages including, but not limited to, C, C++, C#, Rust, Java, or Python
  • Experience with large-scale software systems and infrastructure.
  • Demonstrated interest in reinforcement learning, language modelling, generative modelling, or related domains.
  • Ability to work collaboratively in a fast-paced, innovative environment.
Preferred Qualifications:
  • Background in machine learning research.
  • Experience with large scale distributed AI systems.
#copilot #microsoftAI
Posted 2025-07-15

Recommended Jobs

Senior Events Project Manager

RCR
London

Great opportunity to join a leading charity with a focus on climate change who are looking for a Senior Events Project Manager to join their growing team. We are looking for someone who can inspire ch…

View Details
Posted 2025-07-10

Quantity Surveyor

Lorclon Ltd
London

Intermediate Quantity Surveyor – Lorclon Ltd &##128205; Location: London and West London Projects &##128188; Job Type: Full-Time &##128183; Salary: Competitive, based on experience Join Lorcl…

View Details
Posted 2025-07-01

Commercial Director - Maternity Cover

Heatherwick Studio
London

The Studio is seeking an experienced and strategically-minded Commercial Director to join our London Studio. The role combined commercial acumen with strategic foresight and confident leadership, …

View Details
Posted 2025-05-31

White Male Aged 30-60 and is Between 5'8-5'10 Tall Required for Body Double Role 25th, 29th or 30th July. Paid

Talent Talks Ltd
London

We are looking for a white male aged 30-60 and is between 5'8-5'10 tall for a body double role taking place on 25th, 29th or 30th July.  You may be needed for one or more dates so please only apply i…

View Details
Posted 2025-07-09

Principal

STRAT7
London

Incite is an award-winning strategic research and planning consultancy. We unlock opportunity by combining inspiring insight with commercial acumen. We are part of Strat7 – a global tech-enabled stra…

View Details
Posted 2025-07-04

R&D Tax Incentives Manager - Software

BDO
London

We’re BDO. An accountancy and business advisory firm, providing the advice and solutions entrepreneurial organisations need to navigate today’s changing world. We work with the companies that are …

View Details
Posted 2025-06-26

Rhino/Revit Specialist

Architecture Social
London

Rhino/Revit Specialist at Prestigious London Practice Step into a world where your design skills make a global impact. An international, award-winning architectural design studio based in London i…

View Details
Posted 2025-06-26

CT Engineer - London (North Circular)

Omega Resource Group
London

Job Title: CT Engineer Location: London - North Circular Pay Range/details: £45,000 per annum + bonus Contract Type: Permanent Omega are supporting a fast-growing technology-based business in th…

View Details
Posted 2025-07-02

Senior Contracts and Procurement Lawyer

Venn Group - London
London

Job Details Locum Senior Contracts and Procurement Lawyer – London Local Authority – Remote working considered – Once weekly office attendance – £55+ umbrella per hour Venn Group is working wit…

View Details
Posted 2025-06-30

Teleradiology Systems Specialist

Telemedicine Clinic
London

Teleradiology Systems Specialist About TMC Telemedicine Clinic (TMC) pioneered teleradiology services in Europe when it was founded in 2002 and has since become a vital partner for more than 120 r…

View Details
Posted 2025-06-04