Research Engineer, Machine Learning (Horizons)
As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic / open-ended tasks. Representative projects:
- Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models.
- Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks.
- Design and run experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics
- 5+ years of industry-related experience
- Are proficient in Python and have experience with deep learning frameworks such as PyTorch or Jax
- Have a strong software engineering background and are interested in working closely with researchers and other engineers
- Enjoy pair programming (we love to pair!)
- Care about code quality, testing, and performance
- Are passionate about the potential impact of AI and are committed to developing safe and beneficial systems
- Have a strong background in machine learning, reinforcement learning, or high performance computing
- Have experience with virtualization and sandboxed code execution environments
- Have experience with Kubernetes
- Have contributed to open-source projects or published research papers in relevant fields
- Formal certifications or education credentials
- Experience with LLMs or machine learning research before
How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process
Recommended Jobs
Administrator
Job summary Grove medical centre is looking for a receptionist/administrator to join their busy practice team. The role will be full-time working between the hours of 8am - 6:30pm. The hours will …
Financial Accountant Renewable Energy
Your new company A high growth business with strong financial backers, going from strength to strength in the renewable market. The company are ideally looking to hire a qualified professional with …
Cash Supervisor - Newcastle
Location: Eldon Square, Newcastle Type of contract: permanent, full time 40h At Sephora, beauty is about feeling seen, valued, and empowered, individually and collectively. It is connectin…
EMEA Rates Exotics Product Control - Vice President
Job Description Are you ready to make a significant impact in the world of finance? As a Product Control Vice President, you'll play a crucial role in ensuring financial accuracy and supporting tr…
Project Manager (M&E)
Project Manager Our client is a growing mechanical contractor based in London, delivering full design and build solutions across a wide range of mechanical services. Due to ongoing expansion, th…
Studio Coordinator
About Charlotte Tilbury Beauty Founded by British makeup artist and beauty entrepreneur Charlotte Tilbury MBE in 2013, Charlotte Tilbury Beauty has revolutionised the face of the global beauty ind…
SRE and App Management Engineer
Work Location : London, United Kingdom Hours: 35 Line of Business: Technology Solutions Pay Details: We're committed to providing fair and equitable compensation to all our collea…
IT Field Support Engineer / Onsite Second Line Technician
Onsite IT Field Support Engineer who has good IT troubleshooting and user facing skills is required to provide onsite deskside technical support for a multi award-winning Managed Service Provider. SAL…
Trustee - Safeguarding Lead
Join our Board of Directors Trustee Lead for Safeguarding required for an innovative, best-practice charity Help to make a real difference to the lives of women and girls by joining the board o…
VP Sales
GK8 is seeking an experienced and strategic-minded Vice President of Sales to lead our sales department. This role is critical in driving our business growth, expanding market reach, and establishing…