Research Scientist / Research Engineer, Pre-training
- Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development
- Independently lead small research projects while collaborating with team members on larger initiatives
- Design, run, and analyze scientific experiments to advance our understanding of large language models
- Optimize and scale our training infrastructure to improve efficiency and reliability
- Develop and improve dev tooling to enhance team productivity
- Contribute to the entire stack, from low-level optimizations to high-level model design
- Advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field
- Strong software engineering skills with a proven track record of building complex systems
- Expertise in Python and experience with deep learning frameworks (PyTorch preferred)
- Familiarity with large-scale machine learning, particularly in the context of language models
- Ability to balance research goals with practical engineering constraints
- Strong problem-solving skills and a results-oriented mindset
- Excellent communication skills and ability to work in a collaborative environment
- Care about the societal impacts of your work
- Work on high-performance, large-scale ML systems
- Familiarity with GPUs, Kubernetes, and OS internals
- Experience with language modeling using transformer architectures
- Knowledge of reinforcement learning techniques
- Background in large-scale ETL processes
- Have significant software engineering experience
- Are results-oriented with a bias towards flexibility and impact
- Willingly take on tasks outside your job description to support the team
- Enjoy pair programming and collaborative work
- Are eager to learn more about machine learning research
- Are enthusiastic to work at an organization that functions as a single, cohesive team pursuing large-scale AI research projects
- Are working to align state of the art models with human values and preferences, understand and interpret deep neural networks, or develop new models to support these areas of research
- View research and engineering as two sides of the same coin, and seek to understand all aspects of our research program as well as possible, to maximize the impact of your insights
- Have ambitious goals for AI safety and general progress in the next few years, and you're working to create the best outcomes over the long-term.
- Optimizing the throughput of novel attention mechanisms
- Comparing compute efficiency of different Transformer variants
- Preparing large-scale datasets for efficient model consumption
- Scaling distributed training jobs to thousands of GPUs
- Designing fault tolerance strategies for our training infrastructure
- Creating interactive visualizations of model internals, such as attention patterns
How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Recommended Jobs
Sales Executive / Account Executive (South West London)
Office Based in Chessington, Surrey. Are you hungry to make money and grow your sales career? A leading broadcast equipment supplier is looking to expand their elite team of sales people. Are you c...
Business Pricing Manager
We are looking for an enthusiastic and collaborative Business Pricing Manager to join our team. The Business Pricing Manager is a recently introduced role within the organisation. The focus of the...
Locum Acute/Inpatients
Globe Locums About Globe Globe Locums, the UK's medical recruitment agency run by clinicians for clinicians have the following Physiotherapy job available: Physiotherapy Job Description S...
Technical Solution Architect-Capgi
Your Role: • Lead and manage software development projects using Java, Python, Typescript, AWS, React JS, Angular, and other technologies. • Collaborate with cross-functional teams to define proj...
Sales Representative, Foot & Ankle - London
Why join Stryker? Are you looking to be part of a motivated, highly visible team with a leader in the medical technology industry? Do you have a passion and a drive for quality? Do you thrive in a ...
Demand Planner
Who are we? Helping you have healthy hair has been our mane goal since 2007. No matter your hair type – straight, curly, fine, fragile, or coily – our range of gentle brushes will leave you with ...
Band 5-6 Speech and Language Therapist
About Us: Words First Ltd, an independent multi-disciplinary practice, is at the forefront of providing integrated SEN services to schools in and around London. With a dedicated team, including 90 S...
(Launching Jan 2026) Toddler Room Leader- Barnes
Kido is a place where innovation and imagination unite to create modern Early Years settings. We’re looking for a Room Leader to join our team at Kido location: PRIESTS BRIDGE, SW14 8TA (Launchi...
Solutions Engineer / Pre-Sales
Job purpose:As a Solutions Engineer, you will play a pivotal role in the success of Becrypt, bridging the gap between technical delivery and customer satisfaction. You will be the key technical point...
Chef
Chef - Whitehall London 37.5 Hours per week £13.85 per hour We're currently recruiting an ambitious Chef to help us create exceptional food experiences for Government Services on a full time b...