Data Scientist
Job Description: AI Task Evaluation & Statistical Analysis Specialist
Role Overview
We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.).
Key Responsibilities
- Statistical Failure Analysis: Identify patterns in AI agent failures across task components (prompts, rubrics, templates, file types, tags)
- Root Cause Analysis: Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations
- Dimension Analysis: Analyze performance variations across finance sub-domains, file types, and task categories
- Reporting & Visualization: Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities
- Quality Framework: Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings
- Stakeholder Communication: Present insights to data labeling experts and technical teams
Required Qualifications
- Statistical Expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition
- Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis
- Data Analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets
- AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics
- Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL
Preferred Qualifications
- Experience with AI/ML model evaluation or quality assurance
- Background in finance or willingness to learn finance domain concepts
- Experience with multi-dimensional failure analysis
- Familiarity with benchmark datasets and evaluation frameworks
- 2-4 years of relevant experience