AI research scientist post training

Salary:

$240,000 - 300,000 - Per Annum

Locations:

san francisco, san francisco, - None Specified -

Type:

Permanent

Published:

July 21, 2025

Contact:

Jack Marsh

Ref:

17520

Required Skills:

Python

Share this job

Apply

Job title: Research AI scientist

Job type: Perm

Salary: $250,000- $300,00

Role Location: San fran

The company:

Role and responsibilities:

Job requirements:

A rapidly growing AI research group is seeking exceptional Research Scientists to advance the field of large language model (LLM) evaluation and understanding. This team is focused on building the next generation of evaluation frameworks, interpretability tools, and alignment techniques that will enable more transparent and responsible AI development.

In this role, you will conduct original research to probe LLM reasoning, comprehension, and emergent behaviors. You’ll help design methodologies and benchmarks that push the boundaries of how we measure model intelligence, reliability, and alignment with human values.

Key Responsibilities

Lead cutting-edge research in LLM evaluation, model understanding, and human-AI knowledge interaction
Design novel experiments and probing techniques to assess model behavior, limitations, and emergent properties
Develop scalable frameworks and tools for large-scale model assessment
Collaborate with cross-functional teams to implement and iterate on experimental insights
Write clean, maintainable code to support experimentation, data collection, and automated evaluation pipelines
Contribute to publications and presentations that advance understanding in the AI research community
Support the development of standards for responsible model assessment and interpretability

Requirements

PhD or equivalent research experience in machine learning, AI, cognitive science, computer science, or a related discipline
Deep knowledge of large language models, model evaluation, interpretability, or post-training assessment techniques
Proven track record of independently driving research projects from hypothesis to validation
Strong programming skills in Python, with hands-on experience using PyTorch or similar frameworks
Experience designing experiments, building benchmarks, or analyzing complex model behavior
Excellent communication skills with the ability to convey deep technical insights clearly

Benefits:

Accessibility Statement:

Read and apply for this role in the way that works for you by using our Recite Me assistive technology tool. Click the circle at the bottom right side of the screen and select your preferences.

We make an active choice to be inclusive towards everyone every day.? Please let us know if you require any accessibility adjustments through the application or interview process.

Our Commitment to Diversity, Equity, and Inclusion:

Signify’s mission is to empower every person, regardless of their background or?circumstances, with an equitable chance to achieve the careers?they deserve. Building a diverse future, one placement at a?time. Check out our DE&I page here