AI research scientist post training
Salary:
$240,000 - 300,000 - Per Annum
Locations:
san francisco, san francisco, - None Specified -
Type:
Permanent
Published:
July 21, 2025
Contact:
Jack Marsh
Ref:
17520
Required Skills:
Python
Share this job
Apply

Job title: Research AI scientist 

Job type: Perm

Salary: $250,000- $300,00

Role Location: San fran 

The company: 

Role and responsibilities:

Job requirements:
 

A rapidly growing AI research group is seeking exceptional Research Scientists to advance the field of large language model (LLM) evaluation and understanding. This team is focused on building the next generation of evaluation frameworks, interpretability tools, and alignment techniques that will enable more transparent and responsible AI development.

In this role, you will conduct original research to probe LLM reasoning, comprehension, and emergent behaviors. You’ll help design methodologies and benchmarks that push the boundaries of how we measure model intelligence, reliability, and alignment with human values.


Key Responsibilities

  • Lead cutting-edge research in LLM evaluation, model understanding, and human-AI knowledge interaction

  • Design novel experiments and probing techniques to assess model behavior, limitations, and emergent properties

  • Develop scalable frameworks and tools for large-scale model assessment

  • Collaborate with cross-functional teams to implement and iterate on experimental insights

  • Write clean, maintainable code to support experimentation, data collection, and automated evaluation pipelines

  • Contribute to publications and presentations that advance understanding in the AI research community

  • Support the development of standards for responsible model assessment and interpretability


Requirements

  • PhD or equivalent research experience in machine learning, AI, cognitive science, computer science, or a related discipline

  • Deep knowledge of large language models, model evaluation, interpretability, or post-training assessment techniques

  • Proven track record of independently driving research projects from hypothesis to validation

  • Strong programming skills in Python, with hands-on experience using PyTorch or similar frameworks

  • Experience designing experiments, building benchmarks, or analyzing complex model behavior

  • Excellent communication skills with the ability to convey deep technical insights clearly

Benefits:

 

Accessibility Statement:

Read and apply for this role in the way that works for you by using our Recite Me assistive technology tool. Click the circle at the bottom right side of the screen and select your preferences.

 

We make an active choice to be inclusive towards everyone every day.? Please let us know if you require any accessibility adjustments through the application or interview process.

 

Our Commitment to Diversity, Equity, and Inclusion:

Signify’s mission is to empower every person, regardless of their background or?circumstances, with an equitable chance to achieve the careers?they deserve. Building a diverse future, one placement at a?time. Check out our DE&I page here

Apply