JetBrains18 дней назад

Senior Research Engineer (Code World Models)

Amsterdam

Обязанности

  • 01Design and run pre-training, continued pre-training, and mid-training experiments for code models
  • 02Build and improve data pipelines for large-scale model training, including filtering, deduplication, mixture design, and dataset quality checks
  • 03Work with code corpora, repositories, tests, execution traces, and synthetic data
  • 04Develop evaluations for complex repository-level code reasoning tasks
  • 05Collaborate with researchers and engineers working on ML for code and AI developer tools

Требования

  • 01Have hands-on experience with model pre-training, continued training, or mid-training
  • 02Have strong engineering skills in Python and experience with modern ML frameworks
  • 03Understand large-scale ML training workflows, including data processing, distributed training, checkpointing, evaluation, experiment tracking, and debugging
  • 04Have experience working with large datasets and care about data quality, contamination, sampling, and reproducibility
  • 05Have a background in NLP, ML for software engineering, or a similar domain
  • 06Enjoy working on research problems with high uncertainty and turning ideas into working experiments

Условия

  • 01Equal opportunity employer
  • 02Open and inclusive workplace
  • 03Processing of job application data according to Recruitment Privacy Policy