JetBrains18 дней назад
Senior Research Engineer (Code World Models)
Amsterdam
Обязанности
- 01Design and run pre-training, continued pre-training, and mid-training experiments for code models
- 02Build and improve data pipelines for large-scale model training, including filtering, deduplication, mixture design, and dataset quality checks
- 03Work with code corpora, repositories, tests, execution traces, and synthetic data
- 04Develop evaluations for complex repository-level code reasoning tasks
- 05Collaborate with researchers and engineers working on ML for code and AI developer tools
Требования
- 01Have hands-on experience with model pre-training, continued training, or mid-training
- 02Have strong engineering skills in Python and experience with modern ML frameworks
- 03Understand large-scale ML training workflows, including data processing, distributed training, checkpointing, evaluation, experiment tracking, and debugging
- 04Have experience working with large datasets and care about data quality, contamination, sampling, and reproducibility
- 05Have a background in NLP, ML for software engineering, or a similar domain
- 06Enjoy working on research problems with high uncertainty and turning ideas into working experiments
Условия
- 01Equal opportunity employer
- 02Open and inclusive workplace
- 03Processing of job application data according to Recruitment Privacy Policy