Character.AI06.03.2026

Research Engineer, Multimodal

Полная занятостьУдалёнка

Обязанности

01Lead fine-tuning and continued training of video generation models, including image-to-video and joint audio-visual generation
02Design and experiment with novel model architectures for multimodal generation, including multimodal conditioning (voice, structured text, reference images)
03Leverage techniques such as LoRA, RLHF, and full-parameter fine-tuning to improve model quality across diverse visual scenarios
04Design and build large-scale data pipelines and automated annotation workflows to support continuous model improvement
05Explore model compression, inference acceleration, and serving optimizations to enable efficient real-time video processing at scale

01Strong passion for pushing the boundaries of visual AI, with a self-driven, hands-on approach to solving complex technical problems
02Proficient in PyTorch with end-to-end experience across data processing, model training, and deployment
03Solid understanding of video and image generation architectures, including diffusion models, DiT, ControlNet, and SOTA video generation models
04Experience with multimodal model training, including working with audio, vision, and language modalities together
05Experience with distributed training tools (FSDP, DeepSpeed, etc.)
06Experience with large-scale data processing, dataset construction, and automated data cleaning

01Join a unicorn startup with rapid growth and innovation in AI
02Work on cutting-edge multimodal AI models shaping user experiences for millions
03Collaborate with a diverse and inclusive team in a dynamic environment
04Opportunities to publish research at top-tier conferences (NeurIPS, ICLR, CVPR, etc.)