Baseten11.05.2026
Solution Architect (AI/LLM Inference)
Полная занятостьУдалёнка
Обязанности
- 01Partner with Sales on customer discovery calls (most often second calls, occasionally first calls for large accounts).
- 02Lead demos and technical scoping to align on success criteria, architecture, and deployment approach.
- 03Own benchmarking and repeatable deployments, including handling standard deployment patterns and configurations across many modalities – LLMs, embeddings, image and video generation, VoiceAI, etc.
- 04Advising on tradeoffs like H100s vs B200s and latency-optimized vs throughput-optimized setups.
- 05Driving consistent "playbook" style deployments for common models and use cases.
- 06Become a power user of different runtimes such as vllm, sglang, and TRT-LMM and all the common configurations and tradeoffs between them
- 07Scoping POCs and keeping stakeholders aligned on timeline, deliverables, and next steps.
- 08Acting as the "ringleader" or project manager for POCs.
- 09Pulling in Forward Deployed Engineering (FDE) support when deeper or more complex technical work is needed.
Требования
- 01AI/ML background and the ability to credibly discuss AI/ML topics with technical stakeholders.
- 02Strong customer-facing communication skills, including the ability to run structured discovery and clarify ambiguous requirements.
- 03Technical depth to scope solutions, without needing to write production code.
- 04Ability to script and prototype as needed, including comfort "vibe coding" to move quickly in technical workflows.
Условия
- 01Competitive compensation, including meaningful equity.
- 02100% coverage of medical, dental, and vision insurance for employee and dependents
- 03Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- 04Paid parental leave
- 05Fertility and family-building stipend through Carrot
- 06Company-facilitated 401(k)
- 07Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.