World Model Research Scientist- Physical AI
Kodiak
About the Role
The role involves designing and training generative world models to synthesize realistic multi-camera video and LiDAR data conditioned on various inputs like ego trajectories, 3D scene context, and text. This includes researching and implementing conditional diffusion architectures and developing techniques for multi-view geometric consistency in generated outputs.
Requirements
Candidates must possess an MS or PhD in a relevant field with a strong focus on generative modeling, neural rendering, or video synthesis, demonstrated through publications or research contributions in areas like diffusion models or world models. Essential technical requirements include proficiency in Python and PyTorch, experience training large generative models at scale, and familiarity with multimodal sensor data and 3D representations.
Full Job Description
Kodiak Robotics, Inc. was founded in 2018 and has become a leader in autonomous ground transportation committed to a safer and more efficient future for all. The company has developed an artificial intelligence (AI) powered technology stack purpose-built for commercial trucking and the public sector. The company delivers freight daily for its customers across the southern United States using its autonomous technology. In 2024, Kodiak became the first known company to publicly announce delivering a driverless semi-truck to a customer. Kodiak is also leveraging its commercial self-driving software to develop, test and deploy autonomous capabilities for the U.S. Department of Defense.
- Design and train generative world models that synthesize realistic multi-camera video and LiDAR conditioned on ego trajectories, 3D scene context, and text
- Research and implement conditional diffusion architectures for driving, including spatiotemporal attention, latent space design, and action-conditioned generation
- Develop techniques for multi-view geometric consistency in generated outputs, drawing on neural rendering, cross-view attention, and 3D-aware generative approaches
- Build methods for joint multimodal generation that maintain cross-sensor consistency between camera, LiDAR, and radar outputs
- Design evaluation frameworks that measure world model quality beyond pixel-level metrics, including scenario fidelity and autoregressive stability
- Scale training pipelines to learn from thousands of hours of real-world driving data across multiple sensor modalities
- MS or PhD in Computer Science, AI, Robotics, or a related field, with a focus on generative modeling, neural rendering, or video synthesis
- Strong publication record or demonstrated research contributions in diffusion models, video generation, neural radiance fields, 3D-aware generative models, or world models
- Experience with neural rendering and view synthesis and an understanding of multi-view geometric consistency
- Proficiency working with multimodal sensor data (camera, LiDAR, radar) and familiarity with 3D representations such as BEV grids, voxel fields, or tri-planes
- Strong implementation skills in Python and PyTorch, with experience training large generative models at scale using distributed training
- Passion for building AI that understands and predicts the physical world to enable safe autonomous driving
What We Offer:
- Competitive compensation package including equity and annual bonuses
- Excellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)
- MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance, & Critical Illness Insurance
- Flexible PTO, 10 paid holidays, and generous parental leave policies
- Our office is centrally located in Mountain View, CA
- Office perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV charging
- Long Term Disability, Short Term Disability, Life Insurance
- Wellbeing Benefits - Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation)
- Fidelity 401(k)
- Commuter, FSA, Dependent Care FSA, HSA
- Various incentive programs (referral bonuses, patent bonuses, etc.)
The pay range listed below reflects the base salary in our SF/Silicon Valley location, across several internal levels. Actual starting pay will be based on job-related factors including: work location, experience, relevant training, education, skill level and performance during interview. Total compensation at Kodiak includes base pay, equity, bonus and a competitive benefits package
AI Resume Tailoring
Generate a resume tailored to this job's requirements based on your uploaded resume.
Compensation
Base Salary (from JD)
$180,000 – $240,000
AI Est. Total Comp
$335,000
Details
Location
Mountain View
Work Type
On-site
Seniority
technical lead
Experience
5-10 years
Category
ml ai
Quality Score
7.2