data
Posted 4 weeks agoMultimodal LLM Researcher (MLLM)
at Pika
Hybrid
Responsibilities
- WHAT YOU’LL DO - Lead and contribute to research efforts focused on real-time, multimodal generation—including text, image, video, and audio synthesis—as well as orchestration of agentic platform infrastructure - Design and prototype novel algorithms and architectures for high-fidelity, real-time multimodal synthesis and interactive experiences - Focus on real-time aspects of model inference and synthesis across modalities - Work on diffusion model distillation and/or develop diffusion-based world models
Requirements
- experience with generative models, including autoregressive and diffusion models, and their real-time deployment - Hands-on
- Join us and shape the next evolution of creative technology! If you are a leading researcher excited by real-time multimodal AI and agentic platforms, we want to hear from you.
Benefits
- Experience developing and deploying real-time systems and/or agentic orchestration infrastructure - Strong programming and prototyping skills (Python, PyTorch, TensorFlow, etc.) - Passion for building creative tools and platforms that empower users - Excellent communication and collaboration skills WHAT WE OFFER - Competitive salary and substantial equity in a high-growth startup - Full health
- benefits + 401k matching and more - Collaborative, mission-driven team environment with major growth opportunities - Flexible on-site/remote hybrid (HQ in Palo Alto, CA) ABOUT PIKA Pika empowers creators by building state-of-the-art agentic and multimedia platforms.
- Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all.
Additional details
- ABOUT THE ROLE At Pika, we are pioneering next-generation creative infrastructure built around real-time, multimodal generation and intelligent, agentic platforms.
- We are seeking accomplished Multimodal LLM Researchers (LLM, VLM, and Audio LM) to drive forward our mission to make agentic real-time generative technology accessible, dynamic, and transformative for millions of creators.