
Department of Brain and Cognitive Sciences (BCS)
Optimizing the Full Stack for Generative Image and Video Models
Description
Abstract: This talk will present a holistic framework for optimizing flow-based image and video generation models beyond simply improving speed. We address their compute-intensive nature by exploring strategies across the full stack, including hardware-aware architecture, use-case-specific fine-tuning, and advanced distillation techniques. The talk argues that effective optimization requires balancing the trade-offs between speed, memory, and performance to align with specific user-facing applications and their requirements.