Pippo is a generative model designed to create high-resolution dense turnaround videos of individuals from a single casual photograph, utilizing a multi-view diffusion transformer without the need for additional inputs. The codebase includes training configurations for various resolutions, sample training code, and methods for preparing custom datasets. Future updates are planned to enhance the functionality and usability of the model.