Uni4D is a framework that uses multiple pretrained vision models to understand dynamic scenes from casual videos. It performs dynamic 3D reconstruction and camera pose estimation.

Uni4D is modular: any component can be swapped for the output of another visual foundation model. To use custom video depth estimates and dynamic masks, save them in