2 links tagged with all of: computer-vision + 3d-reconstruction
Click any tag below to further narrow down your results
Links
Meta has released SAM 3 and SAM 3D, new image segmentation models that enhance object recognition and enable 3D reconstruction of images. SAM 3 allows users to edit images through detailed text prompts, while SAM 3D can rebuild objects and people in 3D. Both models aim to improve creative applications and user interactions in various digital environments.
InteractVLM is a new method for estimating 3D contact points on human bodies and objects from single images, addressing challenges like occlusions and depth ambiguities. It combines Vision-Language Models and a Render-Localize-Lift module to enhance 3D reconstruction and introduces a Semantic Human Contact estimation task for improved interaction modeling. The approach outperforms existing methods and is scalable due to its reliance on limited 3D contact data.
computer-vision ✓
3d-reconstruction ✓
+ human-object-interaction
+ vision-language-models
+ contact-estimation