# computer-vision → human-object-interaction

1 link tagged with all of: computer-vision + human-object-interaction

Links

InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

InteractVLM is a method for estimating 3D contact points on human bodies and objects from a single image, a task made difficult by occlusions and depth ambiguities. It combines Vision-Language Models with a Render-Localize-Lift module to enhance 3D reconstruction, and it introduces a Semantic Human Contact estimation task for improved interaction modeling. The approach is reported to outperform existing methods, and it scales well because it requires only limited 3D contact data.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

Tags: computer-vision, 3d-reconstruction, human-object-interaction, vision-language-models, contact-estimation