2 links tagged with all of: computer-vision + depth-estimation
Links
Depth Anything 3 (DA3) is a model for depth estimation and 3D geometry recovery from an arbitrary number of visual inputs, with or without known camera poses. It uses a single transformer backbone and a depth-ray representation, and outperforms previous models in both monocular and multi-view settings. The DA3 series includes specialized models for different depth estimation tasks.
CUPS is a Scene-Centric Unsupervised Panoptic Segmentation method that uses motion and depth cues from stereo pairs to generate high-resolution pseudo-labels for training a monocular panoptic network. It segments complex scenes without annotated data and outperforms existing unsupervised methods, notably on Cityscapes. CUPS also generalizes across multiple datasets while significantly improving panoptic quality.