Segmenting Object Affordances: Reproducibility and Sensitivity to Scale

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Apicella_ECCVW_2024
Publication status:	Accepted
Booktitle:	Proceedings of the European Conference on Computer Vision (ECCV) Workshops
Year:	2024
Abstract:	Visual affordance segmentation identifies image regions of an object an agent can interact with. Existing methods re-use and adapt learning-based architectures for semantic segmentation to the affordance segmentation task and evaluate on small-size datasets. However, experimental setups are often not reproducible, thus leading to unfair and inconsistent comparisons. In this work, we benchmark these methods under a reproducible setup on two single objects scenarios, tabletop without occlusions and hand-held containers, to facilitate future comparisons. We include a version of a recent architecture, Mask2Former, re-trained for affordance segmentation and show that this model is the best-performing on most testing sets of both scenarios. Our analysis shows that models are not robust to scale variations when object resolutions differ from those in the training set.
Keywords:	Affordances, Benchmarking, semantic segmentation
Projects	Idiap
Authors	Apicella, Tommaso Xompero, Alessio Gastaldo, Paolo Cavallaro, Andrea
Added by:	[UNK]
Total mark:	0
Attachments
Apicella_ECCVW_2024.pdf
Notes

processing time: 0.0002 seconds.