Scene parsing, or recognizing and segmenting objects and stuff in an image, is one of the key problems in computer vision. Despite the community’s efforts in data collection, there are still few image datasets covering a wide range of scenes and object categories with dense and detailed annotations for scene parsing. Here we provide a pixel-wise annotated dataset called ADE20K for scene parsing. It provides precise concept annotations in hierarchical levels, which allows closer look for semantic scene understanding.
If you would like to contact us about our work, please scroll down to the people section and click on one of the group leads' people pages, where you can reach out to them directly.