The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts; Mohamed Omran; Sebastian Ramos; Timo Rehfeld; Markus Enzweiler; Rodrigo Benenson; Uwe Franke; Stefan Roth; Bernt Schiele

doi:10.1109/cvpr.2016.350

JOURNAL ARTICLE

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts Mohamed Omran Sebastian Ramos Timo Rehfeld Markus Enzweiler Rodrigo Benenson Uwe Franke Stefan Roth Bernt Schiele

Year: 2016 Pages: 3213-3223

DOI: 10.1109/cvpr.2016.350

Get Full-Text PDF Get Analytical Report

Abstract

Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling. Cityscapes is comprised of a large, diverse set of stereo video sequences recorded in streets from 50 different cities. 5000 of these images have high quality pixel-level annotations, 20 000 additional images have coarse annotations to enable methods that leverage large volumes of weakly-labeled data. Crucially, our effort exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity. Our accompanying empirical study provides an in-depth analysis of the dataset characteristics, as well as a performance evaluation of several state-of-the-art approaches based on our benchmark.

Keywords:

Computer science Leverage (statistics) Artificial intelligence Suite Benchmark (surveying) Context (archaeology) Scale (ratio) Set (abstract data type) Pixel Semantics (computer science) Machine learning Computer vision Cartography Geography

Metrics

11339

Cited By

325.17

FWCI (Field Weighted Citation Impact)

101

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

The Cityscapes Dataset for Semantic Urban Scene Understanding

Abstract

Metrics

Citation History

Topics

Related Documents

Understanding Cityscapes: Efficient Urban Semantic Scene Understanding

Urban Aquatic Scene Expansion for Semantic Segmentation in Cityscapes

Cityscapes TL++: Semantic Traffic Light Annotations for the Cityscapes Dataset

RailSem19: A Dataset for Semantic Rail Scene Understanding

The Fieldscapes Dataset for Semantic Field Scene Understanding