ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Derivation of Geometrically and Semantically Annotated UAV Datasets at Large Scales from 3D City Models

Sidi Wu, Lukas Liebel, Marco Körner

Auto-TLDR; Large-Scale Dataset of Synthetic UAV Imagery for Geometric and Semantic Annotation

While in high demand for the development of deep learning approaches, extensive datasets of annotated UAV imagery are still scarce today. Manual annotation is time-consuming and has thus limited the potential for creating large-scale datasets. We tackle this challenge by presenting a procedure for automatically creating simulated UAV image sequences of urban areas, together with pixel-level annotations, from publicly available data sources. We synthesize photo-realistic UAV imagery with Google Earth Studio and derive annotations from an open CityGML model that provides not only geometric but also semantic information. The first dataset we created as an example of our approach contains 144,000 images of Berlin, Germany, with four types of annotations: semantic labels, depth, surface normals, and edge maps. In the future, we will provide a complete pipeline that addresses the remaining technical problems, together with more accurate models that refine some of the current empirical settings, in order to automatically generate a large-scale dataset with reliable ground-truth annotations covering the whole city of Berlin. The dataset and the source code will be published at that time. Different methods will also be applied to test the usability of the dataset. We believe our dataset can be used for tasks including, but not limited to, pose estimation, geo-localization, monocular depth estimation, edge detection, building/surface classification, and plane segmentation. A potential research pipeline for geo-localization based on the synthetic dataset is provided.
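To illustrate how three of the annotation types named in the abstract (depth, surface normals, edge maps) relate to one another geometrically, the following is a minimal sketch and not the authors' pipeline. The paper derives its annotations from a CityGML model. This sketch instead assumes that a depth map and pinhole camera intrinsics are already available. The function names, the intrinsics `fx`, `fy`, `cx`, `cy`, and the edge `threshold` are all hypothetical.

```python
import numpy as np


def backproject(depth, fx, fy, cx, cy):
    """Back-project a depth map (H, W) to camera-space points (H, W, 3).

    Assumes a pinhole camera; fx, fy, cx, cy are hypothetical intrinsics.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1)


def normals_from_depth(depth, fx, fy, cx, cy):
    """Estimate unit surface normals (H, W, 3) from a depth map."""
    pts = backproject(depth, fx, fy, cx, cy)
    # Finite differences of the 3D points along image columns and rows.
    du = np.gradient(pts, axis=1)
    dv = np.gradient(pts, axis=0)
    # Cross product ordered so that normals point toward the camera (-z).
    n = np.cross(dv, du)
    length = np.linalg.norm(n, axis=-1, keepdims=True)
    return n / np.clip(length, 1e-12, None)


def depth_edges(depth, threshold):
    """Mark pixels whose depth gradient magnitude exceeds a threshold."""
    gy, gx = np.gradient(depth)
    return np.hypot(gx, gy) > threshold
```

For a fronto-parallel plane at constant depth, `normals_from_depth` returns normals pointing straight back at the camera, and `depth_edges` flags only the pixels adjacent to a depth discontinuity. Annotations rendered directly from a 3D city model, as in the paper, avoid the noise that this kind of finite-difference estimate introduces at object boundaries.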

Similar papers

RISEdb: A Novel Indoor Localization Dataset

Carlos Sanchez Belenguer, Erik Wolfart, Álvaro Casado Coscollá, Vitor Sequeira

Auto-TLDR; Indoor Localization Using LiDAR SLAM and Smartphones: A Benchmarking Dataset

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Partially Supervised Multi-Task Network for Single-View Dietary Assessment

Learning Non-Rigid Surface Reconstruction from Spatio-Temporal Image Patches

Automatically Gather Address Specific Dwelling Images Using Google Street View

Deep Realistic Novel View Generation for City-Scale Aerial Images

EAGLE: Large-Scale Vehicle Detection Dataset in Real-World Scenarios Using Aerial Imagery

Multiple Future Prediction Leveraging Synthetic Trajectories

Vehicle Lane Merge Visual Benchmark

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Polarimetric Image Augmentation

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Street-Map Based Validation of Semantic Segmentation in Autonomous Driving

Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and Visual Geometry

CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Calibration and Absolute Pose Estimation of Trinocular Linear Camera Array for Smart City Applications

Unconstrained Vision Guided UAV Based Safe Helicopter Landing

Quantization in Relative Gradient Angle Domain for Building Polygon Estimation

Self-Supervised Detection and Pose Estimation of Logistical Objects in 3D Sensor Data

Distortion-Adaptive Grape Bunch Counting for Omnidirectional Images

Edge-Aware Monocular Dense Depth Estimation with Morphology

RescueNet: Joint Building Segmentation and Damage Assessment from Satellite Imagery

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

Benchmarking Cameras for OpenVSLAM Indoors

Anomaly Detection, Localization and Classification for Railway Inspection

Aerial Road Segmentation in the Presence of Topological Label Noise

The DeepScoresV2 Dataset and Benchmark for Music Object Detection

Towards Efficient 3D Point Cloud Scene Completion Via Novel Depth View Synthesis

Attention Based Coupled Framework for Road and Pothole Segmentation

Future Urban Scenes Generation through Vehicles Synthesis

HPERL: 3D Human Pose Estimation from RGB and LiDAR

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

SAILenv: Learning in Virtual Visual Environments Made Simple

One Step Clustering Based on A-Contrario Framework for Detection of Alterations in Historical Violins

OmniFlowNet: A Perspective Neural Network Adaptation for Optical Flow Estimation in Omnidirectional Images

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

5D Light Field Synthesis from a Monocular Video

Minimal Solvers for Indoor UAV Positioning

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

Object Detection on Monocular Images with Two-Dimensional Canonical Correlation Analysis

Occlusion-Tolerant and Personalized 3D Human Pose Estimation in RGB Images

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal