ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Multimodal End-To-End Learning for Autonomous Steering in Adverse Road and Weather Conditions

Jyri Sakari Maanpää, Josef Taher, Petri Manninen, Leo Pakola, Iaroslav Melekhov, Juha Hyyppä

Auto-TLDR; End-to-End Learning for Autonomous Steering in Adverse Road and Weather Conditions with Lidar Data

Abstract Slides Poster

Autonomous driving is challenging in adverse road and weather conditions in which there might not be lane lines, the road might be covered in snow and the visibility might be poor. We extend the previous work on end-to-end learning for autonomous steering to operate in these adverse real-life conditions with multimodal data. We collected 28 hours of driving data in several road and weather conditions and trained convolutional neural networks to predict the car steering wheel angle from front-facing color camera images and lidar range and reflectance data. We compared the CNN model performances based on the different modalities and our results show that the lidar modality improves the performances of different multimodal sensor-fusion models. We also performed on-road tests with different models and they support this observation.

Similar papers

Holistic Grid Fusion Based Stop Line Estimation

Runsheng Xu, Faezeh Tafazzoli, Li Zhang, Timo Rehfeld, Gunther Krehl, Arunava Seal

Auto-TLDR; Fused Multi-Sensory Data for Stop Lines Detection in Intersection Scenarios

Abstract Slides Similar

Intersection scenarios provide the most complex traffic situations in Autonomous Driving and Driving Assistance Systems. Knowing where to stop in advance in an intersection is an essential parameter in controlling the longitudinal velocity of the vehicle. Most of the existing methods in literature solely use cameras to detect stop lines, which is typically not sufficient in terms of detection range. To address this issue, we propose a method that takes advantage of fused multi-sensory data including stereo camera and lidar as input and utilizes a carefully designed convolutional neural network architecture to detect stop lines. Our experiments show that the proposed approach can improve detection range compared to camera data alone, works under heavy occlusion without observing the ground markings explicitly, is able to predict stop lines for all lanes and allows detection at a distance up to 50 meters.

PolyLaneNet: Lane Estimation Via Deep Polynomial Regression

Talles Torres, Rodrigo Berriel, Thiago Paixão, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

Auto-TLDR; Real-Time Lane Detection with Deep Polynomial Regression

Multimodal End-To-End Learning for Autonomous Steering in Adverse Road and Weather Conditions

Similar papers

Holistic Grid Fusion Based Stop Line Estimation

PolyLaneNet: Lane Estimation Via Deep Polynomial Regression

Vehicle Lane Merge Visual Benchmark

CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations

Real-Time End-To-End Lane ID Estimation Using Recurrent Networks

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Ghost Target Detection in 3D Radar Data Using Point Cloud Based Deep Neural Network

Attention Based Coupled Framework for Road and Pothole Segmentation

RISEdb: A Novel Indoor Localization Dataset

Multiple Future Prediction Leveraging Synthetic Trajectories

RONELD: Robust Neural Network Output Enhancement for Active Lane Detection

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Street-Map Based Validation of Semantic Segmentation in Autonomous Driving

Visual Prediction of Driver Behavior in Shared Road Areas

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Surface Material Dataset for Robotics Applications (SMDRA): A Dataset with Friction Coefficient and RGB-D for Surface Segmentation

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Anomaly Detection, Localization and Classification for Railway Inspection

Wireless Localisation in WiFi Using Novel Deep Architectures

Weight Estimation from an RGB-D Camera in Top-View Configuration

Lane Detection Based on Object Detection and Image-To-Image Translation

Real-Time Drone Detection and Tracking with Visible, Thermal and Acoustic Sensors

Improving Robotic Grasping on Monocular Images Via Multi-Task Learning and Positional Loss

Polarimetric Image Augmentation

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Localization of Unmanned Aerial Vehicles in Corridor Environments Using Deep Learning

RWF-2000: An Open Large Scale Video Database for Violence Detection

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

Loop-closure detection by LiDAR scan re-identification

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

Planar 3D Transfer Learning for End to End Unimodal MRI Unbalanced Data Segmentation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Location Prediction in Real Homes of Older Adults based on K-Means in Low-Resolution Depth Videos

A Two-Step Approach to Lidar-Camera Calibration

RefiNet: 3D Human Pose Refinement with Depth Maps

Benchmarking Cameras for OpenVSLAM Indoors

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Map-Based Temporally Consistent Geolocalization through Learning Motion Trajectories

Sensor-Independent Pedestrian Detection for Personal Mobility Vehicles in Walking Space Using Dataset Generated by Simulation

Uncertainty-Sensitive Activity Recognition: A Reliability Benchmark and the CARING Models

Spatial Bias in Vision-Based Voice Activity Detection

Removing Raindrops from a Single Image Using Synthetic Data