ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

Kurauchi Kanya, Kanji Tanaka

Auto-TLDR; Active Visual Place Recognition using Deep Convolutional Neural Network

Abstract Slides Poster

This paper addresses the problem of active visual place recognition (VPR) from a novel perspective of long-term autonomy. In our approach, a next-best-view (NBV) planner plans an optimal action-observation-sequence to maximize the expected cost-performance for a visual route classification task. A difficulty arises from the fact that the NBV planner is trained and tested in different domains (times of day, weather conditions, and seasons). Existing NBV methods may be confused and deteriorated by the domain-shifts, and require significant efforts for adapting them to a new domain. We address this issue by a novel deep convolutional neural network (DNN) -based NBV planner that does not require the adaptation. Our main contributions in this paper are summarized as follows: (1) We present a novel domain-invariant NBV planner that is specifically tailored for DNN-based VPR. (2) We formulate the active VPR as a POMDP problem and present a feasible solution to address the inherent intractability. Specifically, the probability distribution vector (PDV) output by the available DNN is used as a domain-invariant observation model without the need to retrain it. (3) We verify efficacy of the proposed approach through challenging cross-season VPR experiments, where it is confirmed that the proposed approach clearly outperforms the previous single-view-based or multi-view-based VPR in terms of VPR accuracy and/or action-observation-cost.

Similar papers

Object-Oriented Map Exploration and Construction Based on Auxiliary Task Aided DRL

Junzhe Xu, Jianhua Zhang, Shengyong Chen, Honghai Liu

Auto-TLDR; Auxiliary Task Aided Deep Reinforcement Learning for Environment Exploration by Autonomous Robots

Abstract Similar

Environment exploration by autonomous robots through deep reinforcement learning (DRL) based methods has attracted more and more attention. However, existing methods usually focus on robot navigation to single or multiple fixed goals, while ignoring the perception and construction of external environments. In this paper, we propose a novel environment exploration task based on DRL, which requires a robot fast and completely perceives all objects of interest, and reconstructs their poses in a global environment map, as much as the robot can do. To this end, we design an auxiliary task aided DRL model, which is integrated with the auxiliary object detection and 6-DoF pose estimation components. The outcome of auxiliary tasks can improve the learning speed and robustness of DRL, as well as the accuracy of object pose estimation. Comprehensive experimental results on the indoor simulation platform AI2-THOR have shown the effectiveness and robustness of our method.

Towards life-long mapping of dynamic environments using temporal persistence modeling

Georgios Tsamis, Ioannis Kostavelis, Dimitrios Giakoumis, Dimitrios Tzovaras

Auto-TLDR; Lifelong Mapping for Mobile Robot Navigation in Dynamic Environments

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

Similar papers

Object-Oriented Map Exploration and Construction Based on Auxiliary Task Aided DRL

Towards life-long mapping of dynamic environments using temporal persistence modeling

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Map-Based Temporally Consistent Geolocalization through Learning Motion Trajectories

On Embodied Visual Navigation in Real Environments through Habitat

RISEdb: A Novel Indoor Localization Dataset

Low Dimensional State Representation Learning with Reward-Shaped Priors

Do We Really Need Scene-Specific Pose Encoders?

Loop-closure detection by LiDAR scan re-identification

Benchmarking Cameras for OpenVSLAM Indoors

Multiple Future Prediction Leveraging Synthetic Trajectories

Developing Motion Code Embedding for Action Recognition in Videos

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze

Single View Learning in Action Recognition

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Real-Time End-To-End Lane ID Estimation Using Recurrent Networks

Explore and Explain: Self-Supervised Navigation and Recounting

Weight Estimation from an RGB-D Camera in Top-View Configuration

Trajectory Representation Learning for Multi-Task NMRDP Planning

Vacant Parking Space Detection Based on Task Consistency and Reinforcement Learning

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning without a Supercomputer

Deep Reinforcement Learning for Autonomous Driving by Transferring Visual Features

Shape Consistent 2D Keypoint Estimation under Domain Shift

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Learning Dictionaries of Kinematic Primitives for Action Classification

Vehicle Lane Merge Visual Benchmark

Improving Robotic Grasping on Monocular Images Via Multi-Task Learning and Positional Loss

Attention Based Coupled Framework for Road and Pothole Segmentation

Supervised Domain Adaptation Using Graph Embedding

Localization of Unmanned Aerial Vehicles in Corridor Environments Using Deep Learning

Can You Trust Your Pose? Confidence Estimation in Visual Localization

Learning to Segment Dynamic Objects Using SLAM Outliers

Surface Material Dataset for Robotics Applications (SMDRA): A Dataset with Friction Coefficient and RGB-D for Surface Segmentation

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Object Segmentation Tracking from Generic Video Cues

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Generic Merging of Structure from Motion Maps with a Low Memory Footprint

What and How? Jointly Forecasting Human Action and Pose

Transformer Networks for Trajectory Forecasting

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings

AV-SLAM: Autonomous Vehicle SLAM with Gravity Direction Initialization

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

The Effect of Multi-Step Methods on Overestimation in Deep Reinforcement Learning

Can Reinforcement Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs

AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features

A Simple Domain Shifting Network for Generating Low Quality Images