ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Object-Oriented Map Exploration and Construction Based on Auxiliary Task Aided DRL

Junzhe Xu, Jianhua Zhang, Shengyong Chen, Honghai Liu

Auto-TLDR; Auxiliary Task Aided Deep Reinforcement Learning for Environment Exploration by Autonomous Robots

Abstract

Environment exploration by autonomous robots through deep reinforcement learning (DRL) based methods has attracted growing attention. However, existing methods usually focus on robot navigation to single or multiple fixed goals, while ignoring the perception and construction of external environments. In this paper, we propose a novel environment exploration task based on DRL, which requires a robot to perceive all objects of interest quickly and completely, and to reconstruct their poses in a global environment map, as far as the robot is able. To this end, we design an auxiliary task aided DRL model, which is integrated with auxiliary object detection and 6-DoF pose estimation components. The outputs of the auxiliary tasks can improve the learning speed and robustness of DRL, as well as the accuracy of object pose estimation. Comprehensive experimental results on the indoor simulation platform AI2-THOR show the effectiveness and robustness of our method.

Similar papers

Low Dimensional State Representation Learning with Reward-Shaped Priors

Nicolò Botteghi, Ruben Obbink, Daan Geijs, Mannes Poel, Beril Sirmacek, Christoph Brune, Abeje Mersha, Stefano Stramigioli

Auto-TLDR; Unsupervised Learning for Unsupervised Reinforcement Learning in Robotics

Reinforcement learning has been able to solve many complicated robotics tasks in an end-to-end fashion, without any need for feature engineering. However, learning the optimal policy directly from the sensory inputs, i.e., the observations, often requires processing and storing huge amounts of data. In robotics, the cost of data from real hardware is usually very high, so solutions that achieve high sample efficiency are needed. We propose a method that aims at learning a mapping from the observations into a lower-dimensional state space. This mapping is learned with unsupervised learning using loss functions shaped to incorporate prior knowledge of the environment and the task. Using samples from the state space, the optimal policy is learned quickly and efficiently. We test the method on several mobile robot navigation tasks in a simulation environment and also on a real robot.

AVD-Net: Attention Value Decomposition Network for Deep Multi-Agent Reinforcement Learning

Zhang Yuanxin, Huimin Ma, Yu Wang

Auto-TLDR; Attention Value Decomposition Network for Cooperative Multi-agent Reinforcement Learning

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning without a Supercomputer

Explore and Explain: Self-Supervised Navigation and Recounting

The Effect of Multi-Step Methods on Overestimation in Deep Reinforcement Learning

Learning from Learners: Adapting Reinforcement Learning Agents to Be Competitive in a Card Game

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

On Embodied Visual Navigation in Real Environments through Habitat

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

Deep Reinforcement Learning for Autonomous Driving by Transferring Visual Features

Adaptive Remote Sensing Image Attribute Learning for Active Object Detection

Meta Learning Via Learned Loss

Map-Based Temporally Consistent Geolocalization through Learning Motion Trajectories

Trajectory Representation Learning for Multi-Task NMRDP Planning

Vacant Parking Space Detection Based on Task Consistency and Reinforcement Learning

Detecting and Adapting to Crisis Pattern with Context Based Deep Reinforcement Learning

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Towards Life-Long Mapping of Dynamic Environments Using Temporal Persistence Modeling

Visual Object Tracking in Drone Images with Deep Reinforcement Learning

Self-Play or Group Practice: Learning to Play Alternating Markov Game in Multi-Agent System

SAILenv: Learning in Virtual Visual Environments Made Simple

AOAM: Automatic Optimization of Adjacency Matrix for Graph Convolutional Network

A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

Improving Robotic Grasping on Monocular Images Via Multi-Task Learning and Positional Loss

Benchmarking Cameras for OpenVSLAM Indoors

Localization of Unmanned Aerial Vehicles in Corridor Environments Using Deep Learning

Self-Supervised Detection and Pose Estimation of Logistical Objects in 3D Sensor Data

RISEdb: A Novel Indoor Localization Dataset

Can Reinforcement Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs

Vehicle Lane Merge Visual Benchmark

HPERL: 3D Human Pose Estimation from RGB and LiDAR

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

P2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

Recurrent Deep Attention Network for Person Re-Identification

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

AV-SLAM: Autonomous Vehicle SLAM with Gravity Direction Initialization

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Transformer Networks for Trajectory Forecasting

6D Pose Estimation with Correlation Fusion

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration

Multiple Future Prediction Leveraging Synthetic Trajectories

Loop-Closure Detection by LiDAR Scan Re-Identification