ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Trajectory Representation Learning for Multi-Task NMRDP Planning

Firas Jarboui, Vianney Perchet

Auto-TLDR; Exploring Non Markovian Reward Decision Processes for Reinforcement Learning

Expanding Non-Markovian Reward Decision Processes (NMRDPs) into Markov Decision Processes (MDPs) enables the use of state-of-the-art Reinforcement Learning (RL) techniques to identify optimal policies. This paper proposes an approach to exploring NMRDPs and expanding them into MDPs without prior knowledge of the reward structure. The non-Markovianity of the reward function is disentangled under the assumption that sets of similar and dissimilar trajectory batches can be sampled. More precisely, the similarity between any pair of trajectories can be measured within the same batch, but trajectories from different batches cannot be compared. A modified version of the triplet loss is optimised to construct a representation of the trajectories under which rewards become Markovian.
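
As a rough illustration of the idea in the abstract, the following sketch computes the standard triplet loss over trajectory embeddings, forming triplets only within a single batch. It is not the modified loss from the paper, whose details are not given here. The names euclidean, triplet_loss, batch_triplet_loss, and the groups labels (a hypothetical marker of which trajectories in a batch count as similar) are assumptions made for this example, and the embeddings are taken as already computed.

```python
import math
from itertools import permutations


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet loss: max(0, d(anchor, positive) - d(anchor, negative) + margin)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def batch_triplet_loss(embeddings, groups, margin=1.0):
    """Mean triplet loss over all valid triplets inside ONE batch.

    groups[i] is a hypothetical within-batch similarity label (an assumption
    of this sketch): two trajectories are treated as similar when their labels
    match. Triplets are never formed across batches, mirroring the constraint
    that only same-batch trajectories can be compared.
    """
    losses = []
    for a, p, n in permutations(range(len(embeddings)), 3):
        if groups[a] == groups[p] and groups[a] != groups[n]:
            losses.append(triplet_loss(embeddings[a], embeddings[p], embeddings[n], margin))
    # A batch with no valid triplet contributes no loss.
    return sum(losses) / len(losses) if losses else 0.0
```

In a training loop, the embeddings would come from a learnable trajectory encoder and this quantity would be minimised with respect to the encoder parameters; both are left out of this sketch.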

Similar papers

Low Dimensional State Representation Learning with Reward-Shaped Priors

Nicolò Botteghi, Ruben Obbink, Daan Geijs, Mannes Poel, Beril Sirmacek, Christoph Brune, Abeje Mersha, Stefano Stramigioli

Auto-TLDR; Unsupervised Learning for Unsupervised Reinforcement Learning in Robotics

Reinforcement Learning has solved many complicated robotics tasks end to end, without any need for feature engineering. However, learning the optimal policy directly from the sensory inputs, i.e., the observations, often requires processing and storing huge amounts of data. In robotics, the cost of data from real hardware is usually very high, so solutions that achieve high sample efficiency are needed. We propose a method that learns a mapping from the observations into a lower-dimensional state space. This mapping is learned with unsupervised learning, using loss functions shaped to incorporate prior knowledge of the environment and the task. Using samples from the state space, the optimal policy is learned quickly and efficiently. We test the method on several mobile robot navigation tasks in a simulation environment and also on a real robot.

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning without a Supercomputer

Edward Beeching, Jilles Steeve Dibangoye, Olivier Simonin, Christian Wolf

Auto-TLDR; Deep Reinforcement Learning in Mobile Robots Using 3D Environment Scenarios

Low Dimensional State Representation Learning with Reward-Shaped Priors

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning without a Supercomputer

AVD-Net: Attention Value Decomposition Network for Deep Multi-Agent Reinforcement Learning

Can Reinforcement Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs

The Effect of Multi-Step Methods on Overestimation in Deep Reinforcement Learning

Object-Oriented Map Exploration and Construction Based on Auxiliary Task Aided DRL

Meta Learning Via Learned Loss

Detecting and Adapting to Crisis Pattern with Context Based Deep Reinforcement Learning

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Learning from Learners: Adapting Reinforcement Learning Agents to Be Competitive in a Card Game

Deep Reinforcement Learning for Autonomous Driving by Transferring Visual Features

DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting

Vacant Parking Space Detection Based on Task Consistency and Reinforcement Learning

Trajectory-User Link with Attention Recurrent Networks

Multiple Future Prediction Leveraging Synthetic Trajectories

Switching Dynamical Systems with Deep Neural Networks

Recurrent Deep Attention Network for Person Re-Identification

Self-Play or Group Practice: Learning to Play Alternating Markov Game in Multi-Agent System

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Map-Based Temporally Consistent Geolocalization through Learning Motion Trajectories

Explore and Explain: Self-Supervised Navigation and Recounting

AG-GAN: An Attentive Group-Aware GAN for Pedestrian Trajectory Prediction

A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

Road Network Metric Learning for Estimated Time of Arrival

Visual Object Tracking in Drone Images with Deep Reinforcement Learning

AOAM: Automatic Optimization of Adjacency Matrix for Graph Convolutional Network

Transformer Networks for Trajectory Forecasting

On Embodied Visual Navigation in Real Environments through Habitat

Interpolation in Auto Encoders with Bridge Processes

Progressive Learning Algorithm for Efficient Person Re-Identification

ILS-SUMM: Iterated Local Search for Unsupervised Video Summarization

Naturally Constrained Online Expectation Maximization

SAILenv: Learning in Virtual Visual Environments Made Simple

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

Class-Incremental Learning with Pre-Allocated Fixed Classifiers

Aggregating Dependent Gaussian Experts in Local Approximation

Adaptive Remote Sensing Image Attribute Learning for Active Object Detection

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

Deep Top-Rank Counter Metric for Person Re-Identification

Attentive Visual Semantic Specialized Network for Video Captioning

Kernel-based Graph Convolutional Networks

Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

Loop-closure detection by LiDAR scan re-identification

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

SL-DML: Signal Level Deep Metric Learning for Multimodal One-Shot Action Recognition

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm