ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Meta Learning Via Learned Loss

Sarah Bechtle, Artem Molchanov, Yevgen Chebotar, Edward Thomas Grefenstette, Ludovic Righetti, Gaurav Sukhatme, Franziska Meier

Auto-TLDR; meta-learning for learning parametric loss functions that generalize across different tasks and model architectures

Abstract Slides

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process, with the view of producing models which train faster and more robustly. Concretely, we present a meta-learning method for learning parametric loss functions that can generalize across different tasks and model architectures. We develop a pipeline for “meta-training” such loss functions, targeted at maximizing the performance of the model trained under them. The loss landscape produced by our learned losses significantly improves upon the original task-specific losses in both supervised and reinforcement learning tasks. Furthermore, we show that our meta-learning framework is flexible enough to incorporate additional information at meta-train time. This information shapes the learned loss function such that the environment does not need to provide this information during meta-test time.

Similar papers

Low Dimensional State Representation Learning with Reward-Shaped Priors

Nicolò Botteghi, Ruben Obbink, Daan Geijs, Mannes Poel, Beril Sirmacek, Christoph Brune, Abeje Mersha, Stefano Stramigioli

Auto-TLDR; Unsupervised Learning for Unsupervised Reinforcement Learning in Robotics

Abstract Poster Similar

Reinforcement Learning has been able to solve many complicated robotics tasks without any need of feature engineering in an end-to-end fashion. However, learning the optimal policy directly from the sensory inputs, i.e the observations, often requires processing and storage of huge amount of data. In the context of robotics, the cost of data from real robotics hardware is usually very high, thus solutions that achieves high sample-efficiency are needed. We propose a method that aims at learning a mapping from the observations into a lower dimensional state space. This mapping is learned with unsupervised learning using loss functions shaped to incorporate prior knowledge of the environment and the task. Using the samples from the state space, the optimal policy is quickly and efficiently learned. We test the method on several mobile robot navigation tasks in simulation environment and also on a real robot.

The Effect of Multi-Step Methods on Overestimation in Deep Reinforcement Learning

Lingheng Meng, Rob Gorbet, Dana Kulić

Auto-TLDR; Multi-Step DDPG for Deep Reinforcement Learning

Meta Learning Via Learned Loss

Similar papers

Low Dimensional State Representation Learning with Reward-Shaped Priors

The Effect of Multi-Step Methods on Overestimation in Deep Reinforcement Learning

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning without a Supercomputer

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Learning from Learners: Adapting Reinforcement Learning Agents to Be Competitive in a Card Game

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Object-Oriented Map Exploration and Construction Based on Auxiliary Task Aided DRL

Trajectory Representation Learning for Multi-Task NMRDP Planning

Detecting and Adapting to Crisis Pattern with Context Based Deep Reinforcement Learning

AOAM: Automatic Optimization of Adjacency Matrix for Graph Convolutional Network

AVD-Net: Attention Value Decomposition Network for Deep Multi-Agent Reinforcement Learning

Can Reinforcement Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs

Vacant Parking Space Detection Based on Task Consistency and Reinforcement Learning

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

Explore and Explain: Self-Supervised Navigation and Recounting

Complementing Representation Deficiency in Few-Shot Image Classification: A Meta-Learning Approach

Deep Reinforcement Learning for Autonomous Driving by Transferring Visual Features

Self-Play or Group Practice: Learning to Play Alternating Markov Game in Multi-Agent System

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Meta Generalized Network for Few-Shot Classification

A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning

Visual Object Tracking in Drone Images with Deep Reinforcement Learning

Meta Soft Label Generation for Noisy Labels

ARCADe: A Rapid Continual Anomaly Detector

Rethinking Experience Replay: A Bag of Tricks for Continual Learning

Improving Robotic Grasping on Monocular Images Via Multi-Task Learning and Positional Loss

E-DNAS: Differentiable Neural Architecture Search for Embedded Systems

On Embodied Visual Navigation in Real Environments through Habitat

Learning with Delayed Feedback

Multiple Future Prediction Leveraging Synthetic Trajectories

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Graph-Based Interpolation of Feature Vectors for Accurate Few-Shot Classification

Transferable Model for Shape Optimization subject to Physical Constraints

Rethinking Domain Generalization Baselines

TAAN: Task-Aware Attention Network for Few-Shot Classification

VOWEL: A Local Online Learning Rule for Recurrent Networks of Probabilistic Spiking Winner-Take-All Circuits

Switching Dynamical Systems with Deep Neural Networks

Task-based Focal Loss for Adversarially Robust Meta-Learning

Separation of Aleatoric and Epistemic Uncertainty in Deterministic Deep Neural Networks

Joint Supervised and Self-Supervised Learning for 3D Real World Challenges

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

Learning Stable Deep Predictive Coding Networks with Weight Norm Supervision

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

PROPEL: Probabilistic Parametric Regression Loss for Convolutional Neural Networks

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

A Close Look at Deep Learning with Small Data

Algorithm Recommendation for Data Streams