ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Surface Material Dataset for Robotics Applications (SMDRA): A Dataset with Friction Coefficient and RGB-D for Surface Segmentation

Donghun Noh, Hyunwoo Nam, Min Sung Ahn, Hosik Chae, Sangjoon Lee, Kyle Gillespie, Dennis Hong

Auto-TLDR; A Surface Material Dataset for Robotics Applications

Abstract Slides Poster

In this paper, we introduce the Surface Material Dataset for Robotics Applications (SMDRA), a collection of RGB color image, depth data, and pixel-wise friction coefficient data of 10 different materials for computer vision research specifically with robotics applications in mind that require physical contact between the robot and its environment such as robotic manipulators or walking robots. These selected surface materials are both easily accessible around our daily lives and cover a wide range of friction coefficients. Our dataset is unique in that while there is an abundance of RGB-D data due to the popularization of imaging sensors, additional pixel-wise aligned data of a different modality are not readily available. The depth data is collected by an active stereo camera which has shown promise on a variety of different robotic applications. In addition, this dataset is greatly expanded with friction coefficient data. Similarly to humans, this additional information can be helpful in ensuing proper decision making in tasks ranging from grasping orientation and strength to path determination in an unstructured environment. A newly developed friction measuring device was used to obtain this data. We verify that existing Convolutional Neural Network (CNN) architectures, the Fully Convolutional Network (FCN) and U-Net, can be trained on the SMDRA. This result demonstrates that the SMDRA can be utilized to train a neural network model for segmentation and these different modes are not just additional information, but valuable modes that researchers can incorporate and exploit when applying computer vision algorithms on robotic platforms.

Similar papers

Weight Estimation from an RGB-D Camera in Top-View Configuration

Marco Mameli, Marina Paolanti, Nicola Conci, Filippo Tessaro, Emanuele Frontoni, Primo Zingaretti

Auto-TLDR; Top-View Weight Estimation using Deep Neural Networks

Surface Material Dataset for Robotics Applications (SMDRA): A Dataset with Friction Coefficient and RGB-D for Surface Segmentation

Similar papers

Weight Estimation from an RGB-D Camera in Top-View Configuration

Improving Robotic Grasping on Monocular Images Via Multi-Task Learning and Positional Loss

Polarimetric Image Augmentation

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Benchmarking Cameras for OpenVSLAM Indoors

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

SAILenv: Learning in Virtual Visual Environments Made Simple

Multimodal End-To-End Learning for Autonomous Steering in Adverse Road and Weather Conditions

RISEdb: A Novel Indoor Localization Dataset

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Localization of Unmanned Aerial Vehicles in Corridor Environments Using Deep Learning

Surface IR Reflectance Estimation and Material Recognition Using ToF Camera

6D Pose Estimation with Correlation Fusion

Developing Motion Code Embedding for Action Recognition in Videos

Attention Based Coupled Framework for Road and Pothole Segmentation

Holistic Grid Fusion Based Stop Line Estimation

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Hyperspectral Imaging for Analysis and Classification of Plastic Waste

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Human Segmentation with Dynamic LiDAR Data

A Lumen Segmentation Method in Ureteroscopy Images Based on a Deep Residual U-Net Architecture

Quantifying the Use of Domain Randomization

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

On Embodied Visual Navigation in Real Environments through Habitat

Early Wildfire Smoke Detection in Videos

Anticipating Activity from Multimodal Signals

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Real-Time End-To-End Lane ID Estimation Using Recurrent Networks

Video Analytics Gait Trend Measurement for Fall Prevention and Health Monitoring

IPN Hand: A Video Dataset and Benchmark for Real-Time Continuous Hand Gesture Recognition

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Deep Next-Best-View Planner for Cross-Season Visual Route Classification

Towards life-long mapping of dynamic environments using temporal persistence modeling

Deep Photo Relighting by Integrating Both 2D and 3D Lighting Information

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

DE-Net: Dilated Encoder Network for Automated Tongue Segmentation

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

RefiNet: 3D Human Pose Refinement with Depth Maps

Learning to Segment Dynamic Objects Using SLAM Outliers

Incorporating Depth Information into Few-Shot Semantic Segmentation

Wireless Localisation in WiFi Using Novel Deep Architectures

What and How? Jointly Forecasting Human Action and Pose

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Multiple Future Prediction Leveraging Synthetic Trajectories

Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration