ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

DmifNet:3D Shape Reconstruction Based on Dynamic Multi-Branch Information Fusion

Lei Li, Suping Wu

Auto-TLDR; DmifNet: Dynamic Multi-branch Information Fusion Network for 3D Shape Reconstruction from a Single-View Image

3D object reconstruction from a single-view image is a long-standing and challenging problem. Previous methods struggle to accurately reconstruct 3D shapes with complex topology and rich detail at edges and corners. Moreover, previous methods train their networks on synthetic data, so domain adaptation problems occur when they are tested on real data. In this paper, we propose a Dynamic Multi-branch Information Fusion Network (DmifNet) that can recover a high-fidelity 3D shape of arbitrary topology from a 2D image. Specifically, we design several side branches from the intermediate layers so that the network produces more diverse representations, which improves its generalization ability. In addition, we use DoG (Difference of Gaussians) to extract edge geometry and corner information from the input images. We then use a separate side-branch network to process the extracted data and better capture edge geometry and corner features. Finally, we dynamically fuse the information from all branches to obtain the final predicted probability. Extensive qualitative and quantitative experiments on a large-scale publicly available dataset demonstrate the validity and efficiency of our method. Code and models are publicly available at https://github.com/leilimaster/DmifNet.
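
The Difference of Gaussians step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the two sigma values, and the edge-replicating padding are all assumptions chosen for illustration. The idea is only that subtracting a strongly blurred copy of an image from a lightly blurred copy leaves a response concentrated around edges and corners.

```python
import numpy as np

def gaussian_kernel1d(sigma):
    """Return a normalized 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()

def gaussian_blur(image, sigma):
    """Separable Gaussian blur of a 2-D array (edge-replicating padding)."""
    kernel = gaussian_kernel1d(sigma)
    radius = len(kernel) // 2
    padded = np.pad(image, radius, mode="edge")
    # Filter along rows first, then along columns; "valid" trims the padding.
    rows = np.apply_along_axis(
        lambda v: np.convolve(v, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda v: np.convolve(v, kernel, mode="valid"), 0, rows)

def difference_of_gaussians(image, sigma_small=1.0, sigma_large=2.0):
    """Band-pass response: lightly blurred image minus strongly blurred image.

    The sigma values are illustrative assumptions, not values from the paper.
    """
    return gaussian_blur(image, sigma_small) - gaussian_blur(image, sigma_large)

if __name__ == "__main__":
    # Synthetic grayscale image with a single vertical step edge.
    img = np.zeros((32, 32))
    img[:, 16:] = 1.0
    dog = difference_of_gaussians(img)
    # The response is largest in magnitude next to the step.
    print(np.abs(dog).argmax(axis=1)[:3])
```

In flat regions both blurred copies agree and the response is zero, so only the neighbourhood of the step survives. A map of this kind is what a dedicated side branch could consume, under the assumptions above.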

Similar papers

Towards Efficient 3D Point Cloud Scene Completion Via Novel Depth View Synthesis

Haiyan Wang, Liang Yang, Xuejian Rong, Ying-Li Tian

Auto-TLDR; 3D Point Cloud Completion from a Single Depth Map via Novel Depth View Synthesis

3D point cloud completion at scale has been a long-standing challenge, and per-point supervised training strategies suffer from cumbersome annotation requirements. 2D supervision has recently emerged as a promising alternative for 3D tasks, but specific approaches for 3D point cloud completion remain to be explored. To overcome these limitations, we propose an end-to-end method that directly lifts a single depth map to a completed point cloud. With one depth map as input, a multi-way novel depth view synthesis network (NDVNet) is designed to infer coarsely completed depth maps under various viewpoints. Meanwhile, a geometric depth perspective rendering module uses the raw input depth map to generate a re-projected depth map for each view. The two depth maps generated in parallel for each view are then concatenated and refined by a depth completion network (DCNet). The final completed point cloud is fused from all refined depth views. Experimental results demonstrate that the proposed approach, composed of the aforementioned components, produces high-quality, state-of-the-art results on the popular SUNCG benchmark.
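
The final fusion step described above, lifting depth views to a single point cloud, can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's method: it assumes a pinhole camera with intrinsics `fx`, `fy`, `cx`, `cy`, a known 4x4 camera-to-world pose per view, and fusion by plain concatenation. The function names are hypothetical, and the learned networks (NDVNet, DCNet) are not modelled at all.

```python
import numpy as np

def depth_to_points(depth, fx, fy, cx, cy):
    """Back-project an (H, W) depth map into (N, 3) camera-space points.

    Assumes a pinhole camera; pixels with depth <= 0 are treated as missing.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel column / row grids
    valid = depth > 0
    z = depth[valid]
    x = (u[valid] - cx) * z / fx
    y = (v[valid] - cy) * z / fy
    return np.stack([x, y, z], axis=1)

def fuse_views(depth_maps, cam_to_world, fx, fy, cx, cy):
    """Merge several depth views into one world-space point cloud.

    cam_to_world is a list of 4x4 pose matrices, one per depth map
    (an assumed input; how the views are chosen is not covered here).
    """
    clouds = []
    for depth, pose in zip(depth_maps, cam_to_world):
        pts = depth_to_points(depth, fx, fy, cx, cy)
        # Homogeneous coordinates so the pose applies rotation and translation.
        pts_h = np.hstack([pts, np.ones((len(pts), 1))])
        clouds.append((pts_h @ pose.T)[:, :3])
    return np.concatenate(clouds, axis=0)

if __name__ == "__main__":
    depth = np.full((4, 4), 2.0)
    shifted = np.eye(4)
    shifted[0, 3] = 0.5  # second camera translated along x
    cloud = fuse_views([depth, depth], [np.eye(4), shifted], 2.0, 2.0, 1.5, 1.5)
    print(cloud.shape)
```

Concatenation is the simplest possible fusion; a practical pipeline would likely also remove duplicate or inconsistent points where views overlap.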

Learning to Implicitly Represent 3D Human Body from Multi-Scale Features and Multi-View Images

Zhongguo Li, Magnus Oskarsson, Anders Heyden

Auto-TLDR; Reconstruction of 3D human bodies from multi-view images using multi-stage end-to-end neural networks

Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and Visual Geometry

Multi-Attribute Regression Network for Face Reconstruction

Cross-Regional Attention Network for Point Cloud Completion

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

EdgeNet: Semantic Scene Completion from a Single RGB-D Image

A Multi-Task Neural Network for Action Recognition with 3D Key-Points

Learning Interpretable Representation for 3D Point Clouds

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Face Super-Resolution Network with Incremental Enhancement of Facial Parsing Information

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Learning Semantic Representations Via Joint 3D Face Reconstruction and Facial Attribute Estimation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Deep Space Probing for Point Cloud Analysis

Joint Face Alignment and 3D Face Reconstruction with Efficient Convolution Neural Networks

PointSpherical: Deep Shape Context for Point Cloud Learning in Spherical Coordinates

Delivering Meaningful Representation for Monocular Depth Estimation

PEAN: 3D Hand Pose Estimation Adversarial Network

Free-Form Image Inpainting Via Contrastive Attention Network

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

DAPC: Domain Adaptation People Counting Via Style-Level Transfer Learning and Scene-Aware Estimation

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Small Object Detection by Generative and Discriminative Learning

Detail Fusion GAN: High-Quality Translation for Unpaired Images with GAN-Based Data Augmentation

Orthographic Projection Linear Regression for Single Image 3D Human Pose Estimation

Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

Detail-Revealing Deep Low-Dose CT Reconstruction

A Prototype-Based Generalized Zero-Shot Learning Framework for Hand Gesture Recognition

PC-Net: A Deep Network for 3D Point Clouds Analysis

Dynamic Guided Network for Monocular Depth Estimation

FourierNet: Compact Mask Representation for Instance Segmentation Using Differentiable Shape Decoders

Residual Fractal Network for Single Image Super Resolution by Widening and Deepening

Boundary-Aware Graph Convolution for Semantic Segmentation

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Joint Supervised and Self-Supervised Learning for 3D Real World Challenges

Sample-Aware Data Augmentor for Scene Text Recognition

DEN: Disentangling and Exchanging Network for Depth Completion

Manual-Label Free 3D Detection Via an Open-Source Simulator

Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

Shape Consistent 2D Keypoint Estimation under Domain Shift

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

Multi-Resolution Fusion and Multi-Scale Input Priors Based Crowd Counting

End-To-End Multi-Task Learning for Lung Nodule Segmentation and Diagnosis

Dynamic Low-Light Image Enhancement for Object Detection Via End-To-End Training

Wavelet Attention Embedding Networks for Video Super-Resolution