ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Madhav Agarwal, Ajoy Mondal, C. V. Jawahar

Auto-TLDR; CDeC-Net: An End-to-End Trainable Deep Network for Detecting Tables in Document Images

Abstract Slides

Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU threshold. We empirically evaluate CDeC-Net on all the publicly available benchmark datasets— ICDAR-2013, ICDAR-2017, ICDAR-2019, UNLV, Marmot, PubLayNet, TableBank, and IIIT-AR-13K —with extensive experiments. Our solution has three important properties:(i) a single trained model CDeC-Net‡ performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models will be publicly released for enabling reproducibility of the results.

Similar papers

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Mengshi Zhang, Daniel Perelman, Vu Le, Sumit Gulwani

Auto-TLDR; Deep Learning and Symbolic Reasoning for Unstructured PDF Table Extraction

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Similar papers

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

SyNet: An Ensemble Network for Object Detection in UAV Images

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

Scene Text Detection with Selected Anchors

Detecting Objects with High Object Region Percentage

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

A Novel Region of Interest Extraction Layer for Instance Segmentation

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

Convolutional STN for Weakly Supervised Object Localization

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

SFPN: Semantic Feature Pyramid Network for Object Detection

Forground-Guided Vehicle Perception Framework

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Bidirectional Matrix Feature Pyramid Network for Object Detection

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

Tiny Object Detection in Aerial Images

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Object Detection in the DCT Domain: Is Luminance the Solution?

Small Object Detection by Generative and Discriminative Learning

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

Detective: An Attentive Recurrent Model for Sparse Object Detection

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text

EAGLE: Large-Scale Vehicle Detection Dataset in Real-World Scenarios Using Aerial Imagery

Iterative Bounding Box Annotation for Object Detection

Hierarchical Head Design for Object Detectors

TGCRBNW: A Dataset for Runner Bib Number Detection (and Recognition) in the Wild

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers

A Fast and Accurate Object Detector for Handwritten Digit String Recognition

Which Airline Is This? Airline Logo Detection in Real-World Weather Conditions

Multiple-Step Sampling for Dense Object Detection and Counting

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

Deep Real-Time Hand Detection Using CFPN on Embedded Systems

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

Stratified Multi-Task Learning for Robust Spotting of Scene Texts

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

Automatically Gather Address Specific Dwelling Images Using Google Street View

Object Detection on Monocular Images with Two-Dimensional Canonical Correlation Analysis

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection