ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Mengshi Zhang, Daniel Perelman, Vu Le, Sumit Gulwani

Auto-TLDR; Deep Learning and Symbolic Reasoning for Unstructured PDF Table Extraction

Abstract Slides Poster

Deep learning has shown great success at interpreting unstructured data such as object recognition in images. Symbolic/logical-reasoning techniques have shown great success in interpreting structured data such as table extraction in webpages, custom text files, spreadsheets. The tables in PDF documents are often generated from such structured sources (text-based Word/Latex documents, spreadsheets, webpages) but end up being unstructured. We thus explore novel combinations of deep learning and symbolic reasoning techniques to build an effective solution for PDF table extraction. We evaluate effectiveness without granting partial credit for matching part of a table (which may cause silent errors in downstream data processing). Our method achieves a 0.725 F1 score (vs. 0.339 for the state-of-the-art) on detecting correct table bounds---a much stricter metric than the common one of detecting characters within tables---in a well known public benchmark (ICDAR 2013) and a 0.404 F1 score (vs. 0.144 for the state-of-the-art) on our private benchmark with more widely varied table structures.

Similar papers

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Madhav Agarwal, Ajoy Mondal, C. V. Jawahar

Auto-TLDR; CDeC-Net: An End-to-End Trainable Deep Network for Detecting Tables in Document Images

Abstract Slides Similar

Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU threshold. We empirically evaluate CDeC-Net on all the publicly available benchmark datasets— ICDAR-2013, ICDAR-2017, ICDAR-2019, UNLV, Marmot, PubLayNet, TableBank, and IIIT-AR-13K —with extensive experiments. Our solution has three important properties:(i) a single trained model CDeC-Net‡ performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models will be publicly released for enabling reproducibility of the results.

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Huichen Yang, William Hsu

Auto-TLDR; Transfer Learning for Scientific Literature Layout Detection Using Convolutional Neural Networks

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Similar papers

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

Scene Text Detection with Selected Anchors

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction

An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

Text Baseline Recognition Using a Recurrent Convolutional Neural Network

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

Detecting Marine Species in Echograms Via Traditional, Hybrid, and Deep Learning Frameworks

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text

Detecting Objects with High Object Region Percentage

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

ID Documents Matching and Localization with Multi-Hypothesis Constraints

A Novel Region of Interest Extraction Layer for Instance Segmentation

FeatureNMS: Non-Maximum Suppression by Learning Feature Embeddings

The DeepScoresV2 Dataset and Benchmark for Music Object Detection

SyNet: An Ensemble Network for Object Detection in UAV Images

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Generic Document Image Dewarping by Probabilistic Discretization of Vanishing Points

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

End-To-End Deep Learning Methods for Automated Damage Detection in Extreme Events at Various Scales

Visual Style Extraction from Chart Images for Chart Restyling

RLST: A Reinforcement Learning Approach to Scene Text Detection Refinement

Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images

Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents

MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

A Fast and Accurate Object Detector for Handwritten Digit String Recognition

Iterative Bounding Box Annotation for Object Detection

Documents Counterfeit Detection through a Deep Learning Approach

Approach for Document Detection by Contours and Contrasts

Detective: An Attentive Recurrent Model for Sparse Object Detection

Point In: Counting Trees with Weakly Supervised Segmentation Network

IPT: A Dataset for Identity Preserved Tracking in Closed Domains

Uncertainty Guided Recognition of Tiny Craters on the Moon

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings

Tracking Fast Moving Objects by Segmentation Network

Mutually Guided Dual-Task Network for Scene Text Detection

Unsupervised deep learning for text line segmentation

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

TGCRBNW: A Dataset for Runner Bib Number Detection (and Recognition) in the Wild

Automatically Gather Address Specific Dwelling Images Using Google Street View