ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Approach for Document Detection by Contours and Contrasts

Daniil Tropin, Sergey Ilyuhin, Dmitry Nikolaev, Vladimir V. Arlazarov

Auto-TLDR; A contour-based method for arbitrary document detection on a mobile device

This paper considers the task of arbitrary document detection performed on a mobile device. The classical contour-based approach often mishandles cases with occlusion, complex background, or blur. The region-based approach, which relies on the contrast between object and background, does not have these limitations, but its known implementations are highly resource-consuming. We propose a modification of a contour-based method in which the competing hypotheses of the contour location are ranked according to the contrast between the areas inside and outside the border. In our experiments, this modification leads to a 40% decrease in alternative-ordering errors and a 10% decrease in the overall number of detection errors. We improve on the state-of-the-art performance on the open MIDV-500 dataset and demonstrate results competitive with the state of the art on the SmartDoc dataset.
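
The ranking idea in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes axis-aligned rectangular hypotheses rather than arbitrary quadrilaterals, a grayscale image stored as nested lists, and a contrast score defined as the absolute difference of mean intensity in thin bands just inside and just outside the border. The names `band_contrast` and `rank_hypotheses` and the `width` parameter are hypothetical.

```python
from statistics import mean


def band_contrast(image, rect, width=2):
    """Contrast score for one hypothesis (illustrative definition, an assumption).

    image: list of rows of grayscale values.
    rect:  (top, left, bottom, right), inclusive pixel coordinates.
    Returns |mean(inner band) - mean(outer band)|, where each band is
    `width` pixels thick and hugs the rectangle border.
    """
    top, left, bottom, right = rect
    rows, cols = len(image), len(image[0])
    inside, outside = [], []
    for y in range(max(0, top - width), min(rows, bottom + width + 1)):
        for x in range(max(0, left - width), min(cols, right + width + 1)):
            in_rect = top <= y <= bottom and left <= x <= right
            # Pixels deeper than `width` inside the rectangle are ignored.
            in_core = (top + width <= y <= bottom - width
                       and left + width <= x <= right - width)
            if in_rect and not in_core:
                inside.append(image[y][x])
            elif not in_rect:
                outside.append(image[y][x])
    if not inside or not outside:
        return 0.0
    return abs(mean(inside) - mean(outside))


def rank_hypotheses(image, hypotheses, width=2):
    """Order competing border hypotheses from highest to lowest contrast."""
    return sorted(hypotheses,
                  key=lambda r: band_contrast(image, r, width),
                  reverse=True)


# Synthetic example: a bright "document" on a dark background.
image = [[200 if 5 <= y <= 14 and 4 <= x <= 15 else 10 for x in range(20)]
         for y in range(20)]
hypotheses = [
    (2, 1, 17, 18),   # too large: both bands fall on the background
    (5, 8, 14, 19),   # shifted to the right: bands mix document and background
    (5, 4, 14, 15),   # matches the true document border
]
ranked = rank_hypotheses(image, hypotheses)
```

In this toy setup the hypothesis that matches the true border comes first in `ranked`, because its inner band lies entirely on the document and its outer band entirely on the background. A practical detector would generate the hypotheses from detected contours and would need a contrast measure robust to texture and lighting.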

Similar papers

ID Documents Matching and Localization with Multi-Hypothesis Constraints

Guillaume Chiron, Nabil Ghanmi, Ahmad Montaser Awal

Auto-TLDR; Identity Document Localization in the Wild Using Multi-hypothesis Exploration

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

Generic Document Image Dewarping by Probabilistic Discretization of Vanishing Points

Text Baseline Recognition Using a Recurrent Convolutional Neural Network

A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

One Step Clustering Based on A-Contrario Framework for Detection of Alterations in Historical Violins

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers

Learning to Segment Clustered Amoeboid Cells from Brightfield Microscopy Via Multi-Task Learning with Adaptive Weight Selection

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

Unsupervised deep learning for text line segmentation

Learning Defects in Old Movies from Manually Assisted Restoration

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

Quantization in Relative Gradient Angle Domain for Building Polygon Estimation

Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations

Documents Counterfeit Detection through a Deep Learning Approach

The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction

Walk the Lines: Object Contour Tracing CNN for Contour Completion of Ships

Early Wildfire Smoke Detection in Videos

Multimodal Side-Tuning for Document Classification

Smart Inference for Multidigit Convolutional Neural Network Based Barcode Decoding

Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding

Writer Identification Using Deep Neural Networks: Impact of Patch Size and Number of Patches

Fusion of Global-Local Features for Image Quality Inspection of Shipping Label

Mobile Augmented Reality: Fast, Precise, and Smooth Planar Object Tracking

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

A Hierarchical Framework for Leaf Instance Segmentation: Application to Plant Phenotyping

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Edge-Aware Monocular Dense Depth Estimation with Morphology

Dynamic Resource-Aware Corner Detection for Bio-Inspired Vision Sensors

A Heuristic-Based Decision Tree for Connected Components Labeling of 3D Volumes

Weight Estimation from an RGB-D Camera in Top-View Configuration

RISEdb: A Novel Indoor Localization Dataset

Attention Based Coupled Framework for Road and Pothole Segmentation

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

TGCRBNW: A Dataset for Runner Bib Number Detection (and Recognition) in the Wild

Fast Implementation of 4-Bit Convolutional Neural Networks for Mobile Devices

Holistic Grid Fusion Based Stop Line Estimation

Textual-Content Based Classification of Bundles of Untranscribed Manuscript Images

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

Unconstrained Vision Guided UAV Based Safe Helicopter Landing

A Lumen Segmentation Method in Ureteroscopy Images Based on a Deep Residual U-Net Architecture

Camera Calibration Using Parallel Line Segments

Recursive Recognition of Offline Handwritten Mathematical Expressions