ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Unsupervised deep learning for text line segmentation

Berat Kurar Barakat, Ahmad Droby, Reem Alaasam, Borak Madi, Irina Rabaev, Raed Shammes, Jihad El-Sana

Auto-TLDR; Unsupervised Deep Learning for Handwritten Text Line Segmentation without Annotation

Abstract Poster

We present an unsupervised deep learning method for text line segmentation that is inspired by the relative variance between text lines and spaces among text lines. Handwritten text line segmentation is important for the efficiency of further processing. A common method is to train a deep learning network for embedding the document image into an image of blob lines that are tracing the text lines. Previous methods learned such embedding in a supervised manner, requiring the annotation of many document images. This paper presents an unsupervised embedding of document image patches without a need for annotations. The number of foreground pixels over the text lines is relatively different from the number of foreground pixels over the spaces among text lines. Generating similar and different pairs relying on this principle definitely leads to outliers. However, as the results show, the outliers do not harm the convergence and the network learns to discriminate the text lines from the spaces between text lines. Remarkably, with a challenging Arabic handwritten text line segmentation dataset, VML-AHTE, we achieved superior performance over the supervised methods. Additionally, the proposed method was evaluated on the ICDAR 2017 and ICFHR 2010 handwritten text line segmentation datasets.

Similar papers

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

Olfa Mechi, Maroua Mehri, Rolf Ingold, Najoua Essoukri Ben Amara

Auto-TLDR; Text Line Localization in Ancient Handwritten Arabic Document Images using U-Net and Topological Structural Analysis

Unsupervised deep learning for text line segmentation

Similar papers

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

Text Baseline Recognition Using a Recurrent Convolutional Neural Network

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding

Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations

The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction

A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Writer Identification Using Deep Neural Networks: Impact of Patch Size and Number of Patches

Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions

Generic Document Image Dewarping by Probabilistic Discretization of Vanishing Points

LODENet: A Holistic Approach to Offline Handwritten Chinese and Japanese Text Line Recognition

UDBNET: Unsupervised Document Binarization Network Via Adversarial Game

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

Feature Embedding Based Text Instance Grouping for Largely Spaced and Occluded Text Detection

Multimodal Side-Tuning for Document Classification

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers

Textual-Content Based Classification of Bundles of Untranscribed of Manuscript Images

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

An Accurate Threshold Insensitive Kernel Detector for Arbitrary Shaped Text

Improving Word Recognition Using Multiple Hypotheses and Deep Embeddings

DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents

Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents

Approach for Document Detection by Contours and Contrasts

Multi-Task Learning Based Traditional Mongolian Words Recognition

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Scene Text Detection with Selected Anchors

ID Documents Matching and Localization with Multi-Hypothesis Constraints

Documents Counterfeit Detection through a Deep Learning Approach

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

Recursive Recognition of Offline Handwritten Mathematical Expressions

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Local Gradient Difference Based Mass Features for Classification of 2D-3D Natural Scene Text Images

Mutually Guided Dual-Task Network for Scene Text Detection

Online Trajectory Recovery from Offline Handwritten Japanese Kanji Characters of Multiple Strokes

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Radical Counter Network for Robust Chinese Character Recognition

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

Handwritten Digit String Recognition Using Deep Autoencoder Based Segmentation and ResNet Based Recognition Approach

Recognizing Bengali Word Images - A Zero-Shot Learning Perspective

On-Device Text Image Super Resolution

The DeepScoresV2 Dataset and Benchmark for Music Object Detection

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Cut and Compare: End-To-End Offline Signature Verification Network