ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Textual-Content Based Classification of Bundles of Untranscribed Manuscript Images

José Ramón Prieto Fontcuberta, Enrique Vidal, Vicente Bosch, Carlos Alonso, Carmen Orcero, Lourdes Márquez

Auto-TLDR; Probabilistic Indexing for Text-based Classification of Manuscripts

Content-based classification of manuscripts is an important task that is generally performed in archives and libraries by experts with a wealth of knowledge of the manuscripts' contents. Unfortunately, many manuscript collections are so vast that it is not feasible to rely solely on experts to perform this task. Current approaches to textual-content-based manuscript classification generally require the handwritten images to be transcribed into text first, but achieving sufficiently accurate transcripts is generally infeasible for large sets of historical manuscripts. We propose a new approach to performing this classification task automatically, one that does not rely on any explicit image transcripts. It is based on "probabilistic indexing", a relatively novel technology that makes it possible to represent effectively the intrinsic word-level uncertainty generally exhibited by handwritten text images. We assess the performance of this approach on a large collection of complex manuscripts from the Spanish Archivo General de Indias, with promising results.
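
As a rough illustration of how word-level uncertainty might feed a classifier without a transcript, the sketch below sums the probabilities of word hypotheses into expected word counts. It is only an assumption about how such an index could be used, not the authors' implementation: the `(word, probability)` entry format, the function name `expected_word_counts`, and the sample data are all hypothetical.

```python
from collections import defaultdict

def expected_word_counts(spots):
    """Sum the probabilities of each word hypothesis to get expected counts.

    `spots` is an assumed format: an iterable of (word, probability) pairs,
    one per word hypothesis found in an image.
    """
    counts = defaultdict(float)
    for word, prob in spots:
        counts[word.lower()] += prob  # case-fold so variants share one entry
    return dict(counts)

# Hypothetical index entries for one page image (made-up values).
page_spots = [("indias", 0.9), ("Indias", 0.6), ("carta", 0.25), ("casta", 0.5)]
features = expected_word_counts(page_spots)
```

Competing hypotheses such as "carta" and "casta" both contribute, each weighted by its probability, so no single transcription has to be chosen. The resulting dictionary could then serve as a bag-of-words style feature vector for any standard text classifier.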

Similar papers

Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations

Lorenzo Quirós, Enrique Vidal

Auto-TLDR; Automatic Reading Order of Text Lines in Handwritten Text Documents

Recent advances in Handwritten Text Recognition and Document Layout Analysis make it possible to extract information from digitized documents and make it accessible beyond the archive shelves. However, determining the reading order of the elements in those documents is still an open problem, and it must be solved in order to present that information with the correct structure. Most studies on the reading order task use rule-based approaches that focus on printed documents, while less attention has been paid to handwritten text documents. In this work we propose a new approach to automatically determine the reading order of text lines in handwritten text documents. The task is approached as a sorting problem where the order-relation operator is learned directly from examples. We demonstrate the effectiveness of our method on three different datasets.
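
The idea of sorting with a pairwise order relation can be sketched as follows. This is a minimal illustration under stated assumptions, not the method from the paper: `precedes_prob` is a hypothetical hand-written stand-in for a learned model, the line records and their `pos` coordinates are made up, and ranking each line by how many others it is estimated to precede is just one simple way to turn pairwise estimates into an order.

```python
def precedes_prob(a, b):
    """Hypothetical stand-in for a learned model: P(line a is read before line b).

    Here it is a fixed rule (top-to-bottom, then left-to-right) used only
    so that the example runs; a trained estimator would replace it.
    """
    (ax, ay), (bx, by) = a["pos"], b["pos"]
    if abs(ay - by) > 10:          # clearly on different rows
        return 1.0 if ay < by else 0.0
    return 1.0 if ax < bx else 0.0  # same row: leftmost first

def reading_order(lines):
    """Order lines by the total estimated probability of preceding the others."""
    def score(a):
        return sum(precedes_prob(a, b) for b in lines if b is not a)
    return sorted(lines, key=score, reverse=True)

# Made-up text lines with (x, y) positions of their starting points.
lines = [
    {"id": "l3", "pos": (50, 200)},
    {"id": "l1", "pos": (50, 100)},
    {"id": "l2", "pos": (300, 104)},
]
ordered_ids = [line["id"] for line in reading_order(lines)]
```

Scoring by summed pairwise estimates avoids relying on the relation being perfectly transitive, which a model learned from examples may not guarantee.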

The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction

Verónica Romero, Joan Andreu Sánchez

Auto-TLDR; Automatic Handwritten Text Recognition and Information Extraction from Historical Weather Logs

Multimodal Side-Tuning for Document Classification

Writer Identification Using Deep Neural Networks: Impact of Patch Size and Number of Patches

Improving Word Recognition Using Multiple Hypotheses and Deep Embeddings

Generation of Hypergraphs from the N-Best Parsing of 2D-Probabilistic Context-Free Grammars for Mathematical Expression Recognition

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions

Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents

Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding

Recognizing Bengali Word Images - A Zero-Shot Learning Perspective

Multi-Task Learning Based Traditional Mongolian Words Recognition

Chebyshev-Harmonic-Fourier-Moments and Deep CNNs for Detecting Forged Handwriting

An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers

Combining Deep and Ad-Hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

Unsupervised Deep Learning for Text Line Segmentation

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

LODENet: A Holistic Approach to Offline Handwritten Chinese and Japanese Text Line Recognition

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

Learning Neural Textual Representations for Citation Recommendation

ID Documents Matching and Localization with Multi-Hypothesis Constraints

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

Recursive Recognition of Offline Handwritten Mathematical Expressions

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Text Baseline Recognition Using a Recurrent Convolutional Neural Network

Generic Document Image Dewarping by Probabilistic Discretization of Vanishing Points

Leveraging Quadratic Spherical Mutual Information Hashing for Fast Image Retrieval

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping

Local Gradient Difference Based Mass Features for Classification of 2D-3D Natural Scene Text Images

Exploiting the Logits: Joint Sign Language Recognition and Spell-Correction

A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata

Enhancing Handwritten Text Recognition with N-Gram Sequence Decomposition and Multitask Learning

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Information Graphic Summarization Using a Collection of Multimodal Deep Neural Networks

Automatic Annotation of Corpora for Emotion Recognition through Facial Expressions Analysis

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

Assessing the Severity of Health States Based on Social Media Posts

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

UDBNET: Unsupervised Document Binarization Network Via Adversarial Game

Force Banner for the Recognition of Spatial Relations

Deep Convolutional Embedding for Digitized Painting Clustering

To Honor Our Heroes: Analysis of the Obituaries of Australians Killed in Action in WWI and WWII

Label Incorporated Graph Neural Networks for Text Classification