ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Stroke Based Posterior Attention for Online Handwritten Mathematical Expression Recognition

Changjie Wu, Qing Wang, Jianshu Zhang, Jun Du, Jiaming Wang, Jiajia Wu, Jin-Shui Hu

Auto-TLDR; Posterior Attention for Online Handwritten Mathematical Expression Recognition

Abstract Slides Poster

Recently, many researches propose to employ attention based encoder-decoder models to convert a sequence of trajectory points into a LaTeX string for online handwritten mathematical expression recognition (OHMER), and the recognition performance of these models critically relies on the accuracy of the attention. In this paper, unlike previous methods which basically employ a soft attention model, we propose to employ a posterior attention model, which modifies the attention probabilities after observing the output probabilities generated by the soft attention model. In order to further improve the posterior attention mechanism, we propose a stroke average pooling layer to aggregate point-level features obtained from the encoder into stroke-level features. We argue that posterior attention is better to be implemented on stroke-level features than point-level features as the output probabilities generated by stroke is more convincing than generated by point, and we prove that through experimental analysis. Validated on the CROHME competition task, we demonstrate that stroke based posterior attention achieves expression recognition rates of 54.26% on CROHME 2014 and 51.75% on CROHME 2016. According to attention visualization analysis, we empirically demonstrate that the posterior attention mechanism can achieve better alignment accuracy than the soft attention mechanism.

Similar papers

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang

Auto-TLDR; Convolutional Sequence Modeling for Mathematical Expressions Recognition

Stroke Based Posterior Attention for Online Handwritten Mathematical Expression Recognition

Similar papers

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

Global Context-Based Network with Transformer for Image2latex

Recursive Recognition of Offline Handwritten Mathematical Expressions

A Transformer-Based Radical Analysis Network for Chinese Character Recognition

Online Trajectory Recovery from Offline Handwritten Japanese Kanji Characters of Multiple Strokes

Gaussian Constrained Attention Network for Scene Text Recognition

LODENet: A Holistic Approach to Offline Handwritten Chinese and Japanese Text Line Recognition

Multi-Task Learning Based Traditional Mongolian Words Recognition

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes

Generation of Hypergraphs from the N-Best Parsing of 2D-Probabilistic Context-Free Grammars for Mathematical Expression Recognition

Enhancing Handwritten Text Recognition with N-Gram Sequencedecomposition and Multitask Learning

Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions

Radical Counter Network for Robust Chinese Character Recognition

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

MEAN: A Multi-Element Attention Based Network for Scene Text Recognition

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

Robust Lexicon-Free Confidence Prediction for Text Recognition

Sample-Aware Data Augmentor for Scene Text Recognition

Context Visual Information-Based Deliberation Network for Video Captioning

Weakly Supervised Attention Rectification for Scene Text Recognition

Switching Dynamical Systems with Deep Neural Networks

2D License Plate Recognition based on Automatic Perspective Rectification

A Multi-Head Self-Relation Network for Scene Text Recognition

Trajectory-User Link with Attention Recurrent Networks

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Sketch-SNet: Deeper Subdivision of Temporal Cues for Sketch Recognition

Context Matters: Self-Attention for Sign Language Recognition

PIN: A Novel Parallel Interactive Network for Spoken Language Understanding

Cut and Compare: End-To-End Offline Signature Verification Network

MA-LSTM: A Multi-Attention Based LSTM for Complex Pattern Extraction

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

Text Recognition in Real Scenarios with a Few Labeled Samples

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

Moto: Enhancing Embedding with Multiple Joint Factors for Chinese Text Classification

Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning

Tackling Contradiction Detection in German Using Machine Translation and End-To-End Recurrent Neural Networks

Writer Identification Using Deep Neural Networks: Impact of Patch Size and Number of Patches

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

Text Baseline Recognition Using a Recurrent Convolutional Neural Network

AG-GAN: An Attentive Group-Aware GAN for Pedestrian Trajectory Prediction

The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction

Two-Stream Temporal Convolutional Network for Dynamic Facial Attractiveness Prediction

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

Continuous Sign Language Recognition with Iterative Spatiotemporal Fine-Tuning

Attentive Visual Semantic Specialized Network for Video Captioning