ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

MA-LSTM: A Multi-Attention Based LSTM for Complex Pattern Extraction

Jingjie Guo, Kelang Tian, Kejiang Ye, Cheng-Zhong Xu

Auto-TLDR; MA-LSTM: Multiple Attention based recurrent neural network for forget gate

Abstract Slides Poster

With the improvement of data, computing powerand algorithms, deep learning has achieved rapid developmentand showing excellent performance. Recently, many deep learn-ing models are proposed to solve the problems in different areas.A recurrent neural network (RNN) is a class of artificial neuralnetworks where connections between nodes form a directedgraph along a temporal sequence. This allows it to exhibittemporal dynamic behavior, which makes it applicable to taskssuch as handwriting recognition or speech recognition. How-ever, the RNN relies heavily on the automatic learning abilityto update parameters which concentrate on the data flow butseldom considers the feature extraction capability of the gatemechanism. In this paper, we propose a novel architecture tobuild the forget gate which is generated by multiple bases.Instead of using the traditional single-layer fully-connectednetwork, we use a Multiple Attention (MA) based network togenerate the forget gate which refines the optimization spaceof gate function and improve the granularity of the recurrentneural network to approximate the map in the ground truth.Credit to the MA structure on the gate mechanism. Our modelhas a better feature extraction capability than other knownmodels. MA-LSTM is an alternative module which can directly replace the recurrent neural network and has achieved good performance in many areas that people are concerned about.

Similar papers

Trajectory-User Link with Attention Recurrent Networks

Tao Sun, Yongjun Xu, Fei Wang, Lin Wu, 塘文钱, Zezhi Shao

Auto-TLDR; TULAR: Trajectory-User Link with Attention Recurrent Neural Networks

MA-LSTM: A Multi-Attention Based LSTM for Complex Pattern Extraction

Similar papers

Trajectory-User Link with Attention Recurrent Networks

Multi-Graph Convolutional Network for Relationship-Driven Stock Movement Prediction

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

PIN: A Novel Parallel Interactive Network for Spoken Language Understanding

Global Context-Based Network with Transformer for Image2latex

Attentive Visual Semantic Specialized Network for Video Captioning

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Transfer Learning with Graph Neural Networks for Short-Term Highway Traffic Forecasting

Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning

Geographic-Semantic-Temporal Hypergraph Convolutional Network for Traffic Flow Prediction

Context Visual Information-Based Deliberation Network for Video Captioning

Global Feature Aggregation for Accident Anticipation

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

Emerging Relation Network and Task Embedding for Multi-Task Regression Problems

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

Regularized Flexible Activation Function Combinations for Deep Neural Networks

Tackling Contradiction Detection in German Using Machine Translation and End-To-End Recurrent Neural Networks

Flow-Guided Spatial Attention Tracking for Egocentric Activity Recognition

Automatic Student Network Search for Knowledge Distillation

Audio-Visual Speech Recognition Using a Two-Step Feature Fusion Strategy

GCNs-Based Context-Aware Short Text Similarity Model

Moto: Enhancing Embedding with Multiple Joint Factors for Chinese Text Classification

Road Network Metric Learning for Estimated Time of Arrival

Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

End-To-End Multi-Task Learning of Missing Value Imputation and Forecasting in Time-Series Data

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

Multi-Scanning Based Recurrent Neural Network for Hyperspectral Image Classification

Detective: An Attentive Recurrent Model for Sparse Object Detection

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Context Matters: Self-Attention for Sign Language Recognition

Adversarial Training for Aspect-Based Sentiment Analysis with BERT

Enhanced User Interest and Expertise Modeling for Expert Recommendation

Predicting Chemical Properties Using Self-Attention Multi-Task Learning Based on SMILES Representation

Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction

Label Incorporated Graph Neural Networks for Text Classification

Stroke Based Posterior Attention for Online Handwritten Mathematical Expression Recognition

Cross-People Mobile-Phone Based Airwriting Character Recognition

2D License Plate Recognition based on Automatic Perspective Rectification

Cross-Media Hash Retrieval Using Multi-head Attention Network

Switching Dynamical Systems with Deep Neural Networks

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions

Gaussian Constrained Attention Network for Scene Text Recognition

Enriching Video Captions with Contextual Text

Continuous Sign Language Recognition with Iterative Spatiotemporal Fine-Tuning

Enhancing Handwritten Text Recognition with N-Gram Sequencedecomposition and Multitask Learning