ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strongly forbidden.

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

Saleem Ahmed, Kenny Davila, Srirangaraj Setlur, Venu Govindaraju

Auto-TLDR; Representational Learning for Similarity Based Retrieval of Mathematical Expressions

Representational learning in the form of high-dimensional embeddings has been used for multiple pattern recognition applications. There has been significant interest in building embedding-based systems for learning representations in the mathematical domain. At the same time, retrieval of structured information such as mathematical expressions is an important need for modern IR systems. In this work, our motivation is to introduce a robust framework for learning representations for similarity-based retrieval of mathematical expressions. Given a query by example, the closest matching expressions are found by Euclidean distance in the embedding space. We leverage recent advancements in image-based and graph-based deep learning algorithms to learn our similarity embeddings. We do this first by using uni-modal encoders in graph space and image space, and then by using a multi-modal combination of the two. To overcome the lack of training data, we force the networks to learn a deep metric using triplets generated with a heuristic scoring function. We also adopt a custom strategy for mining hard samples to train our neural networks. Our system produces rankings similar to those generated by the original scoring function, but in only a fraction of the time. Our results establish the viability of using such a multi-modal embedding for this task.
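The two generic ingredients named in the abstract, a triplet-based metric objective and retrieval by Euclidean distance, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the margin value, and the toy 2-D vectors are all assumptions made for illustration. The paper's encoders, heuristic scoring function, and hard-sample mining strategy are not reproduced here.

```python
import math

def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard hinge-style triplet loss on Euclidean distances.

    `margin` is an assumed hyperparameter, not a value taken from the paper.
    The loss is zero once the positive is closer to the anchor than the
    negative by at least `margin`.
    """
    d_pos = math.dist(anchor, positive)
    d_neg = math.dist(anchor, negative)
    return max(d_pos - d_neg + margin, 0.0)

def rank_by_distance(query, embeddings):
    """Return indices of `embeddings` ordered nearest-first to `query`."""
    return sorted(range(len(embeddings)),
                  key=lambda i: math.dist(query, embeddings[i]))

# Toy 2-D embeddings standing in for learned expression embeddings (hypothetical).
embeddings = [[0.0, 0.0], [1.0, 0.0], [3.0, 4.0]]
query = [0.9, 0.1]

ranking = rank_by_distance(query, embeddings)

# A triplet that already satisfies the margin, and one that violates it.
easy = triplet_margin_loss([0.0, 0.0], [1.0, 0.0], [3.0, 4.0])
hard = triplet_margin_loss([0.0, 0.0], [3.0, 4.0], [1.0, 0.0])
```

In a setup like the one described, the positive and negative for each anchor would be chosen according to the heuristic scoring function, and the ranking step would replace a call to that slower function at query time.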

Similar papers

Learning Neural Textual Representations for Citation Recommendation

Thanh Binh Kieu, Inigo Jauregi Unanue, Son Bao Pham, Xuan-Hieu Phan, M. Piccardi

Auto-TLDR; Sentence-BERT cascaded with Siamese and triplet networks for citation recommendation

What Nodes Vote To? Graph Classification without Readout Phase

Classification of Intestinal Gland Cell-Graphs Using Graph Neural Networks

Transformer Reasoning Network for Image-Text Matching and Retrieval

Region and Relations Based Multi Attention Network for Graph Classification

Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

MEG: Multi-Evidence GNN for Multimodal Semantic Forensics

A General Model for Learning Node and Graph Representations Jointly

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation

GCNs-Based Context-Aware Short Text Similarity Model

Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents

TreeRNN: Topology-Preserving Deep Graph Embedding and Learning

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

AOAM: Automatic Optimization of Adjacency Matrix for Graph Convolutional Network

Privacy Attributes-Aware Message Passing Neural Network for Visual Privacy Attributes Classification

Label Incorporated Graph Neural Networks for Text Classification

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

Generation of Hypergraphs from the N-Best Parsing of 2D-Probabilistic Context-Free Grammars for Mathematical Expression Recognition

Recursive Recognition of Offline Handwritten Mathematical Expressions

Beyond the Deep Metric Learning: Enhance the Cross-Modal Matching with Adversarial Discriminative Domain Regularization

Generalized Local Attention Pooling for Deep Metric Learning

Using Scene Graphs for Detecting Visual Relationships

A Novel Attention-Based Aggregation Function to Combine Vision and Language

Multi-Level Deep Learning Vehicle Re-Identification Using Ranked-Based Loss Functions

Improving Word Recognition Using Multiple Hypotheses and Deep Embeddings

On the Global Self-attention Mechanism for Graph Convolutional Networks

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

Sketch-Based Community Detection Via Representative Node Sampling

Loop-closure detection by LiDAR scan re-identification

Revisiting Graph Neural Networks: Graph Filtering Perspective

Nonlinear Ranking Loss on Riemannian Potato Embedding

Graph-Based Interpolation of Feature Vectors for Accurate Few-Shot Classification

Supervised Domain Adaptation Using Graph Embedding

VSR++: Improving Visual Semantic Reasoning for Fine-Grained Image-Text Matching

Self-Supervised Learning with Graph Neural Networks for Region of Interest Retrieval in Histopathology

Probabilistic Word Embeddings in Kinematic Space

Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge

Global Context-Based Network with Transformer for Image2latex

SL-DML: Signal Level Deep Metric Learning for Multimodal One-Shot Action Recognition

Audio-Based Near-Duplicate Video Retrieval with Audio Similarity Learning

Kernel-based Graph Convolutional Networks

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

Edge-Aware Graph Attention Network for Ratio of Edge-User Estimation in Mobile Networks

Building Computationally Efficient and Well-Generalizing Person Re-Identification Models with Metric Learning

Deep Top-Rank Counter Metric for Person Re-Identification