ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Visual Style Extraction from Chart Images for Chart Restyling

Danqing Huang, Jinpeng Wang, Guoxin Wang, Chin-Yew Lin

Auto-TLDR; Exploiting Visual Properties from Reference Chart Images for Chart Restyling

Abstract Slides Poster

Creating a good looking chart for better visualization is time consuming. There are plenty of well-designed charts on the Web, which are ideal references for imitation of chart style. However, stored as bitmap images, reference charts have hinder machine interpretation of style settings and thus difficult to be directly applied. In this paper, we extract visual properties from reference chart images as style templates to restyle charts. We first construct a large-scale dataset of 187,059 chart images from real world data, labeled with predefined visual property values. Then we introduce an end-to-end learning network to extract the properties based on two image-encoding approaches. Furthermore, in order to capture spatial relationships of chart objects, which are crucial in solving the task, we propose a novel positional encoding method to integrate clues of relative positions between objects. Experimental results show that our model significantly outperforms baseline models. By adding positional features, our model achieves better performance. Finally, we present the application for chart restyling based on our model.

Similar papers

Multi-Modal Contextual Graph Neural Network for Text Visual Question Answering

Yaoyuan Liang, Xin Wang, Xuguang Duan, Wenwu Zhu

Auto-TLDR; Multi-modal Contextual Graph Neural Network for Text Visual Question Answering

Visual Style Extraction from Chart Images for Chart Restyling

Similar papers

Multi-Modal Contextual Graph Neural Network for Text Visual Question Answering

Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

Question-Agnostic Attention for Visual Question Answering

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

Multi-Scale Relational Reasoning with Regional Attention for Visual Question Answering

Multi-Stage Attention Based Visual Question Answering

A Novel Attention-Based Aggregation Function to Combine Vision and Language

Answer-Checking in Context: A Multi-Modal Fully Attention Network for Visual Question Answering

Information Graphic Summarization Using a Collection of Multimodal Deep Neural Networks

Integrating Historical States and Co-Attention Mechanism for Visual Dialog

Improving Visual Relation Detection Using Depth Maps

Detective: An Attentive Recurrent Model for Sparse Object Detection

MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level

An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction

Transformer Reasoning Network for Image-Text Matching and Retrieval

P ≈ NP, at Least in Visual Question Answering

ConvMath : A Convolutional Sequence Network for Mathematical Expression Recognition

SIMCO: SIMilarity-Based Object COunting

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

VTT: Long-Term Visual Tracking with Transformers

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

A Fast and Accurate Object Detector for Handwritten Digit String Recognition

Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection

Improving Visual Question Answering Using Active Perception on Static Images

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition

Graph Discovery for Visual Test Generation

The DeepScoresV2 Dataset and Benchmark for Music Object Detection

Scene Text Detection with Selected Anchors

Adaptive Word Embedding Module for Semantic Reasoning in Large-Scale Detection

Text Recognition - Real World Data and Where to Find Them

Unsupervised Domain Adaptation for Object Detection in Cultural Sites

Global Context-Based Network with Transformer for Image2latex

SynDHN: Multi-Object Fish Tracker Trained on Synthetic Underwater Videos

Hierarchical Head Design for Object Detectors

Detecting Objects with High Object Region Percentage

Image-Based Table Cell Detection: A New Dataset and an Improved Detection Method

Object Detection Using Dual Graph Network

Point In: Counting Trees with Weakly Supervised Segmentation Network

Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks

Label or Message: A Large-Scale Experimental Survey of Texts and Objects Co-Occurrence

Learning to Rank for Active Learning: A Listwise Approach

Weakly Supervised Attention Rectification for Scene Text Recognition

Iterative Bounding Box Annotation for Object Detection

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

A Novel Region of Interest Extraction Layer for Instance Segmentation

Enhanced User Interest and Expertise Modeling for Expert Recommendation

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN