Multimodal Side-Tuning for Document Classification
Stefano Zingaro, Giuseppe Lisanti, Maurizio Gabbrielli
Auto-TLDR; Side-tuning for Multimodal Document Classification
Similar papers
Textual-Content Based Classification of Bundles of Untranscribed Manuscript Images
José Ramón Prieto Fontcuberta, Enrique Vidal, Vicente Bosch, Carlos Alonso, Carmen Orcero, Lourdes Márquez
Auto-TLDR; Probabilistic Indexing for Text-based Classification of Manuscripts
Vision-Based Layout Detection from Scientific Literature Using Recurrent Convolutional Neural Networks
Auto-TLDR; Transfer Learning for Scientific Literature Layout Detection Using Convolutional Neural Networks
PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks
Wenwen Yu, Ning Lu, Xianbiao Qi, Ping Gong, Rong Xiao
Auto-TLDR; PICK: A Graph Learning Framework for Key Information Extraction from Documents
Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks
Michele Alberti, Angela Botros, Schuetz Narayan, Rolf Ingold, Marcus Liwicki, Mathias Seuret
Auto-TLDR; Trainable and Spectrally Initializable Matrix Transformations for Neural Networks
Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents
Manuel Carbonell, Pau Riba, Mauricio Villegas, Alicia Fornés, Josep Llados
Auto-TLDR; Graph Neural Network for Entity Recognition and Relation Extraction in Semi-Structured Documents
Writer Identification Using Deep Neural Networks: Impact of Patch Size and Number of Patches
Akshay Punjabi, José Ramón Prieto Fontcuberta, Enrique Vidal
Auto-TLDR; Writer Recognition Using Deep Neural Networks for Handwritten Text Images
Learning Neural Textual Representations for Citation Recommendation
Thanh Binh Kieu, Inigo Jauregi Unanue, Son Bao Pham, Xuan-Hieu Phan, M. Piccardi
Auto-TLDR; Sentence-BERT cascaded with Siamese and triplet networks for citation recommendation
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions
Iulian Cojocaru, Silvia Cascianelli, Lorenzo Baraldi, Massimiliano Corsini, Rita Cucchiara
Auto-TLDR; Deformable Convolutional Neural Networks for Handwritten Text Recognition
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers
Manuel Burghardt, Bernhard Liebl
Auto-TLDR; Evaluation of Backbone Architectures for Optical Character Segmentation of Historical Documents
Improving Word Recognition Using Multiple Hypotheses and Deep Embeddings
Siddhant Bansal, Praveen Krishnan, C. V. Jawahar
Auto-TLDR; EmbedNet: fuse recognition-based and recognition-free approaches for word recognition using learning-based methods
Ancient Document Layout Analysis: Autoencoders Meet Sparse Coding
Homa Davoudi, Marco Fiorucci, Arianna Traviglia
Auto-TLDR; Unsupervised Representation Learning for Document Layout Analysis
Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks
Mélodie Boillet, Christopher Kermorvant, Thierry Paquet
Auto-TLDR; A fully convolutional network for document layout analysis
A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping
Hmrishav Bandyopadhyay, Tanmoy Dasgupta, Nibaran Das, Mita Nasipuri
Auto-TLDR; Gated and Bifurcated Stacked U-Net for Dewarping Document Images
Sequential Domain Adaptation through Elastic Weight Consolidation for Sentiment Analysis
Avinash Madasu, Anvesh Rao Vijjini
Auto-TLDR; Sequential Domain Adaptation using Elastic Weight Consolidation for Sentiment Analysis
Recursive Recognition of Offline Handwritten Mathematical Expressions
Marco Cotogni, Claudio Cusano, Antonino Nocera
Auto-TLDR; Online Handwritten Mathematical Expression Recognition with Recurrent Neural Network
Learning with Delayed Feedback
Pranavan Theivendiram, Terence Sim
Auto-TLDR; Unsupervised Machine Learning with Delayed Feedback
Recognizing Bengali Word Images - A Zero-Shot Learning Perspective
Sukalpa Chanda, Daniël Arjen Willem Haitink, Prashant Kumar Prasad, Jochem Baas, Umapada Pal, Lambert Schomaker
Auto-TLDR; Zero-Shot Learning for Word Recognition in Bengali Script
Information Graphic Summarization Using a Collection of Multimodal Deep Neural Networks
Edward Kim, Connor Onweller, Kathleen F. McCoy
Auto-TLDR; A multimodal deep learning framework that can generate summarization text supporting the main idea of an information graphic for presentation to blind or visually impaired
Unsupervised deep learning for text line segmentation
Berat Kurar Barakat, Ahmad Droby, Reem Alaasam, Borak Madi, Irina Rabaev, Raed Shammes, Jihad El-Sana
Auto-TLDR; Unsupervised Deep Learning for Handwritten Text Line Segmentation without Annotation
A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification
Pierluigi Carcagni, Marco Leo, Andrea Cuna, Giuseppe Celeste, Cosimo Distante
Auto-TLDR; RegNet: Deep Investigation of Convolutional Neural Networks for Automatic Classification of Skin Lesions
Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network
Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin
Auto-TLDR; Semantically Extended Graph Convolutional Network for Zero-shot Text Classification
Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering
Yanan Li, Yuetan Lin, Hongrui Zhao, Donghui Wang
Auto-TLDR; TextVQA: An End-to-End Visual Question Answering Model for Text-Based VQA
A Few-Shot Learning Approach for Historical Ciphered Manuscript Recognition
Mohamed Ali Souibgui, Alicia Fornés, Yousri Kessentini, Crina Tudor
Auto-TLDR; Handwritten Ciphers Recognition Using Few-Shot Object Detection
A Novel Attention-Based Aggregation Function to Combine Vision and Language
Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
Auto-TLDR; Fully-Attentive Reduction for Vision and Language
Adversarial Training for Aspect-Based Sentiment Analysis with BERT
Akbar Karimi, Andrea Prati, Leonardo Rossi
Auto-TLDR; Adversarial Training of BERT for Aspect-Based Sentiment Analysis
LODENet: A Holistic Approach to Offline Handwritten Chinese and Japanese Text Line Recognition
Huu Tin Hoang, Chun-Jen Peng, Hung Tran, Hung Le, Huy Hoang Nguyen
Auto-TLDR; Logographic DEComposition Encoding for Chinese and Japanese Text Line Recognition
On-Device Text Image Super Resolution
Dhruval Jain, Arun Prabhu, Gopi Ramena, Manoj Goyal, Debi Mohanty, Naresh Purre, Sukumar Moharana
Auto-TLDR; A Novel Deep Neural Network for Super-Resolution on Low Resolution Text Images
Efficient Sentence Embedding Via Semantic Subspace Analysis
Bin Wang, Fenxiao Chen, Yun Cheng Wang, C.-C. Jay Kuo
Auto-TLDR; S3E: Semantic Subspace Sentence Embedding
A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata
Tobia Tesan, Pasquale Coscia, Lamberto Ballan
Auto-TLDR; Context-Based Image Annotation with Multiple Semantic Embeddings and Recurrent Neural Networks
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang
Auto-TLDR; Convolutional Sequence Modeling for Mathematical Expressions Recognition
Text Synopsis Generation for Egocentric Videos
Aidean Sharghi, Niels Lobo, Mubarak Shah
Auto-TLDR; Egocentric Video Summarization Using Multi-task Learning for End-to-End Learning
KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding
Hyunjae Lee, Jaewoong Yun, Bongkyu Hwang, Seongho Joe, Seungjai Min, Youngjune Gwon
Auto-TLDR; KoreALBERT: A monolingual ALBERT model for Korean language understanding
Bridging the Gap between Natural and Medical Images through Deep Colorization
Lia Morra, Luca Piano, Fabrizio Lamberti, Tatiana Tommasi
Auto-TLDR; Transfer Learning for Diagnosis on X-ray Images Using Color Adaptation
A Systematic Investigation on End-To-End Deep Recognition of Grocery Products in the Wild
Marco Leo, Pierluigi Carcagni, Cosimo Distante
Auto-TLDR; Automatic Recognition of Products on grocery shelf images using Convolutional Neural Networks
Text Baseline Recognition Using a Recurrent Convolutional Neural Network
Matthias Wödlinger, Robert Sablatnig
Auto-TLDR; Automatic Baseline Detection of Handwritten Text Using Recurrent Convolutional Neural Network
Fine-Tuning Convolutional Neural Networks: A Comprehensive Guide and Benchmark Analysis for Glaucoma Screening
Amed Mvoulana, Rostom Kachouri, Mohamed Akil
Auto-TLDR; Fine-tuning Convolutional Neural Networks for Glaucoma Screening
GCNs-Based Context-Aware Short Text Similarity Model
Auto-TLDR; Context-Aware Graph Convolutional Network for Text Similarity
Class-Incremental Learning with Pre-Allocated Fixed Classifiers
Federico Pernici, Matteo Bruni, Claudio Baecchi, Francesco Turchini, Alberto Del Bimbo
Auto-TLDR; Class-Incremental Learning with Pre-allocated Output Nodes for Fixed Classifier
Enriching Video Captions with Contextual Text
Philipp Rimle, Pelin Dogan, Markus Gross
Auto-TLDR; Contextualized Video Captioning Using Contextual Text
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks
Hyunjin Choi, Judong Kim, Seongho Joe, Youngjune Gwon
Auto-TLDR; Sentence Embedding Models for BERT and ALBERT: A Comparison and Evaluation
Label Incorporated Graph Neural Networks for Text Classification
Yuan Xin, Linli Xu, Junliang Guo, Jiquan Li, Xin Sheng, Yuanyuan Zhou
Auto-TLDR; Graph Neural Networks for Semi-supervised Text Classification
ESResNet: Environmental Sound Classification Based on Visual Domain Models
Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel
Auto-TLDR; Environmental Sound Classification with Short-Time Fourier Transform Spectrograms
Extracting Action Hierarchies from Action Labels and their Use in Deep Action Recognition
Konstadinos Bacharidis, Antonis Argyros
Auto-TLDR; Exploiting the Information Content of Language Label Associations for Human Action Recognition
Self-Supervised Learning for Astronomical Image Classification
Ana Martinazzo, Mateus Espadoto, Nina S. T. Hirata
Auto-TLDR; Unlabeled Astronomical Images for Deep Neural Network Pre-training
Attentive Visual Semantic Specialized Network for Video Captioning
Jesus Perez-Martin, Benjamin Bustos, Jorge Pérez
Auto-TLDR; Adaptive Visual Semantic Specialized Network for Video Captioning
Deep Convolutional Embedding for Digitized Painting Clustering
Giovanna Castellano, Gennaro Vessio
Auto-TLDR; A Deep Convolutional Embedding Model for Clustering Artworks
Which are the factors affecting the performance of audio surveillance systems?
Antonio Greco, Antonio Roberto, Alessia Saggese, Mario Vento
Auto-TLDR; Sound Event Recognition Using Convolutional Neural Networks and Visual Representations on MIVIA Audio Events
The HisClima Database: Historical Weather Logs for Automatic Transcription and Information Extraction
Verónica Romero, Joan Andreu Sánchez
Auto-TLDR; Automatic Handwritten Text Recognition and Information Extraction from Historical Weather Logs