ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

PIN: A Novel Parallel Interactive Network for Spoken Language Understanding

Peilin Zhou, Zhiqi Huang, Fenglin Liu, Yuexian Zou

Auto-TLDR; Parallel Interactive Network for Spoken Language Understanding

Spoken Language Understanding (SLU) is an essential part of a spoken dialogue system and typically consists of two tasks: intent detection (ID) and slot filling (SF). Recently, methods based on recurrent neural networks (RNNs) have achieved state-of-the-art results for SLU. In existing RNN-based approaches, ID and SF are often modeled jointly to exploit the correlation between them. However, bidirectional and explicit information exchange between ID and SF has not yet been well studied as a way to improve performance. In addition, few studies attempt to capture local context information to improve SF. Motivated by these observations, this paper proposes the Parallel Interactive Network (PIN) to model the mutual guidance between ID and SF. Specifically, given an utterance, a Gaussian self-attentive encoder generates a context-aware feature embedding of the utterance that captures local context information. Taking this feature embedding as input, a Slot2Intent module and an Intent2Slot module capture the bidirectional information flow between the ID and SF tasks. Finally, a cooperation mechanism fuses the information obtained from the Slot2Intent and Intent2Slot modules to further reduce prediction bias. Experiments on two benchmark datasets, SNIPS and ATIS, demonstrate the effectiveness of the approach, which achieves results competitive with state-of-the-art models. More encouragingly, when the feature embedding of the utterance is generated by the pre-trained language model BERT, the method achieves the best results among all compared approaches.
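
As a rough illustration of how a Gaussian prior can steer self-attention toward local context, the sketch below subtracts a squared-distance penalty from the attention scores before the softmax, so nearby tokens receive more weight. This is not the authors' implementation: the abstract gives no formula, and the function name `gaussian_self_attention`, the `sigma` parameter, and the use of identity query/key/value projections are all assumptions made for illustration.

```python
import numpy as np

def gaussian_self_attention(x, sigma=1.0):
    """Self-attention with a Gaussian locality bias (illustrative sketch).

    x: array of shape (seq_len, dim) holding token embeddings.
    sigma: hypothetical width of the Gaussian window over token positions.
    Returns (output, weights) with shapes (seq_len, dim) and (seq_len, seq_len).
    """
    seq_len, dim = x.shape
    # Scaled dot-product scores; queries, keys and values are all x here
    # (identity projections are an assumption to keep the sketch short).
    scores = x @ x.T / np.sqrt(dim)
    # Squared distance between every pair of token positions.
    pos = np.arange(seq_len)
    dist_sq = (pos[:, None] - pos[None, :]) ** 2
    # Adding log of a Gaussian prior: distant tokens are penalized.
    scores = scores - dist_sq / (2.0 * sigma ** 2)
    # Numerically stable row-wise softmax.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ x, weights

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    tokens = rng.normal(size=(6, 8))  # 6 tokens, 8-dimensional embeddings
    out, w = gaussian_self_attention(tokens, sigma=1.5)
    print(out.shape, w.shape)
```

A smaller `sigma` narrows the window and makes each token attend almost only to its immediate neighbors; a very large `sigma` recovers ordinary self-attention.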

Similar papers

Attentive Visual Semantic Specialized Network for Video Captioning

Jesus Perez-Martin, Benjamin Bustos, Jorge Pérez

Auto-TLDR; Adaptive Visual Semantic Specialized Network for Video Captioning

GCNs-Based Context-Aware Short Text Similarity Model

Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction

Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks

Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning

Cross-Supervised Joint-Event-Extraction with Heterogeneous Information Networks

Adversarial Training for Aspect-Based Sentiment Analysis with BERT

KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding

Tackling Contradiction Detection in German Using Machine Translation and End-To-End Recurrent Neural Networks

Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images

Assessing the Severity of Health States Based on Social Media Posts

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

Context Visual Information-Based Deliberation Network for Video Captioning

Automatic Student Network Search for Knowledge Distillation

Label Incorporated Graph Neural Networks for Text Classification

MA-LSTM: A Multi-Attention Based LSTM for Complex Pattern Extraction

Moto: Enhancing Embedding with Multiple Joint Factors for Chinese Text Classification

Dual Path Multi-Modal High-Order Features for Textual Content Based Visual Question Answering

Text Synopsis Generation for Egocentric Videos

Global Context-Based Network with Transformer for Image2latex

Learning Neural Textual Representations for Citation Recommendation

CKG: Dynamic Representation Based on Context and Knowledge Graph

Trajectory-User Link with Attention Recurrent Networks

Enriching Video Captions with Contextual Text

Gaussian Constrained Attention Network for Scene Text Recognition

Efficient Sentence Embedding Via Semantic Subspace Analysis

Context Matters: Self-Attention for Sign Language Recognition

A Novel Attention-Based Aggregation Function to Combine Vision and Language

Predicting Chemical Properties Using Self-Attention Multi-Task Learning Based on SMILES Representation

Multi-Graph Convolutional Network for Relationship-Driven Stock Movement Prediction

Transformer Reasoning Network for Image-Text Matching and Retrieval

Transformer Networks for Trajectory Forecasting

MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Feature-Aware Unsupervised Learning with Joint Variational Attention and Automatic Clustering

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

End-To-End Hierarchical Relation Extraction for Generic Form Understanding

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Multi-Modal Contextual Graph Neural Network for Text Visual Question Answering

VSR++: Improving Visual Semantic Reasoning for Fine-Grained Image-Text Matching

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

Video Summarization with a Dual Attention Capsule Network

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

Enhanced User Interest and Expertise Modeling for Expert Recommendation

Weakly Supervised Attention Rectification for Scene Text Recognition