There have been many advances in the artificial intelligence field due to the emergence of deep learning. In almost all sub-fields, artificial neural networks have reached or exceeded human-level performance. However, most of these models are not interpretable, so it is hard to trust their decisions, especially in life-and-death scenarios. In recent years, there has been a movement toward explainable artificial intelligence, but most work to date has concentrated on image-processing models, since visual patterns are easier for humans to perceive; other fields, such as natural language processing, have received little attention. In this paper, we train a convolutional model on textual data and analyze the global logic of the model by studying its filter values. We then identify the words in our corpus that are most important to the model's logic and remove the rest (95%). New models trained on only the 5% most important words achieve the same performance as the original model while cutting training time by more than half. Approaches such as this will help us understand NLP models, explain their decisions in terms of their word choices, and improve them by finding blind spots and biases.
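The abstract only sketches the pipeline, so the following is a minimal illustration of what filter-based word scoring for a text CNN could look like. The architecture, the scoring rule (maximum absolute filter response to each word's embedding), and all names here are illustrative assumptions, not the authors' exact method.

```python
# Hypothetical sketch: rank vocabulary words by how strongly the
# learned convolutional filters respond to them, then keep the top 5%.
import torch
import torch.nn as nn

VOCAB_SIZE, EMB_DIM, N_FILTERS, KERNEL = 10_000, 100, 64, 3

class TextCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB_SIZE, EMB_DIM)
        self.conv = nn.Conv1d(EMB_DIM, N_FILTERS, KERNEL, padding=1)
        self.fc = nn.Linear(N_FILTERS, 2)

    def forward(self, token_ids):                   # (batch, seq_len)
        x = self.emb(token_ids).transpose(1, 2)     # (batch, emb, seq)
        h = torch.relu(self.conv(x)).max(dim=2).values
        return self.fc(h)

def word_importance(model: TextCNN) -> torch.Tensor:
    """Score each vocabulary word by the strongest response any
    convolutional filter gives to that word's embedding alone."""
    with torch.no_grad():
        emb = model.emb.weight                      # (vocab, emb_dim)
        x = emb.unsqueeze(2)                        # length-1 "sequences"
        resp = model.conv(x).squeeze(2)             # (vocab, n_filters)
        return resp.abs().max(dim=1).values         # one score per word

model = TextCNN()  # assume it has already been trained on the corpus
scores = word_importance(model)
keep = torch.topk(scores, k=int(0.05 * VOCAB_SIZE)).indices  # top 5% of words
```

Under this sketch, a reduced corpus would be rebuilt by dropping every token outside `keep`, and a fresh model trained on that corpus; the abstract reports that such a retrained model matches the original's performance at less than half the training time.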