Interfaces for Explaining Transformer Language Models
Jay Alammar
DECEMBER 16, 2020
This article focuses on auto-regressive models, but these methods are applicable to other architectures and tasks as well. input saliency is a method that explains individual predictions. The literature is most often concerned with this application for classification tasks, rather than natural language generation.
Let's personalize your content