JOURNAL ARTICLE

Beyond Post-Hoc: Generating Inherently Verifiable Natural Language Explanations from AI Models

Revista, ZenIA, 10

Year: 2025
Journal: Zenodo (CERN European Organization for Nuclear Research)
Publisher: European Organization for Nuclear Research

Abstract

The increasing complexity and widespread deployment of Artificial Intelligence (AI) models, particularly deep learning systems, have amplified the demand for explainability. Traditional Explainable AI (XAI) methods often rely on post-hoc approaches, generating explanations after a model has made a prediction. While valuable, these post-hoc explanations can suffer from issues of fidelity, consistency, and a lack of direct verifiability against the model's true decision-making process. This paper proposes a paradigm shift towards generating inherently verifiable natural language explanations. We argue for the integration of symbolic reasoning and formal verification techniques directly into AI model architectures, enabling systems to produce explanations that are not merely plausible but are demonstrably grounded in the model's internal logic. Such an approach aims to foster greater trust, accountability, and reliability in AI systems, especially in high-stakes domains where erroneous or unexplainable decisions can have severe consequences. We discuss a conceptual framework for constructing such models, outline the methodological challenges, and highlight the potential for hybrid neuro-symbolic AI to bridge the gap between high performance and verifiable transparency.
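The abstract's core claim can be illustrated with a minimal sketch (all names and rules hypothetical, not from the paper): a model that decides via explicit symbolic rules and generates its natural-language explanation from the very rule that fired, so that a checker can independently verify the explanation against the model's internal logic rather than accepting a merely plausible post-hoc account.

```python
# Sketch of an inherently verifiable explanation: the decision and the
# explanation both derive from the same symbolic rule, so a verifier can
# re-check that the cited rule actually fires and entails the prediction.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Rule:
    name: str                          # identifier cited in the explanation
    condition: Callable[[dict], bool]  # symbolic predicate over the input
    outcome: str                       # decision this rule entails
    template: str                      # natural-language rendering of the rule

# Hypothetical rule base for a toy loan-approval model.
RULES = [
    Rule("high_risk_income", lambda x: x["income"] < 20_000,
         "deny", "income below 20,000 indicates high default risk"),
    Rule("low_risk_income", lambda x: x["income"] >= 20_000,
         "approve", "income at or above 20,000 meets the approval threshold"),
]

def predict_with_explanation(x: dict) -> tuple[str, str]:
    """Return (decision, explanation); both come from the same fired rule."""
    for rule in RULES:
        if rule.condition(x):
            return rule.outcome, f"[{rule.name}] {rule.template}"
    raise ValueError("no rule fired")

def verify(x: dict, decision: str, explanation: str) -> bool:
    """Check that the cited rule really fires on x and entails the decision."""
    cited = explanation.split("]")[0].lstrip("[")
    rule = next(r for r in RULES if r.name == cited)
    return rule.condition(x) and rule.outcome == decision

applicant = {"income": 15_000}
decision, explanation = predict_with_explanation(applicant)
print(decision)                                   # deny
print(verify(applicant, decision, explanation))   # True
```

Because prediction and explanation share one symbolic artifact, verification reduces to model checking a single rule against the input, which is the kind of demonstrable grounding the abstract contrasts with post-hoc methods.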

Keywords:
Verifiable secret sharing, Natural language, Bridge (graph theory), Natural (archaeology), Software deployment, Key (lock), Reliability (semiconductor), Model checking

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.85

Topics

Explainable Artificial Intelligence (XAI)
Physical Sciences →  Computer Science →  Artificial Intelligence
Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Interpretable AI: Beyond Post-Hoc Explanations Towards Inherently Understandable Models

Revista, ZenIA, 10

Journal: Zenodo (CERN European Organization for Nuclear Research), Year: 2025
BOOK-CHAPTER

Generating Natural Language Explanations from Plans

Chris Mellish

The MIT Press eBooks, Year: 1990, Pages: 181-224