Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Thomas, Graeme Roger

doi:10.25949/29286395

ScienceGate Book Chapters

JOURNAL ARTICLE

Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Thomas, Graeme Roger

Year: 2025 Journal: Macquarie University

DOI: 10.25949/29286395

Get Full-Text PDF Get Analytical Report

Abstract

This study investigates privacy-preserving techniques and secure communication protocols for safeguarding user data in healthcare chatbots. With the increasing deployment of AI-driven chatbots for medical support, concerns regarding the privacy and security of sensitive patient data have escalated. The objectives guiding this study include i) determining how encryption and anonymisation techniques affect NLP performance; ii) identifying the vulnerabilities in communication channels used by NLP chatbots; iii) to determine the best combination of privacy and communication techniques that offers the best protection for chatbots; iv) to determine the effect of privacy-preserving techniques on user experience; and v) the regulations and ethics surrounding the use and implementation of privacy preservation techniques. The study employs a mixed-methods approach, combining quantitative and qualitative analyses to assess privacy-preserving NLP techniques and their implications for healthcare chatbot security. This method enables a more nuanced exploration of the research problem, offering both empirical data and contextual insights. The findings reveal that AES, a symmetric encryption method, consistently outperformed RSA, an asymmetric method, in terms of speed, efficiency, and impact on chatbot response times. AES demonstrated significantly lower encryption times compared to RSA with minimal computational overhead, enabling near-instantaneous responses even with stronger encryption keys. The study also evaluates the effectiveness of anonymisation techniques. Before anonymisation, the dataset exhibited a high privacy leakage rate (PLR) of 92%, with 100% Identifiable Data Residuals (IDR) and an 85% re-identification risk (RR), underscoring the significant exposure of sensitive information. After applying tokenisation, redaction, and data masking, privacy leakage rates were substantially reduced: tokenisation achieved an 8% PLR, redaction reduced PLR to 4%, and data masking resulted in a 14% PLR. Redaction was the most effective among these methods, eliminating identifiable data with a low re-identification risk of 7%. Several policy implications are also discussed, providing guidelines for regulatory authorities to improve the privacy protection of healthcare chatbot applications. Considering these considerations and limitations, the present study provides a valuable first step toward developing private chatbots that protect user privacy by augmenting existing work, datasets, and resources. It can also be extended to any application that demands privacy preservation. Possible research avenues are proposed, such as analysing quantum/invertible encryption, synthesising data, and considering privacy models for other high-sensitivity areas, including finance and education.

Keywords:

Encryption Chatbot Health care Safeguarding Software deployment Information privacy Health informatics Data Protection Act 1998

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.55

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Geochemistry and Geologic Mapping

Physical Sciences → Computer Science → Artificial Intelligence

Geological Modeling and Analysis

Physical Sciences → Earth and Planetary Sciences → Geochemistry and Petrology

Electrical and Electromagnetic Research

Physical Sciences → Physics and Astronomy → Atomic and Molecular Physics, and Optics

Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Abstract

Metrics

Topics

Related Documents

Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Privacy-Preserving Natural Language Processing

Privacy preserving methods for Natural Language Processing

Privacy-Preserving Models for Legal Natural Language Processing

Application of Natural Language Processing for Creating Chatbots in Healthcare