JOURNAL ARTICLE

Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Abstract

This study investigates privacy-preserving techniques and secure communication protocols for safeguarding user data in healthcare chatbots. With the increasing deployment of AI-driven chatbots for medical support, concerns regarding the privacy and security of sensitive patient data have escalated. The objectives guiding this study include i) determining how encryption and anonymisation techniques affect NLP performance; ii) identifying the vulnerabilities in communication channels used by NLP chatbots; iii) to determine the best combination of privacy and communication techniques that offers the best protection for chatbots; iv) to determine the effect of privacy-preserving techniques on user experience; and v) the regulations and ethics surrounding the use and implementation of privacy preservation techniques. The study employs a mixed-methods approach, combining quantitative and qualitative analyses to assess privacy-preserving NLP techniques and their implications for healthcare chatbot security. This method enables a more nuanced exploration of the research problem, offering both empirical data and contextual insights. The findings reveal that AES, a symmetric encryption method, consistently outperformed RSA, an asymmetric method, in terms of speed, efficiency, and impact on chatbot response times. AES demonstrated significantly lower encryption times compared to RSA with minimal computational overhead, enabling near-instantaneous responses even with stronger encryption keys. The study also evaluates the effectiveness of anonymisation techniques. Before anonymisation, the dataset exhibited a high privacy leakage rate (PLR) of 92%, with 100% Identifiable Data Residuals (IDR) and an 85% re-identification risk (RR), underscoring the significant exposure of sensitive information. After applying tokenisation, redaction, and data masking, privacy leakage rates were substantially reduced: tokenisation achieved an 8% PLR, redaction reduced PLR to 4%, and data masking resulted in a 14% PLR. Redaction was the most effective among these methods, eliminating identifiable data with a low re-identification risk of 7%. Several policy implications are also discussed, providing guidelines for regulatory authorities to improve the privacy protection of healthcare chatbot applications. Considering these considerations and limitations, the present study provides a valuable first step toward developing private chatbots that protect user privacy by augmenting existing work, datasets, and resources. It can also be extended to any application that demands privacy preservation. Possible research avenues are proposed, such as analysing quantum/invertible encryption, synthesising data, and considering privacy models for other high-sensitivity areas, including finance and education.

Keywords:
Encryption Chatbot Health care Safeguarding Software deployment Information privacy Health informatics Data Protection Act 1998

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Geochemistry and Geologic Mapping
Physical Sciences →  Computer Science →  Artificial Intelligence
Geological Modeling and Analysis
Physical Sciences →  Earth and Planetary Sciences →  Geochemistry and Petrology
Electrical and Electromagnetic Research
Physical Sciences →  Physics and Astronomy →  Atomic and Molecular Physics, and Optics

Related Documents

JOURNAL ARTICLE

Privacy-Preserving Natural Language Processing Techniques in Healthcare Chatbots

Thomas, Graeme Roger

Journal:   OPAL (Open@LaTrobe) (La Trobe University) Year: 2025
JOURNAL ARTICLE

Privacy preserving methods for Natural Language Processing

Plant, Richard

Journal:   Edinburgh Napier Research Repository (Edinburgh Napier University) Year: 2025
JOURNAL ARTICLE

Application of Natural Language Processing for Creating Chatbots in Healthcare

Sheetal S ChakravarthySowmya N S

Journal:   International Journal of Research Publication and Reviews Year: 2024 Vol: 5 (1)Pages: 1472-1481
© 2026 ScienceGate Book Chapters — All rights reserved.