This study investigates the use of Large Language Models (LLMs) for enhancing medical anamnesis transcriptions via Text Style Transfer (TST). It involves benchmarking three models (Phi3, Llama, and Mistral) and fine-tuning Mistral based on evaluator feedback. Mistral achieved the best initial performance but showed limited improvement after fine-tuning. The work highlights the challenges of adapting LLMs to clinical tasks and discusses limitations such as data quality and evaluator bias. Future directions include extended training, dataset expansion, and exploring new machine learning techniques.
Weijie LiZhidong GuXiaochao FanWenjun DengYong YangXinyuan ZhaoYufeng DiaoLiang Yang
Sourabrata MukherjeeAtul Kr. OjhaOndřej Dušek
Zhen TaoDinghao XiZhiyu LiLiumin TangWei Xu
Chun‐Yan KongJianyi LiuYifan TangRu Zhang