Omnihealth Practice

News & Perspective

HKUMed unveils AI model achieving over 90% accuracy in thyroid cancer diagnosis

19 Jul 2025

This AI model aims to alleviate the burden on clinicians by automating the extraction of vital clinical information.³ It employs advanced large language model (LLM) strategies and classifies patients according to the 8th edition of the American Joint Committee on Cancer (AJCC) staging system and the American Thyroid Association (ATA) risk categories.^3,4 By integrating multiple offline open-source LLMs, including Mistral, Gemma, Llama, and Qwen, the model streamlines data extraction, significantly reducing the manual effort needed to gather relevant information from unstructured notes.

To validate their model, the research team sourced 339 semi-structured pathology reports from the public Cancer Genome Atlas-Thyroid Cancer (TCGA-THCA) database.³ They used 50 of these reports for model development and the remaining 289 for validation.³ The development set included both papillary and follicular thyroid carcinomas in proportions that reflect real-world cases.³ Based on the AJCC staging system, patients were categorized into various stages: stage I (n=31), stage II (n=15), stage III (n=2), and stage IVB (n=2).³ The ATA risk categories—low, intermediate, and high—were also evenly represented, with expert clinicians annotating these reports to identify key entities necessary for staging and risk classification.³

The model's performance was assessed using F1-score, which measure both precision and recall.³ In the development phase, the ensemble strategies produced impressive results, achieving F1-score of 100% for ATA risk classification and 94.1% for AJCC staging.³ For the 289 validation cases from the TCGA-THCA dataset, the ensemble model maintained high accuracy, obtaining F1-score ranging from 95.2% to 95.5% for ATA risk and 98.1% for AJCC staging.³ Individual LLMs also performed well, with scores ranging from 88.5% to 99.7%.3 Additionally, the model was tested on 35 pseudo-clinical cases that mirrored real-world scenarios, yielding F1-score of 88.5% for ATA risk categorization and between 90.4% and 92.9% for AJCC staging.³ Despite these promising results, the researchers acknowledge some limitations.³ The model occasionally struggles to differentiate between microscopic and gross extra-thyroidal extension.³ Furthermore, the limited representation of patients with advanced-stage thyroid cancer in the development set may impact performance in these cases.³ Therefore, human verification of AI-generated outputs remains crucial.³ In conclusion, this study highlights the potential of AI, particularly lightweight LLMs, to enhance the extraction and classification of critical data from unstructured clinical notes.³ By enabling local deployment with its offline capability, the AI model ensured patient privacy while improving the speed and consistency of thyroid cancer staging and risk assessment.³ This innovation represents a major step toward integrating AI into clinical workflows and sets the stage for broader applications in oncology and beyond.³

Get access to our exclusive articles.

Multidisciplinary MTB-guided treatment improves survival outcomes of patients with heavily pretreated advanced solid tumors

A recent study, conducted by the Department of Clinical Oncology, School of Clinical Medicine, LKS Faculty of Medicine, the University of Hong Kong (HKUMed) and the Division of Clinical Pathology & Molecular Pathology, Hong Kong Sanatorium & Hospital (HKSH), has shown that patients with heavily pretreated advanced solid tumors and who had undergone treatment guided by the multidisciplinary molecular tumor board (MTB) exhibited a significantly longer overall survival (OS) than those who were treated with non-MTB-guided therapy.¹

ONCOLOGY

28 Jul 2023

Artificial intelligence platforms enable population-wide lung cancer screening programs

Lung cancer is the leading cause of cancer death worldwide that accounted for 18.4% of all cancer deaths.¹ In Hong Kong, lung cancer was associated with a crude mortality rate of 51.7% in 2018 and is considered the most common cause of cancer death.² Previously, population screening with low-dose co

ONCOLOGY RESPIROLOGY

28 Feb 2021

Capturing the silent thief of sight: An artificial intelligence screening system for glaucoma

Glaucoma is the global leading cause of irreversible blindness and the second cause of blindness after cataracts.¹ The current estimated global prevalence of glaucoma for the population aged 40-80 years is 3.54% (95% CI: 2.09-5.82) and is projected to increase in the aging

OPHTHALMOLOGY

29 Oct 2020