Staff View: A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

Artificial intelligence (AI) has become a crucial element of modern technology, especially in the healthcare sector, which is apparent given the continuous development of large language models (LLMs), which are utilized in various domains, including medical beings. However, when it comes to using th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Alamoodi A.H., Zughoul O., David D., Garfan S., Pamucar D., Albahri O.S., Albahri A.S., Yussof S., Sharaf I.M.
Other Authors:	57205435311
Format:	Article
Published:	Springer 2025
Subjects:	Artificial Intelligence Decision Making Fuzzy Logic Humans Article case study data analysis data integration discourse analysis evaluation study feature extraction fuzzy logic fuzzy weighted zero inconsistency method health care practice large language model medical decision making medical education medical research multiatributive ideal real comparative analysis multicriteria decision analysis sensitivity analysis treatment outcome uncertainty artificial intelligence decision making human
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.uniten.dspace-36196
record_format	dspace
spelling	my.uniten.dspace-361962025-03-03T15:41:33Z A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction Alamoodi A.H. Zughoul O. David D. Garfan S. Pamucar D. Albahri O.S. Albahri A.S. Yussof S. Sharaf I.M. 57205435311 57204659906 58043918300 57213826607 54080216100 57201013684 57201009814 16023225600 17435789800 Artificial Intelligence Decision Making Fuzzy Logic Humans Article case study data analysis data integration discourse analysis evaluation study feature extraction fuzzy logic fuzzy weighted zero inconsistency method health care practice large language model medical decision making medical education medical research multiatributive ideal real comparative analysis multicriteria decision analysis sensitivity analysis treatment outcome uncertainty artificial intelligence decision making human Artificial intelligence (AI) has become a crucial element of modern technology, especially in the healthcare sector, which is apparent given the continuous development of large language models (LLMs), which are utilized in various domains, including medical beings. However, when it comes to using these LLMs for the medical domain, there?s a need for an evaluation platform to determine their suitability and drive future development efforts. Towards that end, this study aims to address this concern by developing a comprehensive Multi-Criteria Decision Making (MCDM) approach that is specifically designed to evaluate medical LLMs. The success of AI, particularly LLMs, in the healthcare domain, depends on their efficacy, safety, and ethical compliance. Therefore, it is essential to have a robust evaluation framework for their integration into medical contexts. This study proposes using the Fuzzy-Weighted Zero-InConsistency (FWZIC) method extended to p, q-quasirung orthopair fuzzy set (p, q-QROFS) for weighing evaluation criteria. This extension enables the handling of uncertainties inherent in medical decision-making processes. The approach accommodates the imprecise and multifaceted nature of real-world medical data and criteria by incorporating fuzzy logic principles. The MultiAtributive Ideal-Real Comparative Analysis (MAIRCA) method is employed for the assessment of medical LLMs utilized in the case study of this research. The results of this research revealed that ?Medical Relation Extraction? criteria with its sub-levels had more importance with (0.504) than ?Clinical Concept Extraction? with (0.495). For the LLMs evaluated, out of 6 alternatives, (A4) ?GatorTron S 10B? had the 1st rank as compared to (A1) ?GatorTron 90B? had the 6th rank. The implications of this study extend beyond academic discourse, directly impacting healthcare practices and patient outcomes. The proposed framework can help healthcare professionals make more informed decisions regarding the adoption and utilization of LLMs in medical settings. ? The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Final 2025-03-03T07:41:33Z 2025-03-03T07:41:33Z 2024 Article 10.1007/s10916-024-02090-y 2-s2.0-85202703844 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85202703844&doi=10.1007%2fs10916-024-02090-y&partnerID=40&md5=4502a35b5cacea143aa18aae89abdd53 https://irepository.uniten.edu.my/handle/123456789/36196 48 1 81 Springer Scopus
institution	Universiti Tenaga Nasional
building	UNITEN Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Tenaga Nasional
content_source	UNITEN Institutional Repository
url_provider	http://dspace.uniten.edu.my/
topic	Artificial Intelligence Decision Making Fuzzy Logic Humans Article case study data analysis data integration discourse analysis evaluation study feature extraction fuzzy logic fuzzy weighted zero inconsistency method health care practice large language model medical decision making medical education medical research multiatributive ideal real comparative analysis multicriteria decision analysis sensitivity analysis treatment outcome uncertainty artificial intelligence decision making human
spellingShingle	Artificial Intelligence Decision Making Fuzzy Logic Humans Article case study data analysis data integration discourse analysis evaluation study feature extraction fuzzy logic fuzzy weighted zero inconsistency method health care practice large language model medical decision making medical education medical research multiatributive ideal real comparative analysis multicriteria decision analysis sensitivity analysis treatment outcome uncertainty artificial intelligence decision making human Alamoodi A.H. Zughoul O. David D. Garfan S. Pamucar D. Albahri O.S. Albahri A.S. Yussof S. Sharaf I.M. A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
description	Artificial intelligence (AI) has become a crucial element of modern technology, especially in the healthcare sector, which is apparent given the continuous development of large language models (LLMs), which are utilized in various domains, including medical beings. However, when it comes to using these LLMs for the medical domain, there?s a need for an evaluation platform to determine their suitability and drive future development efforts. Towards that end, this study aims to address this concern by developing a comprehensive Multi-Criteria Decision Making (MCDM) approach that is specifically designed to evaluate medical LLMs. The success of AI, particularly LLMs, in the healthcare domain, depends on their efficacy, safety, and ethical compliance. Therefore, it is essential to have a robust evaluation framework for their integration into medical contexts. This study proposes using the Fuzzy-Weighted Zero-InConsistency (FWZIC) method extended to p, q-quasirung orthopair fuzzy set (p, q-QROFS) for weighing evaluation criteria. This extension enables the handling of uncertainties inherent in medical decision-making processes. The approach accommodates the imprecise and multifaceted nature of real-world medical data and criteria by incorporating fuzzy logic principles. The MultiAtributive Ideal-Real Comparative Analysis (MAIRCA) method is employed for the assessment of medical LLMs utilized in the case study of this research. The results of this research revealed that ?Medical Relation Extraction? criteria with its sub-levels had more importance with (0.504) than ?Clinical Concept Extraction? with (0.495). For the LLMs evaluated, out of 6 alternatives, (A4) ?GatorTron S 10B? had the 1st rank as compared to (A1) ?GatorTron 90B? had the 6th rank. The implications of this study extend beyond academic discourse, directly impacting healthcare practices and patient outcomes. The proposed framework can help healthcare professionals make more informed decisions regarding the adoption and utilization of LLMs in medical settings. ? The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
author2	57205435311
author_facet	57205435311 Alamoodi A.H. Zughoul O. David D. Garfan S. Pamucar D. Albahri O.S. Albahri A.S. Yussof S. Sharaf I.M.
format	Article
author	Alamoodi A.H. Zughoul O. David D. Garfan S. Pamucar D. Albahri O.S. Albahri A.S. Yussof S. Sharaf I.M.
author_sort	Alamoodi A.H.
title	A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
title_short	A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
title_full	A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
title_fullStr	A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
title_full_unstemmed	A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction
title_sort	novel evaluation framework for medical llms: combining fuzzy logic and mcdm for medical relation and clinical concept extraction
publisher	Springer
publishDate	2025
_version_	1825816217882984448
score	13.244413

A Novel Evaluation Framework for Medical LLMs: Combining Fuzzy Logic and MCDM for Medical Relation and Clinical Concept Extraction

Similar Items