- Research
- Open access
- Published:
Comparison of AI applications and anesthesiologist’s anesthesia method choices
BMC Anesthesiology volume 25, Article number: 2 (2025)
Abstract
Background
In medicine, Artificial intelligence has begun to be utilized in nearly every domain, from medical devices to the interpretation of imaging studies. There is still a need for more experience and more studies related to the comprehensive use of AI in medicine. The aim of the present study is to evaluate the ability of AI to make decisions regarding anesthesia methods and to compare the most popular AI programs from this perspective.
Methods
The study included orthopedic patients over 18 years of age scheduled for limb surgery within a 1-month period. Patients classified as ASA I-III who were evaluated in the anesthesia clinic during the preoperative period were included in the study. The anesthesia method preferred by the anesthesiologist during the operation and the patient’s demographic data, comorbidities, medications, and surgical history were recorded. The obtained patient data were discussed as if presenting a patient scenario using the free versions of the ChatGPT, Copilot, and Gemini applications by a different anesthesiologist who did not perform the operation.
Results
Over the course of 1 month, a total of 72 patients were enrolled in the study. It was observed that both the anesthesia specialists and the Gemini application chose spinal anesthesia for the same patient in 68.5% of cases. This rate was higher compared to the other AI applications. For patients taking medication, it was observed that the Gemini application presented choices that were highly compatible (85.7%) with the anesthesiologists’ preferences.
Conclusion
AI cannot fully master the guidelines and exceptional and specific cases that arrive in the course of medical treatment. Thus, we believe that AI can serve as a valuable assistant rather than replacing doctors.
Introduction
In medicine, AI has begun to be utilized in nearly every domain, from medical devices to the interpretation of imaging studies. AI can analyze large amounts of data in a short time and provide better recommendations than humans analyzing the same data [1]. One of the early AI applications, ChatGPT, was launched in 2022. With a vocabulary of over 300 billion words, it possesses extensive knowledge on numerous subjects. In its first month of availability, ChatGPT reached 57 million users, and by January 2023, its user base exceeded 100 million [2]. The use of ChatGPT has facilitated rapid and practical access to information [3]. In the field of anesthesia, robots, software, and AI are currently being utilized for various purposes. AI is predominantly used for drug infusions, sedation applications, the monitoring of anesthesia depth, ultrasound guidance, and pain management [4]. Additionally, AI provides extra benefits such as increasing efficiency and reducing costs [5, 6]. Studies have also demonstrated that ChatGPT possesses sufficient knowledge to pass the United States Medical Licensing Examination (USMLE) [7, 8].
AI technology is now being frequently used in fields such as plastic surgery, oncology, pulmonology, cardiology, orthopedics, hepatology, and neurology [9]. Studies have shown that AI is particularly beneficial in imaging and diagnostic processes in these fields. AI-assisted systems approved by the US Food and Drug Administration have begun to be used to locate congenital heart pathologies [10]. In other areas of medicine, however, concerns about ethics and safety persist due to the insufficient quality and quantity of relevant academic studies [11].
The potential future use of AI in patient management and its adequacy in this field are subjects of great interest. There are also fears that AI might eventually replace physicians, though the prevailing view is that AI will complement, rather than replace, the human practice of medicine [12]. There is still a need for more experience and more studies related to the comprehensive use of AI in medicine. The aim of the present study is to evaluate the ability of AI to make decisions regarding anesthesia methods and to compare the most popular AI programs from this perspective.
Materials and methods
This study procedures were followed in accordance with the ethical standards of the responsible committee on human experimentation and with the Helsinki Declaration of 1975. The study was approved by The Ethics Committee of Mardin Artuklu University on February 13, 2024 (No. 2024/2–40). The researchers assessed the patients’ eligibility and secured their written consent before the procedure. The study included orthopedic patients over 18 years of age scheduled for extremity surgery within a 1-month period. Patients classified as ASA I-III who were evaluated in the anesthesia clinic during the preoperative period were included in the study. Patients directed by the surgeon regarding the choice of anesthesia method, as well as cases in which the patient’s preference conflicted with the anesthesiologist’s recommendation, were excluded from the study. Additionally, patients were excluded if the preferred anesthesia method was changed during the course of the operation (Fig. 1).
The anesthesia method preferred by the anesthesiologist during the operation and the patient’s demographic data, comorbidities, medications and surgical history were recorded. The obtained patient data were discussed as if presenting a patient scenario using the free versions of the ChatGPT, Copilot and Gemini applications by a different anesthesiologist who did not perform the operation. It was noted that the AI applications’ recommendations were not to be considered as medical consultations. Personal data were not shared with the AI applications; the shared data included the patient’s age, comorbidities, planned surgery, medications and surgical history. The preferences of the anesthesiologist were compared with the preferences suggested by the three AI applications in the context of the patient scenarios. Data for each patient were recorded.
The AI programs were instructed to recommend only one anesthesia method based on the patient information provided. The results were recorded and compared with the preferences of the anesthesiologist. Answers were recorded impartially, without any guidance. In cases where the AI applications suggested anesthesia methods using general terms, such as “brachial plexus block,” they were asked to provide more specific responses. Those detailed responses were then recorded.
Statistical analysis
Sample size calculation was performed using G-Power version 3.1.9.7 program (Universität Kiel, Germany). In calculation; For the chi-square test, effect size (w) was determined as 0.5, significance level (α) was 0.05, power (1 − β) was 0.95 and degree of freedom (Df) was 3. According to these parameters, the minimum required number of patients was calculated as 69 people [13].
The obtained data were analyzed using IBM SPSS Statistics 26. Categorical data were presented as numbers and percentages, while continuous data were presented as means and standard deviations. The chi-square test and Fisher’s exact test were used for statistical analysis. Values of p < 0.05 were considered statistically significant.
Results
Over the course of 1 month, a total of 72 patients were enrolled in the study. Of these patients, 35 (48.6%) were male and 37 (51.4%) were female. The average age of the patients was 47.86 ± 17.7 years. Twenty-two patients (30.6%) had comorbidities such as diabetes and hypertension. In 57 cases (79.2%), the surgical site was a lower extremity. Anesthesia specialists most commonly preferred the spinal anesthesia method, which was used in 75.0% of the cases.
The anesthesia specialists opted for general anesthesia for 4 patients, whereas ChatGPT selected general anesthesia for only 1 patient (Table 1). Like the anesthesia specialists, who preferred spinal anesthesia most frequently (61.1%), ChatGPT chose spinal anesthesia in 50% of the cases. The rate of selecting general anesthesia for the same patient was observed to be 100% with both Gemini and CoPilot.
Among all patients, the highest preference for general anesthesia occurred with the CoPilot application, with this option being chosen for 42 patients (58.3%) (Table 1).
It was observed that both the anesthesia specialists and the Gemini application chose spinal anesthesia for the same patient in 68.5% of cases. This rate was higher compared to the other AI applications (Table 1).
The preference for anesthesia methods showed 70.2% agreement between the Gemini application and the anesthesiologists for patients undergoing lower extremity surgery, while this agreement decreased to 40% for upper extremity surgeries (p = 0.03) (Table 2). In contrast, the CoPilot application’s choice of anesthesia method for upper extremity surgeries was found to be 93.3% discordant with the anesthesiologists’ preferences (Table 2). Overall, when considering all patients, CoPilot had the highest level of discordance (70.8%) with the anesthesiologists’ choices (Table 3).
For patients taking medication, it was observed that the Gemini application presented choices that were highly compatible (85.7%) with the anesthesiologists’ preferences (Table 4). Additionally, it was noted that the rate at which both the Gemini and CoPilot applications preferred general anesthesia for the same patient was 23.6% (17 patients) (Table 5).
Discussion
In this study evaluating the ability of AI applications to analyze patient information and determine the appropriate anesthesia method, it was observed that these applications often made choices close to those of anesthesiologists. Specifically, both the Gemini and CoPilot applications chose general anesthesia in 100% of cases in which anesthesiologists did the same. Furthermore, the Gemini application demonstrated a high concordance rate of 85.7% with anesthesiologists’ preferences for patients who were taking medication, indicating a robust capability to align with expert human decisions in specialized medical settings.
Clinicians’ confidence in AI is likely to increase as they understand its potential and limitations, especially when they observe its successful implementation in real-world scenarios [10]. In their review, Singh and Nath noted that allowing robots to perform routine typical procedures can save anesthesiologists time in critical situations, thereby enhancing their ability to think and make decisions. Additionally, studies have addressed instances in which robots have performed intubations in limited patient series or for simulated patients, or in which nerve recognition software-supported ultrasounds have been used to administer block procedures [14, 15].
Although there are publications related to drug infusion systems, pain management, and even intubation-performing robots in the literature, we did not encounter any studies specifically addressing AI technologies that determine anesthesia management in our literature review. Our study provides examples from current practice to increase confidence in AI. As noted in the reviews conducted by Singh and Nath and Bellini et al., the impact of AI in clinical practice generally remains confined to the level of digital display data. There is an ongoing need for more studies that directly explore real-life applications, as demonstrated by our research [14, 16].
In line with the objectives of our study, the review conducted by Lopes et al. emphasized that, despite the rapid AI advancements in the field of medicine, there is still a lack of clinical applications in anesthesia practice [17]. In many AI-related studies in the field of anesthesia, factors that an anesthesiologist could predict without the need for an extensive literature review have been considered, such as calculating the Cormack-Lehane score from human face photographs. In our planning of the present study, we aimed to analyze the results of AI analysis of patient data. For assessments of patients undergoing upper extremity surgery, we found that ChatGPT considered alternatives such as general anesthesia, axillary block, supraclavicular block, and interscalene block and made choices that were 80% consistent with the anesthesiologists’ preferences. This indicates significant potential for AI to support decision-making in anesthesia, with outcomes aligning closely with expert human judgments. AI can quickly select the most appropriate anesthesia method from among numerous alternatives, which is a substantial benefit. The fact that AI applications yield objective and emotion-free choices while considering anesthesia methods could also be a reason for their preference. Kambale and Jadhaw predicted that AI will contribute to the standardization of anesthesia management and the reduction of human errors. This capability highlights AI’s potential to enhance the efficiency and safety of medical procedures by leveraging consistent data-driven decision-making processes [18]. This leads us to suggest that, in the future, anesthesiologists could use AI applications to quickly review and validate their choices.
AI technology is currently being utilized in the field of anesthesia for critical tasks such as monitoring anesthesia depth or pain control and predicting adverse events [19]. The ability of AI applications to predict possible complications may put them a step ahead compared to physicians. The advantage of such AI applications lies in their ability to access and analyze far greater volumes of data in a short time compared to humans, and in a practical way. In the present study, AI was able to suggest anesthesia methods within seconds for patient scenarios. However, it did not respond according to current guideline practices regarding the use of regional anesthesia in patients using anticoagulants. It is clear that AI needs to be developed for specialized patient groups such as those with medication use and surgical history. Due to its incomplete mastery of guidelines and exceptional or specific situations, the prevailing view of AI at present is that it can be a good assistant for doctors [9, 10, 14, 16].
Singh and Nath noted in their review that AI is particularly beneficial in special cases, such as those of patients with rare diseases [14]. In surgeries where drug infusions are computer-assisted through closed-loop systems, the analysis of anesthesia depth and hemodynamic data has allowed the maintenance of vital parameters within narrower ranges. Joosten et al. demonstrated that patients using a computer-assisted closed-loop drug infusion system exhibited improved cognitive functions in the postoperative period [20]. In Hemmerling’s study, it was indicated that closed-loop anesthetic drug infusion systems under the control of an anesthesiologist could become routine in the future [21].
In our study, although AI was able to suggest anesthesia methods based on shared patient information, it was concluded that further development is needed regarding adherence to the current guidelines and the management of specific patient groups. Furthermore, while no statistical differences were found among the AI applications in terms of their approaches to patients, low-level differences were observed. The Copilot application chose general anesthesia more frequently than other applications. It was noted that for patients with pulmonary pathologies such as asthma or COPD, general anesthesia was sometimes recommended without querying vital parameters like the patient’s saturation value. Özsahin analyzed the chosen anesthetic agents and their alternatives while using various algorithms with the Fuzzy PROMETHEE application, emphasizing the necessity of expert support [22].
This study has several limitations due to its single-center design, the inclusion of patients within only a 1-month period, and the exclusion of patients who declined the anesthesiologist’s preference. The exclusion of patients who did not consent to the anesthesiologist’s choices and the lack of pediatric age groups prevent us from reflecting a broader population with our findings. Furthermore, as a potential limitation of the present study, only free versions of AI programs were used.
AI cannot fully master the guidelines and exceptional and specific cases that arrive in the course of medical treatment. Thus, we believe that AI can serve as a valuable assistant rather than replacing doctors.
Data availability
Data availability The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Roppelt JS, Kanbach DK, Kraus S. Artificial intelligence in healthcare institutions: a systematic literature review on influencing factors. Technol Soc. 2024;76:102443.
Eroğlu Çakmakoğlu E. The place of ChatGPT in the future of dental education. J Clin Trials Exp Investig. 2023;2(3):121–9. Available from: https://jctei.com/index.php/jctei/article/view/56. [cited 2024 May 22].
Levy J, Madrigal E, Vaickus L. Editorial: artificial intelligence: applications in clinical medicine. Front Med Technol. 2023;9(5):1206969.
Zaouter C, Joosten A, Rinehart J, Struys MMRF, Hemmerling TM. Autonomous systems in anesthesia: where do we stand in 2020? A narrative review. Anesth Analg. 2020;130(5):1120–32.
Cascella M, Tracey MC, Petrucci E, Bignami EG. Exploring artificial intelligence in anesthesia: a primer on ethics, and clinical applications. Surgeries. 2023;4(2):264–74.
Hashimoto DA, Witkowski E, Gao L, Meireles O, Rosman G. Artificial intelligence in anesthesiology: current techniques, clinical applications, and limitations. Anesthesiology. 2020;132(2):379–94.
Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, Chartash D. How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The implications of large language models for medical education and knowledge assessment. JMIR Med Educ. 2023;8(9):e45312.
Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, Madriaga M, Aggabao R, Diaz-Candido G, Maningo J, Tseng V. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLOS Digit Health. 2023;2(2):e0000198.
TerKonda SP, TerKonda AA, Sacks JM, Kinney BM, Gurtner GC, Nachbar JM, Reddy SK, Jeffers LL. Artificial intelligence: singularity approaches. Plast Reconstr Surg. 2024;153(1):204e–17e.
Canales C, Lee C, Cannesson M. Science without conscience is but the ruin of the soul: the ethics of big data and artificial intelligence in perioperative medicine. Anesth Analg. 2020;130(5):1234–43.
Bitkina OV, Park J, Kim HK. Application of artificial intelligence in medical technologies: a systematic review of main trends. Digit Health. 2023;9:20552076231189332.
Briganti G, Le Moine O. Artificial intelligence in medicine: today and tomorrow. Front Med (Lausanne). 2020;7:27.
Faul F, Erdfelder E, Lang AG, Buchner A. G*Power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods. 2007;39:175–91. https://doiorg.publicaciones.saludcastillayleon.es/10.3758/bf03193146.
Singh M, Nath G. Artificial intelligence and anesthesia: a narrative review. Saudi J Anaesth. 2022;16(1):86–93.
Singhal M, Gupta L, Hirani K. A comprehensive analysis and review of artificial intelligence in anaesthesia. Cureus. 2023;15(9):e45038.
Bellini V, Rafano Carnà E, Russo M, Di Vincenzo F, Berghenti M, Baciarello M, Bignami E. Artificial intelligence and anesthesia: a narrative review. Ann Transl Med. 2022;10(9):528.
Lopes S, Rocha G, Guimarães-Pereira L. Artificial intelligence and its clinical application in anesthesiology: a systematic review. J Clin Monit Comput. 2024;38(2):247–59.
Kambale M, Jadhav S. Applications of artificial intelligence in anesthesia: a systematic review. Saudi J Anaesth. 2024;18(2):249–56.
Keles H. Artificial intelligence applications in medicine. KÜ Tıp Fak Derg. 2022;24:604–13.
Joosten A, Rinehart J, Bardaji A, Van der Linden P, Jame V, Van Obbergh L, Alexander B, Cannesson M, Vacas S, Liu N, Slama H, Barvais L. Anesthetic management using multiple closed-loop systems and delayed neurocognitive recovery: a randomized controlled trial. Anesthesiology. 2020;132(2):253–66.
Hemmerling TM. Robots will perform anesthesia in the near future. Anesthesiology. 2020;132(2):219–20.
Ozsahin I. Identifying a personalized anesthetic with fuzzy PROMETHEE. Healthc Inform Res. 2020;26(3):201–11.
Acknowledgements
Unfortunately, no clinical trial number was obtained for our study. Human Ethics and consent to participate. This study was approved by The Ethics Committee of Mardin Artuklu University on February 13, 2024 (No. 2024/2-40). The researchers assessed the patients’ eligibility and secured their written consent before the procedure. Written informed consent about the study protocol was obtained from each patient preoperatively.
Funding
This research received no external funding.
Author information
Authors and Affiliations
Contributions
Research concept and design: EÇ, HA, İT Data analysis and interpretation: EÇ, MA Collection and/or assembly of data: EÇ, MAT Writing the article: EÇ, HA Critical revision of the article: EÇ, MK, İT, HA Final approval of the article: EÇ, İT, HA All authors read and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study was approved by The Ethics Committee of Mardin Artuklu University (13.02.2024—IRB:2024/2–40). And conducted in accordance with the Helsinki Declaration of 1975.
Patients provided written informed consent to participate in the study.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Çelik, E., Turgut, M.A., Aydoğan, M. et al. Comparison of AI applications and anesthesiologist’s anesthesia method choices. BMC Anesthesiol 25, 2 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12871-024-02882-2
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12871-024-02882-2