PRO-CON DEBATE - CON: Artificial Intelligence is Not a Magic Pill

This Pro-Con Debate took place at the 2019 Stoelting Conference entitled “Patient Deterioration: Early Recognition, Rapid Intervention, and the End of Failure to Rescue.” The following two authors have expertise in adopting artificial intelligence for managing patients who are deteriorating in the hospital setting.

Artificial intelligence (AI) is supposed to hold the promise of curing many problems facing health care, such as predicting morbidity and mortality and outperforming physicians at diagnosis. In reality, despite increasing research, only a limited number of AI algorithms have been clinically validated. Even as the number of U.S. Food and Drug Administration-approved AI applications grows, implementation and widespread use of these applications have been challenging. Computer scientist Rodney Brooks described some of the challenges with AI predictions. These include overestimating or underestimating solutions, imagining magical algorithms, the scale of deployment, and performance limitations.¹,²

AI performance limitations are especially important in diagnostic AI solutions. Many researchers using artificial neural networks have claimed to improve diagnosis and outperform clinicians, as in the diagnosis of diseases visualized on chest X-rays.³ Often, these narrow-spectrum algorithms can detect lesions such as atelectasis or infiltrates on chest X-rays. Despite claims of high accuracy, however, these applications have been hard to replicate and generalize.⁴ In other approaches to machine learning, the computer algorithm learns from clinician-labeled data. In many publicly available chest X-ray datasets underpinning these algorithms, lesions are labeled by radiologists as infiltrate, mass, atelectasis, etc. These clinician assessments are considered the “gold standard,” but significant inter-rater differences have been noted,⁵ raising the specter of mislabeled datasets. Algorithms created from such mislabeled datasets are likely to have significant errors in their results, which can confound clinician decision-making.
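One common way to quantify the inter-rater differences mentioned above is Cohen's kappa, which corrects raw agreement for the agreement expected by chance. The sketch below is illustrative only: the function name `cohens_kappa` and the ten image labels are hypothetical and are not drawn from any of the cited datasets.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters labeling the same items."""
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("both raters must label the same non-empty set of items")
    # Fraction of items on which the two raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:  # both raters used a single identical label throughout
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels from two radiologists for the same ten chest X-rays.
radiologist_1 = ["infiltrate", "infiltrate", "atelectasis", "mass", "normal",
                 "normal", "infiltrate", "atelectasis", "normal", "normal"]
radiologist_2 = ["infiltrate", "atelectasis", "atelectasis", "mass", "normal",
                 "infiltrate", "infiltrate", "infiltrate", "normal", "normal"]

kappa = cohens_kappa(radiologist_1, radiologist_2)
print(f"raw agreement = 0.70, kappa = {kappa:.2f}")
```

In this made-up example the raters agree on 7 of 10 images, yet kappa is noticeably lower than 0.70 because some of that agreement would occur by chance; a model trained on either rater's labels inherits that uncertainty.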

AI-based prediction of disease is similarly problematic. In the research on prediction of acute kidney injury by Tomasev et al., prediction bias was introduced through the dataset itself. Their U.S. Veterans Affairs dataset contained only 6.4% female patients, and model performance in these patients was lower than in the rest.⁶ Bias continues to be a challenge even in administrative datasets and in solutions developed for use by health care executives or insurance companies. As demonstrated by Obermeyer et al., these biases can be introduced at the level of algorithm development, but can also stem from the dataset used or the way the algorithm is implemented.⁷ Such biased algorithms can lead to the delivery of improper, unsafe treatment to our patients.
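A simple check for the kind of dataset bias described above is to report performance separately for each subgroup rather than only in aggregate. The following is a minimal sketch, assuming binary labels (1 = event, 0 = no event); the function name `sensitivity_by_group` and the toy records are hypothetical and do not reproduce any cited study.

```python
from collections import defaultdict


def sensitivity_by_group(records):
    """Sensitivity (true-positive rate) per subgroup.

    records: iterable of (group, true_label, predicted_label) with 0/1 labels.
    Groups with no true events are omitted because sensitivity is undefined.
    """
    true_pos = defaultdict(int)
    false_neg = defaultdict(int)
    for group, y_true, y_pred in records:
        if y_true == 1:
            if y_pred == 1:
                true_pos[group] += 1
            else:
                false_neg[group] += 1
    groups = set(true_pos) | set(false_neg)
    return {g: true_pos[g] / (true_pos[g] + false_neg[g]) for g in groups}


# Hypothetical toy data: one subgroup is under-represented and the model
# misses more of its true events.
records = (
    [("male", 1, 1)] * 6 + [("male", 1, 0)] * 2 + [("male", 0, 0)] * 10
    + [("female", 1, 1)] * 2 + [("female", 1, 0)] * 2 + [("female", 0, 0)] * 3
)

per_group = sensitivity_by_group(records)
for group in sorted(per_group):
    print(f"{group}: sensitivity = {per_group[group]:.2f}")
```

An aggregate figure would hide the gap between the two groups in this toy data, which is why stratified reporting is worth requesting during validation.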

Indeed, poor predictive values continue to limit the adoption of well-researched AI algorithms. Results based on the “area under the curve” (a statistical measure of how well a model discriminates between patients with and without the outcome) have been widely used to report the accuracy of these algorithms. However, multiple other parameters should be considered, including sensitivity and positive predictive value. Without good predictive values and replicable results, AI algorithms are unlikely to be adopted by clinicians.⁸
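To illustrate why positive predictive value deserves attention alongside discrimination metrics, the sketch below applies Bayes' rule to a fixed sensitivity and specificity at several event rates. The operating point (90% sensitivity, 90% specificity) and the prevalences are hypothetical values chosen for illustration, not figures from the cited studies.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """Probability that a positive alert is a true event (Bayes' rule)."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Hypothetical operating point: the same model applied to populations
# in which the predicted event is progressively rarer.
for prevalence in (0.20, 0.05, 0.01):
    ppv = positive_predictive_value(0.90, 0.90, prevalence)
    print(f"prevalence = {prevalence:.2f}  PPV = {ppv:.2f}")
# prevalence = 0.20  PPV = 0.69
# prevalence = 0.05  PPV = 0.32
# prevalence = 0.01  PPV = 0.08
```

With the event present in only 1% of patients, roughly 11 of every 12 alerts from this hypothetical model would be false alarms, even though its sensitivity and specificity are unchanged.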

Scalability and generalizability of AI algorithms are another major challenge in health care. While electronic health records are the primary means of deploying many of these algorithms, poor interfaces, limited support for IT teams, and a lack of integrated solutions still limit the ease of adoption.

Marketing and hype created by some organizations have also had a negative impact and have cost AI credibility among many clinicians. Some of the well-researched breakthroughs have been hyped enormously to leverage the current market value associated with AI. In a survey of European startups claiming to use AI, conducted by the London venture capital firm MMC Ventures, 40% were not actually using AI as a part of their product.⁹

AI does hold the promise of delivering potentially safer solutions for health care using the ever-increasing volume of data in an efficient and reproducible manner. But realizing this potential requires clinician leadership and rigorous clinical validation while developing and deploying AI algorithms (Table 1).

Table 1: Solutions for Effective Deployment of AI in Health Care

Patient- and care-provider-centric—first do no harm

Clinician leadership

Rigorous model development and testing

Explainable or interpretable solutions: avoidance of the black box

Clinical validation for generalizability and scalability

Cost-effective solutions

We are still in the early phases of research and development of AI algorithms for health care. Clearly, the growth in AI has been exponential and the pace is likely to continue in the near future. We need to be prepared to dedicate clinical, information technology, and financial resources to see effective utilization of these remarkable algorithms. Clinicians, especially radiologists and oncologists, are already leading the development of many AI algorithms to avoid ill-prepared solutions creeping into their work environment. Anesthesia professionals and perioperative clinicians who have been early adopters of technology and live in a data-rich environment also need to lead research, development, and deployment of sustainable AI algorithms to provide safer care to our patients.

Dr. Mathur is a staff anesthesiologist/intensivist in the Department of General Anesthesiology and the quality improvement officer, Anesthesiology Institute, Cleveland Clinic, Cleveland, Ohio.

The author has no conflicts of interest to disclose.

References

1. Brooks R. MIT Technology Review. 2017. https://www.technologyreview.com/s/609048/the-seven-deadly-sins-of-ai-predictions/. Accessed December 9, 2019.
2. Panetta K. https://www.gartner.com/smarterwithgartner/5-trends-appear-on-the-gartner-hype-cycle-for-emerging-technologies-2019/. Accessed August 29, 2019.
3. Rajpurkar P, Irvin J, Ball RL, et al. Deep learning for chest radiograph diagnosis: a retrospective comparison of the CheXNeXt algorithm to practicing radiologists. PLoS Med. 2018;15:e1002686.
4. Zech JR, Badgeley MA, Liu M, et al. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study. PLoS Med. 2018;15:e1002683.
5. Oakden-Rayner L. Exploring large-scale public medical image datasets. Acad Radiol. 2019.
6. Tomasev N, Glorot X, Rae JW, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. 2019;572:116–119.
7. Obermeyer Z, Powers B, Vogeli C, et al. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366:447–453.
8. Ginestra JC, Giannini HM, Schweickert WD, et al. Clinician perception of a machine learning-based early warning system designed to predict severe sepsis and septic shock. Crit Care Med. 2019;47:1477–1484.
9. Olson P. Forbes. https://www.forbes.com/sites/parmyolson/2019/03/04/nearly-half-of-all-ai-startups-are-cashing-in-on-hype/#454f99e7d022. Accessed March 4, 2019.
