INTRODUCTION
Endoscopic ultrasound (EUS) is used for diagnosing pancreatic cystic neoplasms (PCNs) such as intraductal papillary mucinous neoplasms (IPMNs), mucinous cystic neoplasms (MCNs), and serous cystic neoplasms (SCNs) [1]. Because of the improved quality and ubiquitous use of EUS abdominal imaging, the number of incidental PCNs has risen substantially recently, with estimated prevalence ranging from 2.4-13.5% [2]. However, only a few subtypes of these cysts are considered precursor lesions of cancer. In clinical practice, accuracies for discriminating cystic subtypes of PCN using EUS range from 75 to 95%, and final diagnoses can only be made after histopathological analysis [3]. Importantly, PCN resection may dramatically reduce quality of life because it is associated with a 50% risk of complications and a 5% risk of death. Furthermore, it has been demonstrated that ~60% of surgeries are conducted unnecessarily on benign cysts [4]. Given this challenging situation, artificial intelligence (AI) might improve the accuracy of PCN classification. Many studies have been performed on this topic over recent years, and the majority reported the use of AI improved the differential diagnosis of PCL. In this review, the efficacy of EUS-based AI for the diagnosis of PCL is discussed.
ARTIFICIAL INTELLIGENCE
AI enables the analysis of huge numbers of training images and the identification of patterns corresponding to specific clinical features, which can then be used to make diagnoses. Many different algorithms have been devised for this purpose and range in complexity from machine learning (ML) to deep learning (DL) [5].
MACHINE LEARNING
ML is a core discipline of AI and utilizes algorithms that detect patterns within existing data and then uses these patterns to ‘train’ itself to make predictions based on new data. Simple predictive models, such as logistic regression, have been used since the 1960s, but recent decades have witnessed the development of neural network algorithms that predict the risks for various cancers, including breast, lung, and colon cancer. ML uses mathematical algorithms to analyze input values and predict output values without being explicitly programmed to do so. These algorithms are designed to derive different predictive models using a training process during which parameters of an artificial neural network (ANN) model are estimated [6]. The greatest advantage of ML is that it can integrate vast amounts of data and combine observed and predicted quantities in nonlinear and highly interactive ways [7]. ML can be broadly classified based on the type of learning adopted, such as supervised, unsupervised, semi-supervised, or reinforcement learning. Examples of ML include linear discriminants, Bayesian networks, random forest, and support vector machines. Each ML method requires training with input and outcome datasets. In everyday life, ML is used for email spam filtering, online recommendation systems for movies or books, and weather prediction models.
DEEP LEARNING
DL differs from ML in that it can handle much larger datasets and has a neural-inspired architectural design. ANNs are AI algorithms that mimic human neurons. Each ANN has an input layer that receives a signal and an output layer that categorizes input signals. Just as neurons weigh action potentials to determine whether to propagate a signal, hidden layers between the input and output layers of ANNs weigh data characteristics and input from other neurons to determine outputs. DL systems are composed of multiple ANNs and can perform all the above-mentioned ML applications but are more complex, can provide decision-making capabilities, and handle extremely large datasets. DL algorithms have been applied to numerous specialties in the medical field. Its most prominent applications involve detecting abnormalities or diseases based on radiologic and pathologic data [8]. DL can accomplish many tasks, such as identifying the location of a lesion of interest and generating differential diagnoses. Furthermore, convoluted neural networks (CNNs) were recently reported to outperform gastroenterologists at identifying pathologies in endoscopic images [9]. CNNs differ from traditional neural networks because they incorporate many different types of layers, such as convolutional layers and pooling layers [10]. CNNs use convolutional layers to extract features from input signals and identify important features. Pooling layers minimizes network size and maintain computer systems at computational thresholds. CNNs are much better at classifying large images than traditional neural networks.
EUS-BASED ARTIFICIAL INTELLIGENCE AND THE MANAGEMENT OF CYSTIC PANCREATIC LESIONS
Increased EUS availability has enabled pancreatic cyst surveillance. EUS provides more granularity and pancreatic images with better spatial resolutions than other imaging modalities, including computed tomography and magnetic resonance imaging, because of transducer proximity to pancreas [11]. However, the learning curve of EUS is protracted, and thus to acquire competence in all aspects of EUS, it is suggested that clinicians perform 150 supervised cases. Thus, insufficient operator experience can result in misdiagnoses, and even experienced operators can be affected by fatigue due to the durations of some EUS procedures. In practice, the detection of small lesions by EUS is challenging, which makes tumor staging and differential diagnoses difficult because the sonographic appearances of many benign and malignant cystic lesions are similar. Moreover, the increased diagnostic accuracy of EUS-fine needle aspiration (FNA) cystic fluid analysis for differential diagnosis or prognosis prediction is also a concern. Given the amount of information provided by EUS procedures not perceived by the eye and beyond human interpretation, clinicians are developing specific ML and DL algorithms to aid the diagnosis and risk stratification of PCL. Unfortunately, most studies on the use of AI in PCN have used computer tomographic or MRI data, and few have investigated the use of EUS-based AI systems.
1. The application of artificial intelligence to pancreatic cyst differentiation
The first EUS-based AI evaluation of pancreatic lesions was performed in 2001. In this study, the program differentiated focal pancreatitis and malignant in 35 patients with a specificity, sensitivity, and accuracy of 50, 100, and 80%, respectively, versus histopathologic results, which compared with an accuracy of 89% achieved by two endosonographers [12]. This study represented a monumental step in the use of AI, though, at the time, AI-based interpretations were no better than human interpretations. In the early days, ML techniques were mainly used in the AI algorithms used to analyze EUS images. However, thanks to advancements in AI, DL concepts such as CNN are now being applied to assist the differential diagnosis of PCL using EUS images. Nguon et al. [9] developed DL-based algorithms for the differentiation of the MCN and SCN, and analyzed the EUS images of 109 patients (60 MCN, 49 SCN); MCN was defined based on histologic results obtained by EUS-FNA or surgery. An accuracy of up to 82.75% was achieved for differentiating MCN and SCN with an area under the receiver operating curve of 0.88 [9]. Vilas-Boas et al. [13] developed a DL-based diagnostic system for differentiating mucinous and non-mucinous cysts using histologic or cystic fluid study results as the gold standard. Mucinous cysts were defined to have calcinoembryonic antigen fluid levels of >192 ng/mL, glucose levels <50 mg/dL, or mucinous epithelial cells by cytology. EUS images from 28 pancreatic cysts were used to train the model, and overall accuracy, sensitivity, and specificity were 98.5, 98.3, and 98.9%, respectively. However, given their various morphologies and heterogenous echogenicities, PCNs have been difficult to assess using AI until recently.
2. The application of artificial intelligence to pancreatic cystic neoplasms malignant risk stratification
AI helps clinicians stratify PCN malignant risk by identifying deeper characteristics in EUS images. Similarly, EUS-delineated cyst morphology has been utilized to develop AI models that differentiate dysplasia in PCNs. In 2019 a retrospective study was conducted using a DL algorithm to analyze 3790 EUS images of IPMN patients with the aim of evaluating the ability of the algorithm to differentiate malignant potential in patients with a pathological diagnosis that underwent resection. The DL program returned malignant potentials of 95.7, 96.2, and 94% for the sensitivity, specificity, and accuracy, respectively [14]. Schultz et al. [15] developed CNNs to differentiate IPMNs with low- or high-grade dysplasia. EUS images were collected from 43 patients with IPMN, as determined by histopathology of surgical specimens. CNN models were used to analyze EUS images, and the algorithm competently classified advanced neoplasia with a surprising accuracy of 99.6%, which was considerably higher than international guidelines (American Gastroenterology Association, American College of Gastroenterology, and Fukuoka guidelines); international guidelines had accuracies ranging from 51.8 to 70.3% for this cohort [15]. The results of AI studies on the differentiation of pancreas cyst and risk stratification are summarized in Table 1.
3. EUS-guided needle-based confocal laser endomicroscopy artificial intelligence applications
Confocal laser endomicroscopy (CLE) is a new technique that couples cystic epithelial tissue illumination with real-time, high-resolution microscopic imaging and allows endoscopists to obtain real-time optical biopsies of PCNs in multiple locations. Major trials have established reference standards and safety profiles for the EUS-guided needle-based CLE (EUS-nCLE) of PCNs. This technique provides significant benefits for the differentiation of IPMNs and other PCNs. In addition, certain characteristics of EUS-nCLE enable the prediction of advanced neoplasia, such as high-grade dysplasia or carcinoma, in IPMNs. However, EUS-nCLE video files are often difficult to interpret because they are typically large with multiple frames, which results in interobserver variations. AI provides the ability to interpret such large amounts of data and detect advanced neoplasia. Furthermore, AI algorithms have been used to stratify EUS-nCLE images. For example, Machicado et al. [16] developed a CNN algorithm to risk-stratify IPMNs. EUS-nCLE video frames from 35 patients with histologically proven IPMNs were used to design a CNN model that measured papillary epithelial thickness and extracted nCLE features for risk stratification. The algorithm had higher accuracy (85.7%), sensitivity (83.3%), and specificity (88.2%) than those required by international guidelines.
4. The applications of artificial intelligence for cystic fluid analysis
AI algorithms have been used indirectly to identify biomarkers in cystic fluid associated with high-grade dysplasia in IPMN. Current clinical guidelines cannot accurately identify patients with IPMN of the pancreas at high risk of pancreatic cancer; thus, a bio-signature that can accurately predict IPMN with high malignant potential is required. Maker et al. [17] used Lasso-penelized logistic regression to analyze the cystic fluid levels of various mRNAs and miRNAs and of KRAS and GNAS in 59 IPMN patients to identify an optimal bio-signature for high-grade dysplasia in IPMN. They found that IL1β, MUC4, and prostaglandin E synthase 2 mRNA levels most accurately differentiated high- and low-risk IPMN with an area under the curve of up to 0.86 (p=0.02). However, few studies have addressed the diagnostic or risk stratification potentials of AI-based genomic studies.
5. Application of artificial intelligence to pancreatic cystic neoplasms treatment decision-making
PCNs are common and often pose a management dilemma because some cysts are precancerous, whereas others have little risk of developing into invasive cancers. Thus, the accurate differentiation of low-risk PCNs and high-risk PCLs is essential to prevent unnecessary pancreatic surgery and for the appropriate management of high-risk patients. AI has demonstrated its ability to assist clinician decision-making by integrating clinical characteristics, imaging features, and genomic analysis results. Springer et al. [18] applied ML to develop the CompCyst algorithm to differentiate patients requiring surgery, routine monitoring, or not requiring further surveillance. CompCyst was trained using the clinical features, imaging characteristics, and the cystic fluid genetic and biochemical markers of 436 patients with pancreatic cyst and then tested it in an independent cohort of 426 patients using histopathology as the gold standard. Clinical management informed by CompCyst was more accurate than management determined using conventional clinical and imaging criteria. In particular, in a subset of patients without advanced neoplasia (n=140), CompCyst recommended surveillance in 68%, whereas standard of care recommended surveillance in 34% (p=0.02). Thus, compliance with CompCyst results would have spared more than half of the patients (60%) that underwent cyst resection. However, standard management and CompCyst similarly identified lesions requiring surgery with high accuracy (89 and 91%, respectively).
CONCLUSIONS
The use of AI is in its infancy in the medical field, and much effort will be required from clinicians to develop algorithms suitable for clinical practice. New modalities based on the use of AI tools will undoubtedly allow patient risk stratification for malignancy, improve the management of PCNs, and enable clinicians to diagnose pancreas cysts more accurately.