Authors: Angela M. Barbara, Hannah Loshak
AUC
area under the curve
BTS
British Thoracic Society
CHEST
American College of Chest Physicians
ICER
incremental cost-effectiveness ratio
IQR
interquartile range
LDCT
low-dose computed tomography
Lung-RADS
Lung Imaging Reporting and Data System
NLST
National Lung Screening Trial
PanCan
Pan-Canadian Early Detection of Lung Cancer
QALY
quality-adjusted life-year
SSN
subsolid nodule
Evidence of variable quality from 6 diagnostic test accuracy studies indicates that the Pan-Canadian Early Detection of Lung Cancer (PanCan) model may perform better at determining which lung nodules identified by low-dose CT are cancerous compared to the Lung Imaging Reporting and Data System. However, evidence from 3 other studies, also of variable quality, suggests that the risk calculators have similar diagnostic test accuracy.
No studies were identified that compared the clinical utility of PanCan versus the Lung Imaging Reporting and Data System.
Results from 2 economic evaluations were inconsistent about the cost-effectiveness of the 2 lung cancer risk models. However, each study applied the models to different types of lung nodules.
One evidence-based guideline recommended that PanCan be used in the UK for initial risk assessment and for the management of lung nodules.
In 2021, lung cancer represented 13% of new cancer cases and 25% of cancer deaths in Canada.1 The 5-year lung cancer–specific survival rate for the 2015 to 2017 time frame was 19% in males and 26% in females.1 Low-dose CT (LDCT) screening leads to earlier detection of lung cancer and therefore improving survival compared to usual care.2 Two screening protocols are available to estimate the risk of lung cancer and guide the management of lung nodules identified by first (baseline) LDCT screening:
the Lung Imaging Reporting and Data System (Lung-RADS)
the Pan-Canadian Early Detection of Lung Cancer (PanCan).
Lung-RADS was developed by the American College of Radiology3 to standardize the reporting and management of lung nodules. Modelled after the American College of Radiology Breast Imaging Reporting and Data System, the BI-RADS,4 Lung-RADS categorizes the risk of cancer based on 3 primary nodule characteristics: size, type, and growth rate. The nodule type is defined by its solidity. Solid nodules are homogeneous and obscure the lung parenchyma, whereas subsolid nodules have sections that are solid and non-solid nodules have no solid parts. Subsolid nodules can be pure ground-glass nodules, which appear opaque or hazy on scans, or part-solid nodules, which contain both solid and ground-glass components.5 The Lung-RADS categories indicate an increasing likelihood of malignancy, where 1 means negative; 2 means benign; 3 means probably benign; 4A means suspicious, with 5% to 15% probability of clinically active cancer in the next year; 4B means very suspicious, with more than 15% probability of cancer in the next year; and 4X means very suspicious but not otherwise specified. The category classifications and management recommendations associated with those classifications were updated in 2019 to version 1.1 based on empirical evidence and clinical experience.6
PanCan was developed by McWilliams et al. (2013)7 and is also referred to as the Brock University model or the Vancouver risk calculator for the locations of its conception. To generate a probability of having lung cancer on a continuous scale, PanCan utilizes patient characteristics (age, sex, family history of lung cancer, having emphysema) and nodule characteristics (size, type, location, number of nodules, and signs of spiculation). A nodule risk index of less than 1.5% means normal finding, 1.5% to 5% means low risk of malignancy, 6% to 30% means moderate risk of malignancy, and greater than 30% means high risk of malignancy. The PanCan results will determine if a person should undergo further diagnostic testing, such as annual CT screening.
The parameters of the 2 models were mathematically derived from screening data. PanCan was validated using data from chemoprevention trials of BC Cancer7 and the Danish Lung Cancer Screening Trial.8
Because lung cancer screening is a process and not a diagnostic test, there is no “gold standard” per se. Rather, the results of the screening algorithms will determine the follow-up interval(s) and further diagnostic testing. For example, when the PanCan equation predicts a greater than 10% risk of a nodule being malignant, PET or CT is then used as additional diagnostic testing.8 Another example is that Lung-RADS recommends follow-up LDCT, chest CT or PET, and/or tissue sampling depending on the probability of malignancy and comorbidities for category 4B nodules.6
While both protocols for nodule classification and subsequent management are available and in use in Canada, neither is universally accepted. The objective of the current review was to evaluate the evidence regarding the comparative diagnostic accuracy, clinical utility, and cost-effectiveness of Lung-RADS compared to PanCan for patients at high risk of lung cancer undergoing screening with LDCT to identify malignant lung nodules. Additionally, evidence-based guidelines regarding the use of either Lung-RADS or PanCan were sought.
What is the comparative diagnostic accuracy of the Lung Imaging Reporting and Data System versus the Pan-Canadian Early Detection of Lung Cancer nodule risk calculation for the identification of malignant lung nodules in patients at high risk of lung cancer undergoing screening with low-dose CT?
What is the comparative clinical utility of the Lung Imaging Reporting and Data System versus the Pan-Canadian Early Detection of Lung Cancer nodule risk calculation for the identification of malignant lung nodules in patients at high risk of lung cancer undergoing screening with low-dose CT?
What is the comparative cost-effectiveness of the Lung Imaging Reporting and Data System versus the Pan-Canadian Early Detection of Lung Cancer nodule risk calculation for the identification of malignant lung nodules in patients at high risk of lung cancer undergoing screening with low-dose CT?
What are the evidence-based guidelines describing use of the Lung Imaging Reporting and Data System and/or the Pan-Canadian Early Detection of Lung Cancer nodule risk calculation for the identification of malignant lung nodules in patients at high risk of lung cancer undergoing screening with low-dose CT?
A limited literature search was conducted by an information specialist on key resources including MEDLINE, the Cochrane Database of Systematic Reviews, the international HTA database, the websites of Canadian and major international health technology agencies, as well as a focused internet search. The search strategy comprised both controlled vocabulary, such as the National Library of Medicine’s MeSH (Medical Subject Headings), and keywords. The main search concepts were the Lung Imaging Reporting and Data System and the Pan-Canadian Early Detection of Lung Cancer nodule risk calculation. No filters were applied to limit the retrieval by study type. Where possible, retrieval was limited to the human population. The search was also limited to English-language documents published between January 1, 2011 and November 4, 2021. A second search was done for low-dose CT and lung cancer screening, with CADTH-developed search filters applied to limit retrieval to guidelines. The second search was also limited to English-language documents published between January 1, 2016 and November 4, 2021.
One reviewer screened citations and selected studies. In the first level of screening, titles and abstracts were reviewed and potentially relevant articles were retrieved and assessed for inclusion. The final selection of full-text articles was based on the inclusion criteria presented in Table 1.
Criteria | Description |
---|---|
Population | Q1 to Q4: Patients at high risk of lung cancer undergoing screening with low-dose CT to identify malignant lung nodules |
Intervention | Q1 to Q3: Lung-RADS Q4: Lung-RADS and/or PanCan |
Comparator | Q1 to Q3: PanCan Q4: Not applicable |
Reference standard | Q1: Confirmed lung cancer diagnosis; i.e., as determined by biopsy/histology, pathology, surgery, bronchoscopy, or other follow-up diagnostic procedure Q2 to Q4: Not applicable |
Outcomes | Q1: Comparative diagnostic test accuracy; e.g., positive and negative predictive value, sensitivity (effectiveness in identifying all cases of malignant nodules and lung cancer), specificity (effectiveness in accurately identifying malignant nodules and cases of lung cancer) Q2: Comparative clinical utility; e.g., benefits and harms to patients, including time to treatment, impact on quality of life, feasibility of screening test, adverse events from the screening, incidental findings Q3: Comparative cost-effectiveness; e.g., quality-adjusted life-years, costs per unit of health benefit Q4: Recommendations regarding the use of either Lung-RADS and/or PanCan; e.g., which screening method is optimal; guidance as to which intervention is preferable in particular patient populations, settings, contexts; clinical and other considerations when using either screening method |
Study designs | Health technology assessments, systematic reviews, randomized controlled trials, non-randomized studies, economic evaluations, evidence-based guidelines and recommendations |
Lung-RADS = Lung Imaging Reporting and Data System; PanCan = Pan-Canadian Early Detection of Lung Cancer.
Articles were excluded if they did not meet the selection criteria outlined in Table 1, if they were duplicate publications, or if they were published before 2011. Guidelines with unclear methodology were also excluded.
The included publications were critically appraised by 1 reviewer using the following tools as a guide: the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) checklist9 for diagnostic test accuracy studies, the Drummond checklist10 for economic evaluations, and the Appraisal of Guidelines for Research & Evaluation (AGREE) II instrument11 for guidelines. Summary scores were not calculated for the included studies; rather, the strengths and limitations of each included publication were described narratively.
A total of 258 citations were identified in the literature search. Following screening of titles and abstracts, 217 citations were excluded and 41 potentially relevant reports from the electronic search were retrieved for full-text review. Three potentially relevant publications were retrieved from the grey literature search for full-text review. Of these potentially relevant articles, 32 publications were excluded for various reasons and 12 publications met the inclusion criteria and were included in this report. These comprised 9 diagnostic test accuracy studies, 2 economic evaluations, and 1 evidence-based guideline. Appendix 1 presents the PRISMA12 flow chart of the study selection.
Additional references of potential interest are provided in Appendix 5.
Nine diagnostic test accuracy studies,13-21 2 economic evaluations,22,23 and 1 evidence-based guideline24 were identified for inclusion in this review. No relevant systematic reviews, health technology assessments, or randomized controlled trials were identified.
Details regarding the characteristics of included publications are provided in Appendix 2.
Seven diagnostic test accuracy studies13,14,16-18,20,21 retrospectively evaluated pulmonary nodules from participants enrolled in previous studies that used a single-gate approach for patient selection (i.e., patients with unknown lung cancer status). Four studies13,14,18,21 included patients who were randomly assigned to LDCT screening in the multi-centre National Lung Screening Trial (NSLT).25 Patients were enrolled from August 2002 through April 2004. One diagnostic test accuracy study included patients who were randomized to LDCT screening from October 2004 to March 2006 in a single-centre trial.20 Two studies included patients previously enrolled in prospective cohort studies: 1 was a population-based multi-centre study that enrolled patients from April 2017 to December 201816 and the other study recruited patients from December 2007 to December 2010 at a single tertiary institution.17
Two diagnostic test accuracy studies enrolled new patients and also used a single-gate approach for patient selection: 1 retrospective study screened patients from December 2012 to June 2016 at a single centre15 and 1 multi-centre prospective study enrolled patients between June 2015 and December 2017.19
The 2 economic evaluations were conducted as cost-utility analyses, using lifetime horizons. One study22 used a Monte Carlo simulation model for subsolid nodules using data from the literature, NLST, and national databases. Major assumptions included that follow-up CT screening led to definitive treatment and a willingness-to-pay threshold of US$100,000 per quality-adjusted life-year (QALY). The perspective taken was that of the health care system and society. The other study23 used a predictive logistic regression model for nodules assigned a Lung-RADS category of 4A, 4B, or 4X. Costs were derived from a previous cost-effectiveness study and the perspective of health care payers and policy-makers was taken. Assumptions were made about survival and mortality rates, follow-up detection of nodules originally screened as benign, and growth rates. Model parameters in both studies included nodule properties, patient characteristics, mortality, and treatment.
The evidence-based guideline24 was developed by the British Thoracic Society (BTS), which included respiratory physicians, radiologists, respiratory specialty trainees, a thoracic surgeon, a pathologist, and a respiratory nurse practitioner. The guideline was informed by systematic reviews of the literature. The recommendations were classified based on Scottish Intercollegiate Guidelines Network criteria. A grade between A (highest) and D (lowest) or Checkmark (no research evidence) was assigned to each recommendation. no research evidence was assigned to each recommendation. Scientific evidence that informed the recommendations was classified between 1++ (highest quality) and 4 (lowest quality). Before publication, the draft guideline was made available online for public consultation and feedback was invited from stakeholder organizations.
The diagnostic test accuracy studies were conducted in, and enrolled patients from, Australia,17 Canada,19 Denmark,20 Germany,20 the Netherlands,20 South Korea,16 and the US.13-15,18,21
The 2 economic evaluations were conducted by authors in the US.22,23
The guideline was intended for use in the UK.24
Four diagnostic test accuracy studies13,14,18,21 included participants in the LDCT screening arm of NSLT of between 55 years and 74 years of age, had a history of cigarette smoking of at least 30 pack-years, and, if former smokers, had quit within the previous 15 years. Each study included a different subset of NSLT participants: 1 study included a random set of 434 patients with subsolid or part-solid nodules,13 1 study included 58 patients with images of pre-cancers and 127 patients with benign nodules,14 another study included 6,956 patients with solid nodules only,18 and the fourth study assessed 2,813 patients with all nodule types.21
Two other studies included patients using the eligibility criteria of NSLT: the study by Kessler et al. (2020) enrolled 486 patients with a mean age of 63 years15 and the study by Marshall et al. (2017) included 256 patients aged 60 years or older.17 The study by Tremblay et al. (2019) enrolled 775 patients who either met the NSLT eligibility criteria or were aged 55 years to 80 years of age with an estimated 6-year lung cancer risk of 1.5% or more.19
The diagnostic test accuracy study by van Riel et al. (2017) included 613 current or former smokers (aged 50 years to 75 years) who had any nodule identified by LDCT screening.20 The study by Kim et al. (2021) included 4,578 patients (median age of 62 years, 54% smokers) with non-calcified nodules determined after LDCT screening.16
Patients in 1 economic evaluation included a hypothetical cohort of 10 million current and former smokers ranging from 55 years to 75 years of age and assumed to have subsolid nodules (SSNs) at baseline LDCT.22 Patients in the other economic evaluation included a simulated cohort of 100,000 patients aged 61 years to 71 years assigned Lung-RADS category 4 nodules.23
The target population of the included guideline was adult patients with pulmonary nodules. The intended users of all recommendations included health care professionals such as clinicians (e.g., physicians, general practitioners, radiologists, surgeons) and nurses.24
The 9 diagnostic test accuracy studies13-21 assessed their study populations using both Lung-RADS and PanCan. Confirmed lung cancer diagnosis, as determined by follow-up diagnostic procedures, was considered the reference standard for their analyses. The screening algorithms were applied to the baseline scans and the lung cancer diagnosis was assessed over the follow-up periods and ranging from 2 years to 6.5 years.
Hammer et al. (2019) reported that the assessments using both screening algorithms were conducted without knowledge of the results of the reference standard.13 In the other 8 studies, it was unclear if the screeners were blinded to the final diagnoses of the patients.14-21
The economic evaluations examined the cost-effectiveness of Lung-RADS and PanCan.22,23 In 1 study, Lung-RADS was compared to 2 different guidelines for nodule management using PanCan: the American College of Chest Physicians (CHEST) guidelines, which recommended PET or CT following a high-risk score and non-surgical lung biopsy following intermediate PET/CT results; and the BTS guideline, which recommended PET or CT following an intermediate risk score.23
The relevant intervention considered in the guideline was PanCan for initial risk assessment of the probability of malignancy in pulmonary nodules and management of SSNs.24
The diagnostic test accuracy studies calculated various parameters of diagnostic performance. Seven studies reported the area under the curve (AUC).13,15-20 Six studies reported sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV),15-19,21 and 3 studies reported accuracy.14,15,21
Seven studies calculated diagnostic parameters. Two studies used a PanCan risk score of 5% as the threshold for positivity19,21 and 2 studies used a PanCan risk score of 10% to indicate a positive result.14,17 Two studies used Lung-RADS category 3 as the threshold for a positive result17,19 and 2 studies used category 4A/4B as the threshold for a positive result.14,21 Three studies calculated diagnostic test accuracy for different thresholds used by each risk algorithm to determine whether a nodule was positive.15,16,18
The economic evaluations calculated the costs and QALYs for each screening strategy and incremental cost-effectiveness ratios (ICERs) expressed as ratios of incremental cost incurred per QALY gained, comparing different screening strategies.22,23
The BTS guideline provides recommendations relevant to the current report, which considered the diagnostic test accuracy of PanCan in estimating the probability that a lung nodule would be diagnosed as cancer within a 2- to 4-year follow-up period.24
There were several strengths common to the 9 diagnostic test accuracy studies:13-21 the screening tests, the fact that their conduct and interpretation matched the review question, and that the thresholds used for the screening tests were pre-specified; the target condition (i.e., lung cancer), as defined by the reference standard (i.e., lung cancer diagnosed during the follow-up period), matched the research question; and the study participants, care providers, and settings appeared to be representative of the population and that they were care settings of interest. Eight diagnostic test accuracy studies13,15-21 also clearly described objectives, interventions, controls, inclusion criteria, outcomes, and main findings, and avoided the use of a case-control study design. Eight studies reported sources of funding13,14,16-21 and 7 studies presented characteristics of included patients and lung nodules.13,15-17,19-21 The authors of 7 studies disclosed no conflicts of interest.13-18,21
As for limitations, it was unclear if the screeners who conducted the assessments using Lung-RADS and PanCan were blinded to the results of the reference standard (i.e., having no knowledge of the final diagnosis) in 8 studies.14-21 The authors of 2 studies had conflicts of interest.19,20 Three studies had a potential risk of bias because of missing data.15,20,21 Non-consecutive patients were included in the study by Hammer et al. (2019).13 In the study by Hawkins et al. (2016), there were additional limitations: a case-control design was used, it was unclear how patients were selected, and patient characteristics were not reported.14
The 2 economic evaluations22,23 shared the following strengths: the research question and its economic importance were stated; sources of effectiveness estimates, primary outcome, details of the simulation models, and methods for the estimation of quantities and unit costs were described; the time horizon of costs and benefits, discount rate, and details of statistical tests and sensitivity analyses were given; the incremental analysis was reported; conclusions were given; and the authors stated that they had no conflicts of interest. One economic evaluation23 provided confidence intervals for costs and QALYs, but the other study did not.22 The studies also shared the following limitations: no description of current price adjustments for inflation was provided, no justification for the selected discount rate was provided, and sources of funding were not disclosed.
The BTS evidence-based guideline24 provided a clear description of its scope and purpose, including objectives, the range of clinical questions covered in the guideline, the intended users, and the target population. The final recommendations were easily identifiable and were written using language that was clear and unambiguous. The guideline development groups appeared to include individuals from all relevant professional groups. Patient preferences were not sought or incorporated as part of the development process; however, a draft guideline document was made available online for public consumption before publication. Systematic literature searches were used to identify evidence for consideration when developing recommendations. Search strategies, databases searched, and the timing of the literature searches were clearly described. The guideline provided a detailed description of the methods used for selecting articles and stated that the screening process was done in duplicate. The guideline included a description of how the recommendations were formulated. Recommendations were externally reviewed by stakeholders before their publication and included a procedure for updating the guideline in the future. Regarding applicability, the facilitators and barriers to the implementation of the recommendations were not addressed. A description of potential conflicts of interest was included; however, sources of funding were not disclosed, making it unclear if the funders’ views had any impact on the content of the guideline. Finally, it should be noted that this guideline was developed for use in the UK; therefore, the generalizability of the recommendations to the Canadian context is unclear.
Additional details regarding the strengths and limitations of the included publications are provided in Appendix 3.
Appendix 4 presents the main study findings.
Evidence regarding the diagnostic accuracy of Lung-RADS versus PanCan for the detection of malignant pulmonary nodules was available from 9 diagnostic test accuracy studies.13-21
The study by van Riel et al. (2017) reported that PanCan performed significantly better than Lung-RADS for discriminating benign nodules from malignant nodules, based on AUCs (0.87 versus 0.81, P = 0.003).20 Three other studies13,17,19 reported greater, but non-significant, AUCs for PanCan (0.78, 0.90, 097) compared to Lung-RADS (0.70, 0.84, 0.93) and 3 other studies15,16,18 found similar AUCs for both screening tests.
The study by White et al. (2019)21 found that PanCan had statistically significantly greater specificity and accuracy (85%, 85%) compared to Lung-RADS (76%, 76%); sensitivity was also greater for PanCan (93% versus 87%), but the difference was not statistically significant. The study by Hawkins et al. (2016)14 reported that PanCan had greater accuracy compared to Lung-RADS (79% versus 71%; statistical significance was not reported). One study found that both PanCan and Lung-RADS were 100% sensitive, but PanCan had higher specificity (95% versus 80%; statistical significance was not reported).17 The study by Kessler et al. (2020)15 found that PanCan had higher sensitivity, although it was not a statistically significant finding (74% versus 58%), but it had a lower specificity than Lung-RADS (94% versus 98%; statistical significance was not reported). Three studies16,18,19 found similar diagnostic parameters between PanCan and Lung-RADS.
No relevant evidence regarding the comparative clinical utility of Lung-RADS versus PanCan was identified; therefore, no summary can be provided.
One economic evaluation reported that Lung-RADS compared to PanCan was cost-effective under a willingness-to-pay threshold of $100,000 per QALY, with an ICER of $52,993 per QALY.22
The second economic evaluation found that the BTS guideline using PanCan was associated with more QALYs than Lung-RADS or the CHEST guidelines using PanCan and lower costs compared to the CHEST guidelines using PanCan. The BTS guideline compared to Lung-RADS had an ICER of $52,643 per QALY gained.23
The BTS guideline24 recommends the use of PanCan for initial risk assessment of pulmonary nodules in people aged 50 years or older who are smokers or former smokers. This is a grade C recommendation, based on level 2+ evidence from a validation study conducted in the UK.26
The BTS guideline24 recommends consideration of PanCan for the initial risk assessment of pulmonary nodules in all patients. This is a grade D recommendation, based on level 3 evidence.
The BTS guideline24 recommends the use of PanCan to calculate the risk of malignancy in SSNs larger than 5 mm that are unchanged at 3 months. This is a grade C recommendation, based on level 2+ evidence.
Seven of the 9 diagnostic test accuracy studies were retrospective.13-15,17,18,20,21 The studies that included patients from NSLT13,14,18,21 shared limitations related to the original trial. Scanners used in the trial were less technologically advanced than scanners available today. Also, the trial was conducted at a variety of medical institutions in the US, many of which were recognized for their expertise in radiology and in the diagnosis of cancer;25; therefore, applicability to all Canadian facilities is uncertain.
No studies on the benefits and harms of Lung-RADS versus PanCan were found. It is unclear which of these 2 screening protocols may result in improved clinical outcomes for patients undergoing LDCT screening to identify malignant lung nodules.
Apart from 1 prospective diagnostic test accuracy study,19 none of the primary studies or economic evaluations22,23 were conducted in Canada. Similarly, the included guideline was not intended for professionals in Canada.24 Therefore, the generalizability of the findings from the included literature and the applicability of the recommendations from the included guideline to Canadian settings are unclear.
This review comprised 9 diagnostic accuracy studies,13-21 2 economic evaluations,22,23 and 1 evidence-based guideline.24
Evidence from 6 retrospective studies suggests that PanCan had superior diagnostic test accuracy compared to Lung-RADS for predicting malignancy.13-15,17,20,21 However, evidence from 1 prospective study and 1 retrospective analysis suggests that the screening protocols have a similar diagnostic performance.16,18 Findings from the sole prospective study performed in Canada also found similar diagnostic test accuracy between Lung-RADS and PanCan.16,18,19
Results from the 2 economic evaluations were inconsistent about the cost-effectiveness of the 2 lung cancer risk models. One study22 reported an ICER of $52,993 per QALY for Lung-RADS compared to PanCan, while the other study reported an ICER of $52,643 per QALY for the BTS guideline using Pan Can compared to Lung-RADS.23 However, each study applied the models to different types of lung nodules.
The BTS guideline recommends PanCan for initial risk assessment of the probability of malignancy in pulmonary nodules and management of SSNs.24 However, there are no Canadian guidelines that recommend the use of Lung-RADS or PanCan for the identification of malignant lung nodules in patients at high risk of lung cancer undergoing LDCT screening.
The limitations of the included literature should be considered when interpreting the findings of this report. Further research investigating the diagnostic accuracy of Lung-RADS versus PanCan in Canadian settings would help confirm if PanCan performs better or is similar to Lung-RADS for distinguishing malignant pulmonary nodules. Clinical utility research is needed to evaluate the benefits and harms to patients using Lung-RADS compared to PanCan. Future economic evaluations conducted from Canadian perspectives and guideline recommendations intended for Canadian settings may be helpful to further inform clinical and policy decisions.
1.Canadian Cancer Statistics 2021. Ottawa (ON): Canadian Cancer Statistics Advisory Committee, Statistics Canada, Public Health Agency of Canada; 2021: https://cdn.cancer.ca/-/media/files/research/cancer-statistics/2021-statistics/2021-pdf-en-final.pdf?rev=2b9d2be7a2d34c1dab6a01c6b0a6a32d&hash=01DE85401DBF0217F8B64F2B7DF43986&_ga=2.203979736.995607737.1638540598-408723823.1638540598. Accessed 2021 Nov 27.
2.Huang KL WS, Lu WC, Chang YH, Su J, Lu YT. Effects of low-dose computed tomography on lung cancer screening: a systematic review, meta-analysis, and trial sequential analysis. BMC Pulm Med. 2019;19(1):126. PubMed
3.Kazerooni EA, Armstrong MR, Amorosa JK, et al. ACR CT accreditation program and the lung cancer screening program designation. J Am Coll Radiol. 2015;12(1):38-42. PubMed
4.An JY, Unsdorfer KML, Weinreb JC. BI-RADS, C-RADS, CAD-RADS, LI-RADS, Lung-RADS, NI-RADS, O-RADS, PI-RADS, TI-RADS: Reporting and Data Systems. Radiographics. 2019;39(5):1435-1436. PubMed
5.Dziadziuszko K, Szurowska E. Pulmonary nodule radiological diagnostic algorithm in lung cancer screening. Transl Lung Cancer Res. 2021;10(2):1124-1135. PubMed
6.Chelala L, Hossain R, Kazerooni EA, Christensen JD, Dyer DS, White CS. Lung-RADS Version 1.1: Challenges and a Look Ahead, From the AJR Special Series on Radiology Reporting and Data Systems. AJR Am J Roentgenol. 2021;216(6):1411-1422. PubMed
7.McWilliams A, Tammemagi MC, Mayo JR, et al. Probability of cancer in pulmonary nodules detected on first screening CT. N Engl J Med. 2013;369(10):910-919. PubMed
8.Winkler Wille MM, van Riel SJ, Saghir Z, et al. Predictive Accuracy of the PanCan Lung Cancer Risk Prediction Model -External Validation based on CT from the Danish Lung Cancer Screening Trial. Eur Radiol. 2015;25(10):3093-3099. PubMed
9.Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529-536. PubMed
10.Higgins JPT, Green S, editors. Figure 15.5.a: Drummond checklist (Drummond 1996). Cochrane handbook for systematic reviews of interventions. London (GB): The Cochrane Collaboration; 2011: http://handbook-5-1.cochrane.org/chapter_15/figure_15_5_a_drummond_checklist_drummond_1996.htm. Accessed 2021 Dec 3.
11.Agree Next Steps Consortium. The AGREE II Instrument. Hamilton (ON): AGREE Enterprise; 2017: https://www.agreetrust.org/wp-content/uploads/2017/12/AGREE-II-Users-Manual-and-23-item-Instrument-2009-Update-2017.pdf. Accessed 2021 Dec 3.
12.Liberati A, Altman DG, Tetzlaff J, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J Clin Epidemiol. 2009;62(10):e1-e34. PubMed
13.Hammer MM, Palazzo LL, Kong CY, Hunsaker AR. Cancer Risk in Subsolid Nodules in the National Lung Screening Trial. Radiology. 2019;293(2):441-448. PubMed
14.Hawkins S, Wang H, Liu Y, et al. Predicting Malignant Nodules from Screening CT Scans. J Thorac Oncol. 2016;11(12):2120-2128. PubMed
15.Kessler A, Peng R, Mardakhaev E, Haramati LB, White CS. Performance of the Vancouver Risk Calculator Compared with Lung-RADS in an Urban, Diverse Clinical Lung Cancer Screening Cohort. Radiol Imaging Cancer. 2020;2(2):e190021. PubMed
16.Kim H, Kim HY, Goo JM, Kim Y. External validation and comparison of the Brock model and Lung-RADS for the baseline lung cancer CT screening using data from the Korean Lung Cancer Screening Project. Eur Radiol. 2021;31(6):4004-4015. PubMed
17.Marshall HM, Zhao H, Bowman RV, et al. The effect of different radiological models on diagnostic accuracy and lung cancer screening performance. Thorax. 2017;72(12):1147-1150. PubMed
18.Sundaram V, Gould MK, Nair VS. A Comparison of the PanCan Model and Lung-RADS to Assess Cancer Probability Among People With Screening-Detected, Solid Lung Nodules. Chest. 2021;159(3):1273-1282. PubMed
19.Tremblay A, Taghizadeh N, MacGregor JH, et al. Application of Lung-Screening Reporting and Data System Versus Pan-Canadian Early Detection of Lung Cancer Nodule Risk Calculation in the Alberta Lung Cancer Screening Study. J Am Coll Radiol. 2019;16(10):1425-1432. PubMed
20.van Riel SJ, Ciompi F, Jacobs C, et al. Malignancy risk estimation of screen-detected nodules at baseline CT: comparison of the PanCan model, Lung-RADS and NCCN guidelines. Eur Radiol. 2017;27(10):4019-4029. PubMed
21.White CS, Dharaiya E, Dalal S, Chen R, Haramati LB. Vancouver Risk Calculator Compared with ACR Lung-RADS in Predicting Malignancy: Analysis of the National Lung Screening Trial. Radiology. 2019;291(1):205-211. PubMed
22.Hammer MM, Eckel AL, Palazzo LL, Kong CY. Cost-Effectiveness of Treatment Thresholds for Subsolid Pulmonary Nodules in CT Lung Cancer Screening. Radiology. 2021;300(3):586-593. PubMed
23.Hammer MM, Gupta S, Kong CY. Cost-Effectiveness of Management Algorithms for Lung-RADS Category 4 Nodules. Radiol Cardiothorac Imaging. 2021;3(2):e200523. PubMed
24.Callister M, Baldwin D, Akram A, et al. British Thoracic Society guidelines for the investigation and management of pulmonary nodules. Thorax. 2015;70:ii1-ii54. PubMed
25.National Lung Screening Trial Research Team, Aberle D, Adams A, et al. Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. N Engl J Med. 2011;365(5):395-409. PubMed
26.Al-Ameri A, Malhotra P, Thygesen H, et al. Risk of malignancy in pulmonary nodules: a validation study of four prediction models. Lung Cancer. 2015;89(1):27-30. PubMed
27.Hammer MM, Byrne SC, Kong CY. Factors Influencing the False Positive Rate in CT Lung Cancer Screening. Acad Radiol. 2020;03:03.
Note that this appendix has not been copy-edited.
Table 2: Characteristics of Included Primary Clinical Studies
Study citation, country, funding source | Study design | Population characteristics | Intervention and comparator(s) | Clinical outcomes, length of follow-up |
---|---|---|---|---|
Diagnostic test accuracy | ||||
Kim et al. (2021)16 South Korea Funding sources: National R&D Program for Cancer Control, Ministry of Health and Welfare; the National Health Promotion Fund, Ministry of Health and Welfare, Republic of Korea. | Secondary analysis of a national, multi-centre, prospective cohort study (Korean Lung Cancer Screening Project) | Inclusion criteria: (1) participants whose CT scans were read using a cloud-based thin-client reading system; (2) participants with at least 1 non-calcified nodule Exclusion criteria: (1) participants who had undergone baseline chest CT scans before enrolment; (2) Lung-RADS category 1 (i.e., calcified nodules); (3) masses larger than 30 mm, which were not of interest for a prediction model; (4) participants with missing values for the model inputs Number of participants: 4,578 Median age, IQR (years): 62, 59 to 67 Sex: 97% males | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis, confirmed by pathology | Outcome: Diagnostic test accuracy (AUC, sensitivity, specificity, PPV, NPV) Follow-up: median 664 days, IQR 562 to 794 days |
Sundaram et al. (2021)18 US Funding source: Cancer Center Support Grant | Retrospective analysis of a multi-centre randomized controlled trial (NLST) | Inclusion criteria: Patients in the LDCT arm of NLST, with a new positive finding (i.e., solid nodules) in any of the 3 screening years Exclusion criteria: Participants with masses (i.e., lesions larger than 3 cm in maximum diameter), partly solid nodules or GGNs, a prior lung cancer diagnosis, and/or missing nodule size; and participants for whom age at smoking cessation was inconsistent with age at randomization Number of participants: 6,956 Age: NR Sex: NR | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by needle biopsy, surgery, bronchoscopy, further imaging, or other diagnosis procedure | Outcome: Diagnostic test accuracy (AUC, sensitivity, specificity, PPV, NPV) Follow-up: 2 years |
Kessler et al. (2020)15 US Funding source: NR | Single-centre, retrospective diagnostic test accuracy study | Inclusion criteria: Patients eligible for screening, according to NLST and Centers for Medicare and Medicaid Services eligibility criteria Exclusion criteria: NR Number of participants: 486 Mean age, SD (years): 63, 5 Sex: 54% females | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by histologic evaluation, follow-up imaging, or clinical evaluation | Outcome: Diagnostic test accuracy (AUC, sensitivity, specificity, PPV, NPV, accuracy) Follow-up: mean 40 months, SD 14 months |
Hammer et al. (2019)13 US Funding source: National Institutes of Health/ National Cancer Institute. | Retrospective analysis of a multi-centre randomized controlled trial (NLST) | Inclusion criteria: Patients in the LDCT arm of NLST, with at least 2 scans (i.e., baseline and at least 1 follow-up, with: (a) subsolid pulmonary nodules comprising GGNs smaller than 10 mm; (b) GGNs measuring 10 mm or larger; and (c) part-solid nodules (“mixed” nodules) measuring 6 mm or larger Exclusion criteria: Patients with nodules that were not truly subsolid; patients with part-solid nodules smaller than 6 mm Number of participants: 434 Median age, range (years): 62, 55 to 74 Sex: 51% females | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by needle biopsy, surgery (thoracotomy, thoracoscopy, or mediastinoscopy) bronchoscopy, chest CT imaging; PET/chest MRI, or other diagnosis procedure | Outcome: Diagnostic test accuracy (AUC) Follow-up: 6.5 years |
Tremblay et al. (2019)19 Canada Funding source: Alberta Cancer Foundation | Prospective, diagnostic test accuracy study | Inclusion criteria: Participants meeting NSLT eligibility criteria (i.e., 55-74 years of age; ≥ 30 pack-years smoking history or quit smoking ≤ 15 years prior) or who were 55 to 80 years of age and had estimated 6-year lung cancer risk ≥ 1.5% using validated model (PLCOm2012) Exclusion criteria: Participants with nodules previously detected on off-study clinical scans Number of participants: 775 Mean age, range (years): 63.3, 55 to 80 Sex: 49.9% women | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; including histopathology | Outcome: Diagnostic test accuracy (AUC, sensitivity, specificity, PPV, NPV) Follow-up: mean 763 days, SD 203 days |
White et al. (2019)21 US Funding source: No funding | Retrospective analysis of a multi-centre randomized controlled trial (NLST) | Inclusion criteria: Patients meeting the NSLT eligibility criteria (i.e., 55-74 years of age; ≥ 30 years smoking history or quit smoking ≤ 15 years prior) with ≥ 4mm nodules found at initial prevalence screening Exclusion criteria: Patients with incomplete information Number of participants: 2,813 (4,408 nodules) Mean age, SD (years): 64.3, 5.2 (patients with malignant nodules); 62.1, 5.1 (patients with benign nodules) Sex: 41% women | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by needle biopsy, surgery, bronchoscopy, further imaging, or other diagnosis procedure | Outcomes: Diagnostic test accuracy (sensitivity, specificity, accuracy) Follow-up: 2 years |
Marshall et al. (2017)17 Australia Funding sources: National Health and Medical Research Councils; Smart State Project Grant, Queensland Health; National Centre for Asbestos Related Diseases Project Grant; The Prince Charles Hospital Foundation | Retrospective analysis of a prospective cohort study (QLCSS) | Inclusion criteria: Patients aged 60 to 74 years; with minimum lung function (i.e., forced expiratory volume in 1 second ≥ 50% predicted); smokers (i.e., ≥ 30 pack-years, current or quit within the past 15 years) Exclusion criteria: Patients with any medical comorbidity; CT scan within the prior 18 months; poor spirometry; lost to follow-up; or missing data Number of participants: 256 Median age (years): 64.5 Sex: 67% males | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by fine need aspirate, bronchoscopy, surgery, or other diagnostic procedure | Outcome: Diagnostic test accuracy (AUC, sensitivity, specificity, PPV, NPV) Follow-up: 5 years |
van Riel et al. (2017)20 Denmark, Germany, the Netherlands Funding source: MeVis Medical Solutions AG | Retrospective analysis of a randomized controlled trial (Danish Lung Cancer Screening Trial) | Inclusion criteria: Patients aged 50 to 70 years who were current or former smokers with a minimum smoking history of 20 pack-years, normal lung function, and nodules annotated by at least 1 screening radiologist Exclusion criteria: NR Number of participants: 613 Mean age, range (years): 58, 50 to 75 Sex: 53% women | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by lung cancer mortality, histology/tissue sampling, follow-up LDCT, chest CT or PET, staging, and other diagnostic procedure | Outcome: Diagnostic test accuracy (AUC) Follow-up: 9 years |
Hawkins et al. (2016)14 US Funding sources: US Public Health Service; Cancer Center Support Grant; State of Florida Department of Health. | Nested matched case-control study using data from a multi-centre randomized controlled trial (NLST) | Inclusion criteria: Patients in the LDCT arm of NLST with screen-detected lung cancer (cases) or cancer-free findings (controls) Exclusion criteria: NR Number of participants: 185 Mean age (years): 64 Sex: 53% females | Intervention: Lung-RADS Comparator: PanCan Reference standard: Lung cancer diagnosis; as determined by needle biopsy, surgery, bronchoscopy, further imaging, or other diagnosis procedure | Outcome: Diagnostic test accuracy (sensitivity, specificity, accuracy) Follow-up: 2 years |
AUC = area under the curve; GGN = ground-glass nodule; IQR = interquartile range; LDCT = low-dose CT; Lung-RADS = Lung Imaging Reporting and Data System; NLST = National Lung Screening Trial; NPV = negative predictive value; NR = not reported; PanCan = Pan-Canadian Early Detection of Lung Cancer; PPV = positive predictive value; QLCSS = Queensland Lung Cancer Screening Study; SD = standard deviation.
Table 3: Characteristics of Included Economic Evaluations
Study citation country, funding source | Type of analysis, time horizon, perspective | Population characteristics | Intervention and comparator(s) | Approach | Source of clinical, cost, and utility data used in analysis | Main assumptions |
---|---|---|---|---|---|---|
Hammer et al. (2021)22 US Funding source: NR | Analysis: Cost-utility analysis. Time horizon: Lifetime Perspective: Health care system and society | A hypothetical cohort of 10 million current and former smokers undergoing LDCT lung cancer screening who are assumed to have a ground-glass nodules at baseline CT. Patient age range was 55 to 75 years at the beginning of screening, and 49% of patients were men. | Intervention: Lung-RADS Comparator: PanCan | A state-transition Monte Carlo simulation model with a monthly cycle to investigate the effectiveness of the nodule management guidelines for non-solid nodules. Nodules could grow and develop solid components. | Initial nodule size, nodule growth rates, and the potential for the development of a solid component were determined using data from the literature. Patient characteristics were generated from primary analysis of the NLST data. Age-dependent mortality rates were derived from the National Health Interview Survey, Substance Abuse and Mental Health Services, American Cancer Society cancer prevention studies, and Berkeley Mortality Database. Incidental cancer rates dependent on patient age and smoking status were derived by using the Lung Cancer Policy Model, and the Smoking History Generator was used to model individuals’ smoking history. The QALYs and costs were discounted by 3% per year. | Nodules meeting criteria for Lung-RADS category 4B or category 4X at follow-up CT proceed to definitive treatment in the model, as per the assumption that they started out as GGNs and must have grown and/or developed solid components to meet these criteria. Patients are assumed to have a single nodule for purposes of PanCan. A willingness-to-pay threshold of $100,000 per QALY was used, in keeping with recommendations from the literature for the US health care system. |
Hammer et al. (2021)23 US Funding source: NR | Analysis: Cost-utility analysis. Time horizon: Lifetime. Perspective: Payers and policy-makers within a health care system. | A simulated cohort of 100,000 patients derived from a random subset of 151 patients who underwent LDCT lung cancer screening within the health care network and had been assigned a Lung-RADS category of 4A, 4B, or 4X. The median age was 66 years (range 61 to 71). Males made up 55% of the cohort. | Intervention: Lung-RADS (using the following stratification methods: nodule size to determine initial categorization; PET and/or CT for category 4B or 4X nodules, and follow-up for category 4A nodules). Comparators: BTS guideline using PanCan (using the following stratification methods: Brock risk score for initial categorization; PET or CT for high Brock risk score, follow-up for low Brock risk score, biopsy for intermediate PET and/or CT results); or CHEST guidelines using PanCan (using the following stratification methods: | Multivariable logistic regression analysis to predict the results of PET or CT and of follow-up chest CT from patient and nodule characteristics as well as the nodule diagnosis (benign vs. malignant). | Cancer treatment costs were derived from a previous cost-effectiveness study.27 Costs and QALYs were discounted at 3% per year. Baseline life expectancies by age and sex for smokers were obtained from the State Board of Administration of Florida. Survival of localized lung cancer by cancer stage was derived from the IASLC Lung Cancer Staging Project. | Survival of patients with clinical tumor size 0 cancer was assumed to be 100%, thus median survival was set at 99 years. Survival of patients with metastatic disease was estimated at 5% at 5 years, yielding a median survival of 1.15 years. For a given patient, survival was calculated as the minimum of age-based survival and, if the nodule was malignant, then it was calculated as cancer-based survival (in other words, a patient could not survive past his/her age-based life expectancy). For benign nodules, a surgery-related mortality rate of 2% and a biopsy-related mortality rate of 0.2% were implemented stochastically. For patients who underwent surgery for a benign nodule, the utility was assumed to be 0.9 for that year, then normal (utility value not provided) thereafter. |
Brock score for initial stratification; PET or CT for intermediate Brock risk score, follow-up for low Brock risk score). | Any lung cancers that were assigned a benign diagnosis by the management algorithm were assumed to be detected at a follow-up CT performed as part of the lung cancer screening program 1 year after the work-up. Lung cancers were assumed to grow during this period, but the rate of growth was dependent upon nodule characteristics. The growth rate of a lung cancer was assumed to be 150 days, approximately the average rate of solid lung cancers. |
BTS = British Thoracic Society; CHEST = American College of Chest Physicians; GGN = ground-glass nodule; IALSC = International Association for the Study of Lung Cancer; ICER = incremental cost-effectiveness ratio; LDCT = low-dose CT; Lung-RADS = Lung Imaging Reporting and Data System; NLST = National Lung Screening Trial; NR = not reported; PanCan = Pan-Canadian Early Detection of Lung Cancer; QALY = quality-adjusted life-year; vs. = versus.
Table 4: Characteristics of Included Guidelines
Intended users, target population | Intervention and practice considered | Major outcomes considered | Evidence collection, selection, and synthesis | Evidence quality assessment | Recommendations development and evaluation | Guideline validation |
---|---|---|---|---|---|---|
British Thoracic Society Guideline (2015)24 | ||||||
Intended users: Practitioners within the UK, including physicians, general practitioners, nurses, radiologists, surgeons and other health care professionals Target population: Adults with pulmonary nodules | Risk assessment for malignancy based on clinical and radiological factors; management of subsolid nodules | Risk of cancer; management of subsolid nodules; lung cancer diagnosis; diagnostic test accuracy of risk calculators | Evidence collection: Systematic electronic database searches were conducted using Ovid MEDLINE, Ovid Embase and the Cochrane Library to identify potentially relevant studies for inclusion in the guideline. The searches were first run in November 2012 and updated in June 2014. Evidence selection: Literature retrieved in the electronic searches was screened for relevance by 2 reviewers Evidence synthesis: The body of evidence for each recommendation was summarized into evidence statements and graded using the Scottish Intercollegiate Guidelines Network grading system. | Appraisal was performed to be compliant with the AGREE collaboration. Two guideline reviewers independently appraised each paper using the Scottish Intercollegiate Guidelines Network critical appraisal checklists. | Recommendations were graded according to the strength of the evidence: A: At least 1 meta-analysis, systematic review or RCT rated as 1++ and directly applicable to the target population; or ≥ 1 systematic reviews or RCTs; or a body of evidence consisting principally of studies rated as 1+ directly applicable to the target population and demonstrating overall consistency of results. B: A body of evidence including studies rated as 2++ directly applicable to the target population and demonstrating overall consistency of results; or extrapolated evidence from studies rated as 1++ or 1+. C: A body of evidence including studies rated as 2+ directly applicable to the target population and demonstrating overall consistency of results; or extrapolated evidence from studies rated as 2++. D: Evidence level 3 or 4; or extrapolated evidence from studies rated as 2+. Checkmark: Important practical points for which there is no research evidence, nor is there likely to be any research evidence. | The draft guideline was made available online for public consultation and feedback was invited from stakeholder organizations. |
AGREE = Appraisal of Guidelines for Research & Evaluation; RCT = randomized controlled trial.
Note that this appendix has not been copy-edited.
Table 5: Strengths and Limitations of Diagnostic Test Accuracy Studies Using the QUADAS-2 Checklist9
Strengths | Limitations |
---|---|
Kim et al. (2021)16 | |
|
|
Sundaram et al. (2021)18 | |
|
|
Kessler et al. (2020)15 | |
|
|
Hammer et al. (2019)13 | |
|
|
Tremblay et al. (2019)19 | |
|
|
White et al. (2019)21 | |
|
|
Marshall et al. (2017)17 | |
|
|
van Riel et al. (2017)20 | |
|
|
Hawkins et al. (2016)14 | |
|
|
QUADAS-2 = Quality Assessment of Diagnostic Accuracy Studies 2; Lung-RADS = Lung Imaging Reporting and Data System; NLST = National Lung Screening Trial; PanCan = Pan-Canadian Early Detection of Lung Cancer nodule risk calculation; RCT = randomized controlled trial.
Table 6: Strengths and Limitations of Economic Evaluations Using the Drummond Checklist10
Strengths | Limitations |
---|---|
Hammer et al. (2021)22 | |
Study design:
Data collection:
Analysis and interpretation:
Miscellaneous:
|
|
Hammer et al. (2021)23 | |
Study design:
Data collection:
Analysis and interpretation:
Miscellaneous:
|
|
QALY = quality-adjusted life-year.
Table 7: Strengths and Limitations of Guideline Using AGREE II11
Item | BTS Guideline, 201524 |
---|---|
Domain 1: Scope and Purpose | |
1. The overall objective(s) of the guideline is (are) specifically described. | Yes |
2. The health question(s) covered by the guideline is (are) specifically described. | Yes |
3. The population (patients, public, etc.) to whom the guideline is meant to apply is specifically described. | Yes |
Domain 2: Stakeholder Involvement | |
4. The guideline development group includes individuals from all relevant professional groups. | Yes |
5. The views and preferences of the target population (patients, public, etc.) have been sought. | Yes |
6. The target users of the guideline are clearly defined. | Yes |
Domain 3: Rigour of Development | |
7. Systematic methods were used to search for evidence. | Yes |
8. The criteria for selecting the evidence are clearly described. | Yes |
9. The strengths and limitations of the body of evidence are clearly described. | Yes |
10. The methods for formulating the recommendations are clearly described. | Yes |
11. The health benefits, side effects, and risks have been considered in formulating the recommendations. | Yes |
12. There is an explicit link between the recommendations and the supporting evidence. | Yes |
13. The guideline has been externally reviewed by experts before its publication. | Yes |
14. A procedure for updating the guideline is provided. | Yes |
Domain 4: Clarity of Presentation | |
15. The recommendations are specific and unambiguous. | Yes |
16. The different options for management of the condition or health issue are clearly presented. | Yes |
17. Key recommendations are easily identifiable. | Yes |
Domain 5: Applicability | |
18. The guideline describes facilitators and barriers to its application. | Unclear |
19. The guideline provides advice and/or tools on how the recommendations can be put into practice. | Yes |
20. The potential resource implications of applying the recommendations have been considered. | Yes |
21. The guideline presents monitoring and/or auditing criteria. | Yes |
Domain 6: Editorial Independence | |
22. The views of the funding body have not influenced the content of the guideline. | Unclear |
23. Competing interests of guideline development group members have been recorded and addressed. | Yes |
AGREE II = Appraisal of Guidelines for Research & Evaluation II.
Note that this appendix has not been copy-edited.
Table 8: Summary of Findings by Outcome ― Diagnostic Test Accuracy, AUC
Strategy | AUC (95% CI) | ||||||
---|---|---|---|---|---|---|---|
Kim et al. (2021)16 | Sundaram et al. (2021)18 | Kessler et al. (2020)15 | Hammer at al. (2019)13 | Tremblay et al. (2019)19 | Marshall et al. (2017)17 | van Riel et al. (2017)20 | |
Lung-RADS | 0.95 (0.91, 0.99) | 0.84 (0.81, 0.86) | 0.87 (0.84, 0.90) | 0.70 (0.60, 0.80) | 0.93 (0.89, 0.98) | 0.84 (IQR 0.69-0.98) | 0.81 (NR) |
PanCan | 0.96 (0.92, 0.99) | 0.85 (0.82,0.87) | 0.88 (0.85, 0.91) | 0.78 (0.67, 0.85) | 0.97 (0.95, 1.0) | 0.90 (IQR 0.75-1.0) | 0.87 (NR) |
P value | 0.34 | 0.17 | NR | 0.09 | ns | 0.25 | 0.003 |
AUC = area under the curve;CI = confidence interval; IQE = interquartile range; Lung-RADS = Lung Imaging Reporting and Data System; NR = not reported; PanCan = Pan-Canadian Early Detection of Lung Cancer .
Table 9: Summary of Findings by Outcome ― Diagnostic Test Accuracy, Diagnostic Paramaters
Strategy | Diagnostic parameters (95% CI) | ||||||
---|---|---|---|---|---|---|---|
Kim et al. (2021)16 | Sundaram et al. (2021)18 | Kessler et al. (2020)15 | Tremblay et al. (2019)19 | White et al. (2019)21 | Marshall et al. (2017)17 | Hawkins et al. (2016)14 | |
Lung-RADS, 3 and 4A/4B positive | Sens: 95.0 (88.2, 100) Spec: 76.7 (75.8, 77.5) PPV: 1.6 (1.1, 2.1) NPV: 100 (99.9, 100) | Sens: 0.81 (0.76, 0.85) Spec: 0.78 (0.77, 0.79) PPV: 0.16 (0.15, 0.17) NPV: 0.99 (0.98, 0.99) | Sens: 84.2 (68.1, 93.4) Spec: 79.2 (75.1, 82.8) PPV: 25.5 (18.4, 34.3) NPV: 98.3 (96.2, 99.3) Accuracy: 79.6 (75.8, 83.1) | Sens: 76.2 (52.8, 91.8) Spec: 92.6 (90.5, 94.3) PPV: 22.2 (16.8, 28.8) NPV: 99.3 (98.5, 99.7) | NR | Sens: 100 (47.8, 100) Spec: 79.6 (74.1, 84.4) PPV: 8.9 (3.0, 19.6) NPV: 100 (98.2, 100) | NR |
Lung-RADS, 4A/4B positive | Sens: 87.5 (77.3, 97.7) Spec: 93.3 (92.8, 93.7) PPV: 5.0 (3.4, 6.6) NPV: 99.9 (99.9, 100) | Sens: 0.48 (0.42, 0.53) Spec: 0.94 (0.93, 0.94) PPV: 0.29 (0.26, 0.32 NPV: 0.97 (0.97, 0.97) | Sens: 58.0 (40.8, 73.7) Spec: 98.0 (96.2, 99.1) PPV: 71.0 (54.8, 83.1) NPV: 96.5 (95.0, 97.6) Accuracy: 94.9 (92.5, 96.6) | NR | Sens: 87 (80, 93) Spec: 83 (82, 84) Accuracy: 76 (75, 78) | NR | Sens: 22.4 Spec: 93.7 Accuracy: 71.4 |
PanCan, 1.5% threshold | NR | Sens: 0.78 (0.73, 0.82) Spec: 0.82 (0.81, 0.83) PPV: 0.19 (0.17, 0.20) NPV: 0.99 (0.98, 0.99) | Sens: 79.0 (62.7, 90.4) Spec: 83.9 (80.2, 87.2) PPV: 29.4 (24.2, 35.3) NPV: 97.9 (96.2, 98.9) Accuracy: 83.5 (79.9, 86.7) | NR | NR | NR | NR |
PanCan, 5% threshold | Sens: 87.5 (77.3, 97.7) Spec: 92.3 (91.8, 92.9) PPV: 4.4 (3.0, 5.8) NPV: 99.9 (99.9, 100) | Sens: 0.51 (0.46, 0.57) Spec: 0.93 (0.92, 0.94) PPV: 0.29 (0.26, 0.43) NPV: 0.97 (0.97, 0.98) | Sens: 73.7 (56.9, 86.6) Spec: 93.5 (90.8, 95.6) PPV: 49.1 (39.3, 59.0) NPV: 97.7 (96.1, 98.6) Accuracy: 92.0 (89.7, 94.6) | Sens: 90.5 (69.6, 98.8) Spec: 93.1 (91.1, 94.8) PPV: 26.8 (21.4, 33.0) NPV: 99.7 (99.0, 99.9) | Sens: 93 (86, 97) Spec: 90 (89, 90) Accuracy: 85 (84, 86) | NR | NR |
PanCan, 10% threshold | Sens: 82.5 (70.7, 94.3) Spec: 95.9 (95.5, 96.2) PPV: 7.4 (5.0, 9.9) NPV: 99.9 (99.9, 100) | NR | Sens: 65.8 (48.7, 80.4) Spec: 95.8 (93.5, 97.5) PPV: 56.8 (44.5, 68.3) NPV: 97.1 (95.5, 98.1) Accuracy: 93.4 (90.8, 97.3) | NR | NR | Sens: 100 (47.8, 100) Spec: 94.8 (91.3, 97.2) PPV: 27.8 (9.7, 53.5) NPV: 100 (98.5, 100) | Sens: 46.5 Spec: 93.7 Accuracy: 78.9 |
CI = confidence interval; IQR = interquartile range; Lung-RADS = Lung Imaging Reporting and Data System; NPV = negative predictive value; PanCan = Pan-Canadian Early Detection of Lung Cancer nodule risk calculation; PPV = positive predictive value; Sens = sensitivity; Spec = specificity.
Table 10: Summary of Findings of Included Economic Evaluation
Main study findings | Authors’ conclusion |
---|---|
Hammer et al. (2021)22 | |
Costs
QALY
ICER (95% CI)
Sensitivity analyses
| “Lung CT Screening Reporting and Data System–based strategies perform better than strategies using the Brock risk calculator, with a 4-mm solid component size threshold yielding the greatest quality-adjusted life years (QALYs) at a willingness-to-pay threshold of $100 000 per QALY.” (p. 593)22 |
Hammer et al. (2021)23 | |
Costs (95% CI)
QALY (95% CI)
ICER (95% CI)
Sensitivity analyses
| “We found that the two management algorithms on the efficient frontier were Lung-RADS and BTS, with BTS yielding the greatest QALYs. The advantage of the BTS algorithm was seen by its ICER compared with Lung-RADS in statistical analysis by confidence interval and all except one sensitivity analysis we performed; in a condition where the growth rate of faster growing nodules was increased, the ACCP algorithm yielded higher QALYs but at a substantial cost (ICER of over $1 million).” (p. 4)23 “In conclusion, the BTS algorithm was the cost-effective option with the best outcomes for managing high-risk (Lung-RADS 4) pulmonary nodules. This finding held true under multiple sensitivity analyses, suggesting that it may be generalizable, at least within the United States health care system.” (p. 5)23 |
ACCP/CHEST = American College of Chest Physicians; BTS = British Thoracic Society; CI = confidence interval; ICER = incremental cost-effectiveness ratio; Lung-RADS = Lung Imaging Reporting and Data System; PanCan = Pan-Canadian Early Detection of Lung Cancer; QALY = quality-adjusted life-year; USD = US dollar; vs. = versus.
Table 11: Summary of Recommendations in Included Guideline
Guideline | Recommendations and supporting evidence | Quality of evidence and strength of recommendations |
---|---|---|
BTS Guideline (2015)24 | Recommendation: “Use the Brock model (full, with spiculation) for initial risk assessment of pulmonary nodules (≥8 mm or ≥300 mm3) at presentation in people aged ≥50 who are smokers or former smokers.” (p. ii17)24 Evidence statement: Prediction models for pulmonary nodules based on clinical and radiological parameters have been externally validated. In the only validation study performed in a UK population, the Herder model (incorporating nodule FDG avidity) performed significantly better than other models (Mayo, Brock, Veterans Administration). In sub-centimetre nodules, the Brock score had the highest accuracy (AUC value). | Strength of recommendation: Grade C Quality of evidence: Evidence level 2+ |
Recommendation: “Consider the Brock model (full, with spiculation) for initial risk assessment of pulmonary nodules (≥8 mm or ≥300 mm3) in all patients at presentation.” (p. ii17)24 Evidence statement: The use of clinical prediction models is more accurate than clinicians’ individual clinical judgment in estimating the probability of malignancy in patients with pulmonary nodules. | Strength of recommendation: Grade D Quality of evidence: Evidence level 3 | |
Recommendation: “Use the Brock risk prediction tool to calculate risk of malignancy in SSNs ≥5 mm that are unchanged at 3 months.” (p. ii24)24 Evidence statement: One prospective study that validated the Brock model included 1,672 SSNs. The guideline authors reported that the Brock model may underestimate risk of malignancy in SSN that persist at 3 months. | Strength of recommendation: Grade C Quality of evidence: Evidence level 2+ |
AUC = area under the curve; BTS = British Thoracic Society; FDG = fluorodeoxyglucose; SSN = subsolid nodule; SSN = subsolid nodule.
Note that this appendix has not been copy-edited.
Alternative Intervention
Gupta S, Jacobson FL, Kong CY, Hammer MM. Performance of Lung Nodule Management Algorithms for Lung-RADS Category 4 Lesions. Acad Radiol. 2021;28(8):1037-1042. PubMed
Gonzalez Maldonado S, Delorme S, Husing A, et al. Evaluation of Prediction Models for Identifying Malignancy in Pulmonary Nodules Detected via Low-Dose Computed Tomography. JAMA Netw. 2020;3(2):e1921221. PubMed
Dziadziuszko K, Szurowska E. Pulmonary nodule radiological diagnostic algorithm in lung cancer screening. Transl Lung Cancer Res. 2021;10(2):1124-1135. PubMed
Rzyman W, Didkowska J, Dziedzic R, et al. Consensus statement on a screening programme for the detection of early lung cancer in Poland. Adv Respir Med. 2018;86(1):53-74. PubMed
Field JK, Marcus MW, Oudkerk M. Risk assessment in relation to the detection of small pulmonary nodules. Transl Lung Cancer Res. 2017;6(1):35-41. PubMed
Veronesi G, Baldwin DR, Henschke CI, et al. Recommendations for Implementing Lung Cancer Screening with Low-Dose Computed Tomography in Europe. Cancers (Basel). 2020;12(6):24. PubMed
Wormanns D, Kauczor HU, Antoch G, et al. Joint Statement of the German Radiological Society and the German Respiratory Society on a Quality-Assured Early Detection Program for Lung Cancer with Low-Dose CT. ROFO Fortschr Geb Rontgenstr Nuklearmed. 2019;191(11):993-997. PubMed
Jonas DE, Reuland DS, Reddy SM, et al. Screening for Lung Cancer with Low-Dose Computed Tomography Updated Evidence Report and Systematic Review for the US Preventive Services Task Force. JAMA. 2021;325(10):971-987. PubMed
US Preventive Services Task Force, Krist AH, Davidson KW, et al. Screening for Lung Cancer: US Preventive Services Task Force Recommendation Statement. JAMA. 2021;325(10):962-970. PubMed
ISSN: 2563-6596
Disclaimer: The information in this document is intended to help Canadian health care decision-makers, health care professionals, health systems leaders, and policy-makers make well-informed decisions and thereby improve the quality of health care services. While patients and others may access this document, the document is made available for informational purposes only and no representations or warranties are made with respect to its fitness for any particular purpose. The information in this document should not be used as a substitute for professional medical advice or as a substitute for the application of clinical judgment in respect of the care of a particular patient or other professional judgment in any decision-making process. The Canadian Agency for Drugs and Technologies in Health (CADTH) does not endorse any information, drugs, therapies, treatments, products, processes, or services.
While care has been taken to ensure that the information prepared by CADTH in this document is accurate, complete, and up to date as at the applicable date the material was first published by CADTH, CADTH does not make any guarantees to that effect. CADTH does not guarantee and is not responsible for the quality, currency, propriety, accuracy, or reasonableness of any statements, information, or conclusions contained in any third-party materials used in preparing this document. The views and opinions of third parties published in this document do not necessarily state or reflect those of CADTH.
CADTH is not responsible for any errors, omissions, injury, loss, or damage arising from or relating to the use (or misuse) of any information, statements, or conclusions contained in or implied by the contents of this document or any of the source materials.
This document may contain links to third-party websites. CADTH does not have control over the content of such sites. Use of third-party sites is governed by the third-party website owners’ own terms and conditions set out for such sites. CADTH does not make any guarantee with respect to any information contained on such third-party sites and CADTH is not responsible for any injury, loss, or damage suffered as a result of using such third-party sites. CADTH has no responsibility for the collection, use, and disclosure of personal information by third-party sites.
Subject to the aforementioned limitations, the views expressed herein are those of CADTH and do not necessarily represent the views of Canada’s federal, provincial, or territorial governments or any third-party supplier of information.
This document is prepared and intended for use in the context of the Canadian health care system. The use of this document outside of Canada is done so at the user’s own risk.
This disclaimer and any questions or matters of any nature arising from or relating to the content or use (or misuse) of this document will be governed by and interpreted in accordance with the laws of the Province of Ontario and the laws of Canada applicable therein, and all proceedings shall be subject to the exclusive jurisdiction of the courts of the Province of Ontario, Canada.
The copyright and other intellectual property rights in this document are owned by CADTH and its licensors. These rights are protected by the Canadian Copyright Act and other national and international laws and agreements. Users are permitted to make copies of this document for non-commercial purposes only, provided it is not modified when reproduced and appropriate credit is given to CADTH and its licensors.
About CADTH: CADTH is an independent, not-for-profit organization responsible for providing Canada’s health care decision-makers with objective evidence to help make informed decisions about the optimal use of drugs, medical devices, diagnostics, and procedures in our health care system.
Funding: CADTH receives funding from Canada’s federal, provincial, and territorial governments, with the exception of Quebec.
Questions or requests for information about this report can be directed to Requests@CADTH.ca