- Open Access
Quality and transparency of reporting derivation and validation prognostic studies of recurrent stroke in patients with TIA and minor stroke: a systematic review
Diagnostic and Prognostic Research volume 6, Article number: 9 (2022)
Clinical prediction models/scores help clinicians make optimal evidence-based decisions when caring for their patients. To critically appraise such prediction models for use in a clinical setting, essential information on the derivation and validation of the models needs to be transparently reported. In this systematic review, we assessed the quality of reporting of derivation and validation studies of prediction models for the prognosis of recurrent stroke in patients with transient ischemic attack or minor stroke.
MEDLINE and EMBASE databases were searched up to February 04, 2020. Studies reporting development or validation of multivariable prognostic models predicting recurrent stroke within 90 days in patients with TIA or minor stroke were included. Included studies were appraised for reporting quality and conduct using a select list of items from the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) Statement.
After screening 7026 articles, 60 eligible articles were retained, consisting of 100 derivation and validation studies of 27 unique prediction models. Four models were newly derived while 23 were developed by validating and updating existing models. Of the 60 articles, 15 (25%) reported an informative title. Among the 100 derivation and validation studies, few reported whether assessment of the outcome (24%) and predictors (12%) was blinded. Similarly, sample size justifications (49%), description of methods for handling missing data (16.1%), and model calibration (5%) were seldom reported. Among the 96 validation studies, 17 (17.7%) clearly reported on similarity (in terms of setting, eligibility criteria, predictors, and outcomes) between the validation and the derivation datasets. Items with the highest prevalence of adherence were the source of data (99%), eligibility criteria (93%), measures of discrimination (81%) and study setting (65%).
The majority of derivation and validation studies for the prognosis of recurrent stroke in TIA and minor stroke patients suffer from poor reporting quality. We recommend that all prediction model derivation and validation studies follow the TRIPOD statement to improve transparency and promote uptake of more reliable prediction models in practice.
The protocol for this review was registered with PROSPERO (Registration number CRD42020201130).
Clinical prediction models (also called clinical prediction rules, clinical prediction scores, clinical decision rules, or prognostic models) aid clinicians in making diagnostic and therapeutic decisions at the bedside and reduce inefficient provision of resources when presented and applied appropriately [1,2,3]. Such tools are commonly used by clinicians, and especially in the emergency department, to identify patients at high risk of stroke as it is not practical to treat everyone due to limited resources. Transient ischemic attack (TIA, i.e. a cerebral ischemia without lasting symptoms) and minor stroke carry a serious risk of subsequent stroke or death shortly after diagnosis, and thus represent an opportunity for stroke prevention .
To maximize the accuracy and clinical utility of clinical prediction models, they need to go through at least two consecutive phases: derivation including internal validation and external validation (evaluation to check accuracy in an independent population and setting) [2, 5,6,7,8,9]. Numerous methodological standards and guides have been developed for clinical prediction modelling [3, 6, 10,11,12,13,14]. Unfortunately, several systematic reviews have indicated shortcomings in methodological quality of many existing prediction studies [14, 15]. In addition to methodological standards for the development and validation of clinical prediction models, appraisal guidelines such as the CHecklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies (CHARMS)  and the Prediction model Risk Of Bias ASsessment Tool (PROBAST)  have been developed for data extraction and critical appraisal and assessment of risk of bias in modelling studies. To use such appraisal guides to their full extent, essential items need to be reported in the paper deriving or validating a prediction model. The strengths and weaknesses of a prediction model study can only be revealed with full and transparent reporting to enable its interpretation and usefulness and enhance the uptake and implementation of validated models for use in clinical settings . Complete and transparent reporting also support future prediction model studies by allowing researchers to validate and compare existing prediction models . For this reason, reporting guidelines such as the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) have been developed for studies developing, validating, or updating a prediction model .
Several clinical prediction models exist for the prognosis of stroke in patients with TIA and minor stroke. To critically appraise their methodological quality and make recommendations about future updates and/or adoption in clinical practice, better reporting quality of prediction models is essential. Although several systematic reviews have been conducted of the quality of reporting of prediction model studies in various other clinical domains [15, 20,21,22,23,24,25,26,27,28], to our knowledge, there have been no reviews of the reporting quality of prognostic models for stroke. Thus, we aimed to identify existing clinical prediction models and assess their reporting quality using the recommendations in the TRIPOD statement.
The overall goal of this study is to critically appraise existing derivation and validation studies of prediction models, in terms of reporting quality, for the prognosis of recurrent stroke within 90 days in patients with TIA or minor stroke. The specific objectives are:
To identify and characterize derivation and validation studies of existing multivariable clinical prediction models in the literature for the prognosis of recurrent stroke within 90 days in patients with TIA or minor stroke;
To characterize the quality of reporting of a select list of essential items for both the derivations and validations.
This review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement (See Fig. 1 and Additional file 1) . To help us guide the framing of the review aim, search strategy, and study inclusion and exclusion criteria, we used key items from the TRIPOD and the CHARMS Checklist, presented in additional files (See Supplementary Table 1, Additional file 2).
We conducted comprehensive electronic searches to identify all published studies of clinical prediction models for the prognosis of recurrent stroke within 90 days in patients with TIA or minor stroke.
We searched the Ovid interface, as well as Medline and Embase databases using a strategy that included the National Library of Medicine’s Medical Subject Headings (MeSH) and non-MeSH keywords up to February 04, 2020. The search strategy used TIA and stroke search syntax and used established search filters for prediction models [30,31,32,33]. Due to the low quality of reporting of prediction modelling studies, we further modified the search filter by including additional and more specific search terms to be more comprehensive in identifying relevant studies. We developed the search strategy with the help of an experienced medical librarian. It was later validated by a second medical librarian in accordance with the Peer Review of Electronic Search Strategies (PRESS) guidelines . A copy of the search strategy can be found in additional files (See Additional file 3). We also used Google Scholar to search and find any additional articles by searching for relevant keywords in the Google Scholar search engine. In addition, we searched the citations from the included studies for additional eligible studies of clinical prediction models.
Inclusion and exclusion criteria
This systematic review focused on studies of any design that developed or validated multivariable prediction models or scoring rules for the prognosis of recurrent stroke within 90 days for patients diagnosed with TIA or minor stroke.
We considered a clinical prediction model to be any tool that combined at least two predictors to estimate a probability or score for the outcome of stroke within 90 days. We excluded studies that investigated a single predictor, test, or marker. We also excluded studies that investigated only causality between one or more variables and an outcome, and predictor finding studies, i.e. studies which aim to explore which predictors, out of a number of candidate predictors, are independently associated with a diagnostic or prognostic outcome rather than deriving or validating a prediction model . We excluded five studies in languages other than English. We did not apply publication date restrictions.
After retrieving the potentially relevant articles from the search, we imported the records into Covidence (https://www.covidence.org). We removed duplicates and two reviewers (KEA and KY) independently screened the title, abstract, and keywords. Prior to embarking on screening, the two reviewers screened a sample of records as part of a training and calibration exercise, and the screening criteria were clarified where necessary.
We retrieved the full text of the articles that were considered potentially relevant by at least one of the reviewers in the title or abstract screen. One of the reviewers then reviewed the full text of each of the articles to determine eligibility. We excluded articles for which we could not obtain full text.
One reviewer (KEA) extracted data from each included study using an extraction form that was specifically designed and pilot tested for the review. We extracted data separately for each derivation or validation cohort (hereafter called cohort) in a publication.
The data extraction form was based on selected items from the TRIPOD and CHARMS checklists for reporting and critical appraisal of prognostic model studies. We focused on items that are essential for the appraisal of derivation and validation studies. Although we initially set out to assess both reporting quality and quality of methodological conduct, we ultimately decided to focus on quality of reporting as our appraisal of methodological conduct was hampered by a lack of clarity in the reporting. For feasibility reasons and to keep the project manageable, we did not assess adherence to all the TRIPOD items. We extracted information pertaining to items 1, 3 through 9, 13, 14, 16, and 22 of the TRIPOD statement, covering the title, background and rationale, methods, and results along with information about funding that was thought to be particularly relevant to the field; we did not extract information about items 2, 10, 11, 15, 17 through 21 covering the abstract, specifics of statistical analysis methods, model specification, discussion, and other information (supplemental information) sections. Some TRIPOD items are applicable to both derivation and validation studies, while others are applicable to validation studies only. We distinguished between validation studies without updating and validation studies with updating (i.e. studies developing an updated model based on an existing model). For validation studies with updating, we also extracted whether the update was performed in accordance with suggested methodology by Su and colleagues . In particular, we extracted whether the order of performing the update was as follows: (a) recalibrating the intercept only, (b) recalibrating the intercept and adjusting the other regression coefficients by a common factor, (c) category b plus extra adjustment of a subset of the existing coefficients to a different strength, (d) category c plus adding new predictors, (e) re-estimating all of the original regression coefficients, and (f) category e plus adding new additional predictors . This order of performing the update was recommended so that authors consider less extensive update methods prior to considering more extensive revisions. For example, it might suffice to recalibrate the intercept only to improve the performance of the model in the setting it is being validated in.
A list of extracted items can be found in additional files (See Supplementary Table 2, Additional file 2).
As this was a study on quality of reporting, we did not perform risk of bias assessment on individual studies. We have registered the protocol for this study with PROSPERO (Registration number CRD42020201130) prior to extracting the data.
We summarized and presented the results of this systematic review using descriptive statistics (numbers and percentages). We classified each item as adherent, not adherent, or unclear or not reported. We combined a few studies classified as non-adherent with unclear or not reported to create a combined category of “Non-adherence, unclear, or not reported”. In other words, we consider not reporting or unclear reporting as a form of non-adherence.
Our database search identified 9036 articles. After removal of duplicates, 7026 titles and abstracts were screened for eligibility. After title and abstract screening, 6696 records were excluded leaving 330 full-text articles for eligibility assessment. Two additional articles were later identified by hand searching the reference lists. A total of 60 articles ultimately met our inclusion criteria (Fig. 1). Reasons for exclusion are outlined in Fig. 1.
A total of 100 derivation and validation studies were performed on 27 prediction models within 60 published articles. Among the 100 derivation and validation studies, 27 were classified as prediction model development (either anew or by model updating) while the remaining 73 were classified as validation. Of the 27 prediction models, four were newly developed (i.e. not based on existing models) while 23 were developed based on existing models (i.e. validation with model updating). All 60 articles were published between 2000 and 2020. The studies were conducted in 18 different countries, mostly the UK (8 or 13.3%) and China (8 or 13.3%) followed by the USA (7 or 11.7%). Countries with three or fewer studies were grouped together under the other category and included the following countries: Austria, Bulgaria, Canada, Germany, Greece, Italy, Japan, Norway, Singapore, Spain, Sweden, Switzerland, and Turkey. The country of the study was not reported for 4 (6.7%) of the studies.
A list of the included studies can be found in the additional files (See Additional file 4). Distribution of included studies by year of publication is presented in Fig. 2.
Reporting of items applicable at the article level
Results of reporting of items applicable at the article level are presented in Table 1. According to the TRIPOD statement, an informative study title entails identifying the study as derivation or validation of a multivariable prediction model, the target population, and the outcome to be predicted (TRIPOD item 1). Fifteen out of 60 studies (25.0%) adhered to the title recommendations. The source and role of funders (TRIPOD item 22) was provided by 14 (23.3%) of the 60 published articles.
Reporting of essential items common to both derivation and validation studies
Results of reporting of essential items common to derivation and external validation studies are presented in Table 2.
Introduction (TRIPOD item 3)
Background information includes the medical context and rationale for deriving or validating the model. Background information was provided in 66 (66%) derivation and validation studies.
Methods (TRIPOD items 4–12)
Study design or source of data was reported in 99 (99%) of the 100 derivation and validation studies while key study dates (i.e. start and end dates of cohort data collection) were reported in 95 (95%) and study setting (i.e. tertiary, community, or both) was reported in 65 (65%). Fifty-six (56%) recruited participants from multiple sites with 41 (41%) recruiting from one site. Fifty-one (51%) provided information on their recruitment methods of which 50 (50%) recruited consecutive participants. Eligibility criteria for participants was clearly reported in 93 (93%) of the derivation and validation studies.
A clear outcome definition (clinical vs tissue-based) was reported in 80 (80%) of the derivation and validation studies. Forty-four (44%) conveyed that the same outcome definition and method of measurement were used in all patients. The majority, 73 (73%), used a 90-day outcome of stroke followed by a 7-day outcome in 66 (66%) followed by other combinations of outcome time periods. Assessment of the outcome without knowledge of the candidate predictors was reported in 24 (24%). In terms of predictor definitions and measurement, a total of 63 (63%) of the derivation and validation studies reported information on how to measure all measurement predictors while 47 (47%) reported information on when to measure all measurement predictors. Assessment of predictors without knowledge of the outcome or other predictors was reported in 12 (12%).
Sample size justification was reported in 49 (49%) of the derivation and validation studies. Information on missing data and methods of handling the missing data were reported in 15 (16.1%) out of 93 applicable derivations and validations with possible missing data. One (1.1%) study reported that a multiple imputation method was used. Information on risk group creation was provided in 25 (25%).
Results (TRIPOD items 13–17)
The flow of participants was reported in 21 (21%) of the derivations and validation studies. Thirty-four (34%) reported information about the prevalence of participants with any missing values, 28 (28%) reported the number of participants with missing data for each predictor, while 7 (7%) reported that there were no missing data.
Measures of discrimination were reported in 81 (81%) of the derivation and validation studies with c-statistic being the most commonly used measure (79%); measures of calibration were reported in 5 (5%), all of which used the Hosmer-Lemeshow test. Sensitivity and specificity were both reported in 28 (28%) while net reclassification improvement and predictive values were reported in 13 (13%) and 11 (11%) respectively.
Reporting of essential items relevant to all validations
Results of reporting of essential items relevant to validation studies only are presented in Table 3.
Methods (TRIPOD items 4–12 relevant to validations only)
Among the 96 validation studies, 17 (17.7%) mentioned either similarities or differences in definitions of all four of setting, eligibility criteria, predictors, and outcomes between the validation and derivation of the model. Similarly, the distribution of important variables of at least age and sex was presented along with the development study counterparts for 21 (21.9%) of the validations.
As for model evaluation, the type of external validation (e.g. temporal, geographical, or methodological) was provided for 94 (97.9%) of the validation studies. The majority or 64 (66.7%) of the external validations were geographical. Of the 96 external validations, 23 were external model validations with model updating.
Reporting of essential items applicable to validations with updating only
Table 4 presents the results of reporting of essential items that are applicable to validation studies with updating only. A rationale for updating the model was provided by all 23 studies. Two (8.7%) studies reported that they have attempted to update the model with less extensive revisions prior to considering more extensive revisions. One study reported applying a method of shrinkage of predictor weights or regression coefficients. When it comes to reporting the results of the updated models, 13 (56.5%) of the studies reported such results (e.g. model specification, model performance, recalibration).
A summary of the quality of reporting can be found in Fig. 3.
Summary of main findings
We assessed the quality of reporting of derivation and validation studies of prediction models for the prognosis of recurrent stroke in patients with TIA and minor stroke. We found inadequate reporting against selected items in TRIPOD. Items that were especially poorly reported included an informative title, blind assessment of outcome and predictors, sample size justification, use of shrinkage methods, reporting and handling of missing data, reporting of all performance measurements, and comparability between the validation and derivation dataset. Source of data, eligibility criteria, and study setting had better quality of reporting.
Incomplete reporting of blinding is detrimental to assessment of risks of bias and appraisals of the quality of prognostic models. Inadequate sample sizes, a common problem with prediction models, could lead to overfitting and the performance of a prognostic model being overestimated [36, 37]. Sample size justifications for derivation and validation studies are often based on the concept of events per variable (EPV) . However, there is disagreement as to what the EPV should be [10, 38]. More recently, sample size calculation methods have been developed based on the total number of participants, the number of events relative to the number of potential candidate predictor parameters, the outcome proportion (incidence) in the study population, and the expected predictive performance of the model . Missing data can present several problems including reduction in statistical power, bias in the estimation of parameters due to data loss, reduction in representativeness of the samples, and complications in the analysis of the study leading to invalid conclusions such as distorted performance of the prediction model . Failure to adequately report on measures of discrimination and calibration prevents users from making informed decisions about the likely accuracy of the model when used in practice. When validating a model externally, similarities or differences in setting, eligibility criteria, predictors, and outcomes between the validation and derivation dataset need to be reported to understand the extent of reproducibility and generalizability of the model.
As in other systematic reviews, we found that some items are less well reported than others. Although it would be ideal to have all items completely reported, they are not equally important for the appraisal of a prediction model’s performance. However, they are all still important for the likelihood of future validation and uptake in clinical practice. For example, if information on handling of missing data is not reported, we cannot be certain of the true performance of the prediction model; on the other hand, if the title is not adequately reported as recommended in the TRIPOD statement, it does not mean that the quality of the prediction model is compromised, although retrieval of studies for possible updating would be compromised.
Comparison with other reviews
Our findings are comparable to those in several other systematic reviews of prediction models, published prior to and after the publication of the TRIPOD statement. A systematic review by Heus et al. assessed reporting quality of prediction model studies within 37 clinical domains . Their review excluded studies published prior to the publication of the TRIPOD statement in 2015. Reporting of background information, missing data, and calibration were better than in our review which may be explained by the fact that their review focused on high-impact journals only. Jiang et al.  studied the quality of reporting of derivation and validation of melanoma prediction model studies with no restriction on the date of publication. Their findings aligned with our findings although they found better reporting of blinding of predictors and comparability of validation and derivation datasets. A recent systematic review by Najafabadi et al. examined pre- and post-TRIPOD publications in seven high-impact medical journals and found that although there have been some improvements in the methodological conduct, such as better reporting of missing data, use of multiple imputations, reporting the full prediction model and reporting information on performance measures, the overall quality of reporting has not improved .
Strengths and limitations
To our knowledge, this is the first systematic review specifically evaluating the reporting quality of derivation and validation studies of prediction models for the prognosis of stroke in patients with TIA. Our results add to the growing number of studies finding poor reporting quality of prediction models.
Our study has some limitations. Although we did not extract information on all TRIPOD items, we extracted and reported most items with an emphasis on reporting rather than methodological conduct. An item that we missed extracting was the reporting of a full model equation. Although we have attempted to extract information on each item in accordance with the TRIPOD statement, we may have been overly conservative in our assessment of some items: for example, we considered blinding of outcome and predictors as reported if and only if they were explicitly mentioned by the authors.
Although we have searched through two of the major medical databases (Medline and Embase) with the help of two experienced medical librarians and with a sensitive search strategy, we may have missed some eligible articles due to inadequate reporting of information by some authors. In addition, we excluded five studies published in non-English languages due to language barriers. Furthermore, our search covered publications up to February 04, 2020, after which there may be additional publications. However, given that this is a review of the quality of reporting and the comparability of our findings to existing systematic reviews, it is unlikely that any possible missed articles would change the conclusion of this systematic review. A final limitation is that the full-text screening and data extractions were conducted by a single reviewer. However, given the objective nature of many items, it is unlikely that there would have been substantial misclassification.
Current reporting of multivariable prediction models for the prognosis of stroke in patients with TIA do not meet TRIPOD requirements for reporting. Essential items in need of improvement are providing an informative title, providing a justification for the sample size, providing information on missing data and handling of missing data, blinding of outcome and predictors, applying and reporting a shrinkage method, and clear reporting around comparability between the validation and derivation cohorts when validating. An example prediction model for the prognosis of stroke with a high number of items reported is the validation of the Canadian TIA score study . In addition to adhering to the TRIPOD statement, more comprehensive guidance with breakdown of items by study types and examples with templates would be helpful. In other words, to provide details of what is expected of authors for each item with examples and to provide templates for the authors working on prediction model development studies without external validation, prediction model development studies with external validation, and external model validation studies with or without model updating. In addition, transparent and complete reporting can be facilitated by journals and peer reviewers requiring that authors follow the TRIPOD guidelines when submitting a derivation or validation study. Finally, we found a large number of studies validating and updating existing models, in accordance with recommendations that a prediction model be updated rather than a new one created . However, additional guidance is required with respect to validating and updating a prediction model.
Availability of data and materials
CHecklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies
Events per variable
Medical Subject Headings
Peer Review of Electronic Search Strategies
Preferred Reporting Items for Systematic Review and Meta-Analysis
Prediction model Risk Of Bias ASsessment Tool
The International Prospective Register of Systematic Reviews
Transient ischemic attack
Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis
Reilly BM, Evans AT. Translating clinical research into clinical practice: impact of using prediction rules to make decisions. Ann Intern Med. 2006;144(3):201–9. https://doi.org/10.7326/0003-4819-144-3-200602070-00009.
Stiell IG, Bennett C. Implementation of clinical decision rules in the emergency department. Acad Emerg Med. 2007;14(11):955–9. https://doi.org/10.1197/j.aem.2007.06.039.
Steyerberg EW, Moons KGM, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Med. 2013;10(2):e1001381. https://doi.org/10.1371/journal.pmed.1001381.
Giles MF, Rothwell PM. Risk of stroke early after transient ischaemic attack: a systematic review and meta-analysis. Lancet Neurol. 2007;6(12):1063–72. https://doi.org/10.1016/S1474-4422(07)70274-0.
Laupacis A, Sekar N, Stiell IG. Clinical prediction rules. A review and suggested modifications of methodological standards. JAMA. 1997;277(6):488–94. https://doi.org/10.1001/jama.1997.03540300056034.
Stiell IG, Wells GA. Methodologic standards for the development of clinical decision rules in emergency medicine. Ann Emerg Med. 1999;33(4):437–47. https://doi.org/10.1016/S0196-0644(99)70309-4.
McGinn TG, Guyatt GH, Wyer PC, Naylor CD, Stiell IG, Richardson WS. Users’ guides to the medical literature: XXII: how to use articles about clinical decision rules. Evidence-Based Medicine Working Group. JAMA. 2000;284(1):79–84. https://doi.org/10.1001/jama.284.1.79.
Lee TH. Evaluating decision aids: the next painful step. J Gen Intern Med. 1990;5(6):528–9.
Perry JJ, Stiell IG. Impact of clinical decision rules on clinical care of traumatic injuries to the foot and ankle, knee, cervical spine, and head. Injury. 2006;37(12):1157–65. https://doi.org/10.1016/j.injury.2006.07.028.
Cowley LE, Farewell DM, Maguire S, Kemp AM. Methodological standards for the development and evaluation of clinical prediction rules: a review of the literature. Diagnostic Progn Res. 2019;3(1):16. https://doi.org/10.1186/s41512-019-0060-y.
Moons KGM, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009;338(feb23 1):b375. https://doi.org/10.1136/bmj.b375.
Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J. 2014;35(29):1925–31. https://doi.org/10.1093/eurheartj/ehu207.
Steyerberg EW. Clinical prediction models : a practical approach to development, validation, and updating. New York: Springer; 2009. p. 497. https://doi.org/10.1007/978-0-387-77244-8.
Bouwmeester W, Zuithoff NPA, Mallett S, Geerlings MI, Vergouwe Y, Steyerberg EW, et al. Reporting and methods in clinical prediction research: a systematic review. Macleod MR, editor. PLoS Med. 2012;9(5):e1001221.
Collins GS, de Groot JA, Dutton S, Omar O, Shanyinde M, Tajar A, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol. 2014;14(1):40. https://doi.org/10.1186/1471-2288-14-40.
Moons KGM, de Groot JAH, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PLoS Med. 2014;11(10):e1001744. https://doi.org/10.1371/journal.pmed.1001744.
Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. 2019;170(1):51–8. https://doi.org/10.7326/M18-1376.
Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1–W73. https://doi.org/10.7326/M14-0698.
Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med. 2015;13(1):1. https://doi.org/10.1186/s12916-014-0241-z.
Heus P, Damen JAAG, Pajouheshnia R, Scholten RJPM, Reitsma JB, Collins GS, et al. Poor reporting of multivariable prediction model studies: towards a targeted implementation strategy of the TRIPOD statement. BMC Med. 2018;16(1):1–12.
Jiang M, Dragnev N, Wong S. Evaluating the quality of reporting of melanoma prediction models. Surgery. 2020;168(1):173–7. https://doi.org/10.1016/j.surg.2020.04.016.
Dhiman P, Ma J, Navarro C, Speich B, Bullock G, Damen J, et al. Reporting of prognostic clinical prediction models based on machine learning methods in oncology needs to be improved. J Clin Epidemiol. 2021;138:60–72. https://doi.org/10.1016/j.jclinepi.2021.06.024.
Andaur Navarro CL, Damen JAA, Takada T, Nijman SWJ, Dhiman P, Ma J, Collins GS, Bajpai R, Riley RD, Moons KGM, Hooft L. Completeness of reporting of clinical prediction models developed using supervised machine learning: a systematic review. BMC Med Res Methodol. 2022;22(1):12. https://doi.org/10.1186/s12874-021-01469-6.
Nagendran M, Chen Y, Lovejoy CA, Gordon AC, Komorowski M, Harvey H, et al. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ. 2020:368. https://doi.org/10.1136/bmj.m689.
Najafabadi AHZ, Ramspek CL, Dekker FW, Heus P, Hooft L, Moons KGM, et al. TRIPOD statement: a preliminary pre-post analysis of reporting and methods of prediction models. BMJ Open. 2020;10(9):e041537. https://doi.org/10.1136/bmjopen-2020-041537.
Bouwmeester W, Zuithoff NPA, Mallett S, Geerlings MI, Vergouwe Y, Steyerberg EW, et al. Reporting and methods in clinical prediction research: a systematic review. PLoS Med. 2012;9(5):e1001221. https://doi.org/10.1371/journal.pmed.1001221.
Takemura T, Kataoka Y, Uneno Y, Otoshi T, Matsumoto H, Tsutsumi Y, et al. The reporting quality of prediction models in oncology journals: a systematic review. Ann Oncol. 2018;29:ix171.
Yusuf M, Atal I, Li J, Smith P, Ravaud P, Fergie M, et al. Reporting quality of studies using machine learning models for medical diagnosis: a systematic review. BMJ Open. 2020;10(3):e034568. https://doi.org/10.1136/bmjopen-2019-034568.
Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. PLoS Med. 2021;18(3):e1003583. https://doi.org/10.1371/journal.pmed.1003583.
Wilczynski NL, Haynes RB, Hedges Team. Developing optimal search strategies for detecting clinically sound prognostic studies in MEDLINE: an analytic survey. BMC Med. 2004;2(1):23. https://doi.org/10.1186/1741-7015-2-23.
Ingui BJ, Rogers MA. Searching for clinical prediction rules in MEDLINE. J Am Med Inform Assoc. 2001;8(4):391–7. https://doi.org/10.1136/jamia.2001.0080391.
Wong SS, Wilczynski NL, Haynes RB, Ramkissoonsingh R; Hedges Team. Developing optimal search strategies for detecting sound clinical prediction studies in MEDLINE. AMIA Annu Symp Proc. 2003;2003:728–32.
Geersing G-J, Bouwmeester W, Zuithoff P, Spijker R, Leeflang M, Moons K, et al. Search filters for finding prognostic and diagnostic prediction studies in medline to enhance systematic reviews. Smalheiser NR, editor. PLoS One. 2012;7(2):e32844.
McGowan J, Sampson M, Salzwedel DM, Cogo E, Foerster V, Lefebvre C. PRESS peer review of electronic search strategies: 2015 Guideline Statement. J Clin Epidemiol. 2016;75:40–6. https://doi.org/10.1016/j.jclinepi.2016.01.021.
Su TL, Jaki T, Hickey GL, Buchan I, Sperrin M. A review of statistical updating methods for clinical prediction models. Stat Methods Med Res. 2018;27(1):185–97. https://doi.org/10.1177/0962280215626466.
Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368. https://doi.org/10.1136/bmj.m441.
Steyerberg EW, Borsboom GJJM, van Houwelingen HC, Eijkemans MJC, Habbema JDF. Validation and updating of predictive logistic regression models: a study on sample size and shrinkage. Stat Med. 2004;23(16):2567–86. https://doi.org/10.1002/sim.1844.
Courvoisier DS, Combescure C, Agoritsas T, Gayet-Ageron A, Perneger TV. Performance of logistic regression modeling: beyond the number of events per variable, the role of data structure. J Clin Epidemiol. 2011;64(9):993–1000. https://doi.org/10.1016/j.jclinepi.2010.11.012.
Kang H. The prevention and handling of the missing data. Korean J Anesthesiol Korean Soc Anesthesiol. 2013;64(5):402–6. https://doi.org/10.4097/kjae.2013.64.5.402.
Perry JJ, Sivilotti MLA, Émond M, Stiell IG, Stotts G, Lee J, et al. Prospective validation of Canadian TIA Score and comparison with ABCD2 and ABCD2i for subsequent stroke risk after transient ischaemic attack: multicentre prospective cohort study. BMJ. 2021;372. https://doi.org/10.1136/bmj.n49.
We are grateful to Lindsey Sikora and Amanda Hodgson, Health Sciences Research Liaison Librarians at the University of Ottawa, for their help in guiding us with the development of the search strategy.
The authors have no funding sources to report for this study. Dr. Perry is supported with a peer-reviewed unrestricted mid-career salary support grant from the Heart and Stroke Foundation of Ontario.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1.
Additional file 2 : Table S1.
Key items used to guide the framing of the review aim, search strategy, and study inclusion and exclusion criteria - and - Table S2. List of extracted items. This file contains adapted TRIPOD data elements used to design and extract the data.
Additional file 3.
Literature Search Strategy.
Additional file 4.
List of included studies.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Abdulaziz, K.E., Perry, J.J., Yadav, K. et al. Quality and transparency of reporting derivation and validation prognostic studies of recurrent stroke in patients with TIA and minor stroke: a systematic review. Diagn Progn Res 6, 9 (2022). https://doi.org/10.1186/s41512-022-00123-z
- Prediction models
- Prediction rules
- Clinical decision rules
- Risk scores
- Recurrent stroke
- Transient ischemic attack
- Cerebral ischemia
- Reporting quality