Inter-rater and Intra-rater Reliability in the Radiographic Diagnosis of Growth Arrest in Paediatric Physeal Fractures.
Banting N, Schaeffer EK, Bone J, Habib E, Hooper N, Reilly CW, Cooper A, Mulpuri K. Inter-rater and Intra-rater Reliability in the Radiographic Diagnosis of Growth Arrest in Paediatric Physeal Fractures. Indian J Orthop. May 2021. DOI: 10.1007/s43465-020-00327-9
Abstract
Background
Fractures through the physis account for 18–30% of paediatric fractures and can lead to growth arrest in 5–10% of these cases. Long-term radiographic follow-up is usually necessary to monitor for signs of growth arrest at the affected physis. Given plain radiographs of a physeal fracture obtained throughout patient follow-up, different surgeons may hold different opinions about whether or not early growth arrest has occurred despite using identical radiographs to guide decision-making. This study aims to assess the inter-rater and intra-rater reliability of early growth arrest diagnosis among orthopaedic surgeons given a set of identical plain radiographs.
Methods
A retrospective chart review was conducted on patients aged 2–18 years previously treated for a physeal fracture at a paediatric tertiary care hospital between 2011 and 2018. De-identified anteroposterior (AP) and lateral radiographs of 39 patients from the date of injury and minimum one-year post-injury were administered in a survey to international paediatric orthopaedic surgeons. Each surgeon was asked whether they would diagnose the patient with growth arrest based on the radiographs provided. Surgeons were asked to complete this process again two weeks after the initial review, but using identical shuffled radiographs. Inter-rater and intra-rater reliability was calculated using appropriate kappa statistics.
Results
A total of 11 paediatric orthopaedic surgeons completed the first round of the survey, and 9 of these 11 completed the second round. The inter-rater reliability for the first round was 0.22 [95% CI (0.06, 0.35)] and 0.21 [95% CI (0.02, 0.32)] for the second round. The average kappa for intra-rater reliability was − 0.05 [95% CI (− 0.31, 0.21)]. Comparison by injury side showed no significant variation in diagnosis {p = 0.509, OR = 0.90, [95% CI (0.67, 1.22)]}, while comparison by location of injury varied significantly (p = 0.003).
Conclusions
Radiographic diagnosis of growth arrest among paediatric orthopaedic surgeons demonstrated ‘fair’ inter-rater agreement and no intra-rater agreement, suggesting critical differences in identifying growth arrest on plain radiographs. Further research is necessary to develop an improved diagnostic approach for growth arrest among orthopaedic surgeons.
Level of Evidence
Diagnostic level III.