Assessing the Reliability of Hospital Performance Measures: A Test-Retest Approach

This document examines the reliability of hospital performance measures using the intra-class correlation coefficient (ICC) and testing methods such as split-sample analysis and bootstrapping. The analyses are based on Medicare FFS datasets and focus on risk-standardized hospital visit rates and risk-standardized readmission rates. The document also discusses the treatment of smaller-volume hospitals and the effect of hierarchical logistic regression models on test-retest reliability.

Split-Half Reliability Method Examples

Example 1

We tested the reliability of the facility measure score by calculating the intra-class correlation coefficient (ICC) of the measure score. To calculate the ICC, we used the Medicare FFS FY 2012-2015 Dataset. For ASCs with two or more urology procedures, these procedures were randomly split into two samples (2 years of combined data for each sample). The ICC evaluates the agreement between the risk-standardized hospital visit rates (RSHVRs) calculated in the two randomly selected samples.

The ICC(2,1) score of 0.45, calculated for two years of data, indicates moderate measure score reliability.

Example 2

We tested the reliability of the facility measure score by calculating the intra-class correlation coefficient (ICC) of the measure score. To calculate the ICC, we used the Medicare FFS CYs 2012-2015 Dataset. For ASCs with two or more general surgery procedures, these procedures were randomly split into two samples within each facility. ASCs with only one procedure were randomly assigned to one of the two samples. The ICC evaluated the agreement between the risk-standardized hospital visit rates (RSHVRs) calculated in the two randomly selected samples [1].

The ICC(2,1) score of 0.530, calculated for four years of data, indicates moderate measure score reliability.

Example 3

We defined reliability as described by Lord and Novick using split-sample methodology. (Lord FM, Novick MR. Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley; 1968)

Using split-sample methodology, FTR had a split-half sample correlation estimate of 0.32, with the upper bound on validity (provided by the square root of the Spearman-Brown reliability correction) being 0.56.
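For reference, the validity bound invoked here is the standard psychometric relation, stated generically rather than as a recomputation of the figures above: a measure's correlation with any external criterion is bounded above by the square root of its reliability,

\text{validity} \le \sqrt{\rho_{xx}}.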

Example 4

The reliability of a measurement is the degree to which repeated measurements of the same entity agree with each other. For measures of hospital performance, the measured entity is naturally the hospital, and reliability is the extent to which repeated measurements of the same hospital give similar results. In line with this thinking, our approach to assessing reliability is to consider the extent to which assessments of a hospital using different but randomly selected subsets of patients produce similar measures of hospital performance. That is, we take a "test-retest" approach in which hospital performance is measured once using a random subset of patients, then measured again using a second random subset exclusive of the first, and finally the agreement between the two resulting performance measures is compared across hospitals (Rousson et al., 2002).

For test-retest reliability, we combined index admissions from successive measurement periods into one dataset, randomly sampled half of the patients within each hospital, calculated the measure for each hospital, and repeated the calculation using the second half. Thus, each hospital is measured twice, but each measurement is made using an entirely distinct set of patients. To the extent that the calculated measures of these two subsets agree, we have evidence that the measure is assessing an attribute of the hospital, not of the patients. As a metric of agreement we calculated the intra-class correlation coefficient (ICC) (Shrout and Fleiss, 1979) and assessed the values according to conventional standards (Landis and Koch, 1977). Specifically, we split Dataset 1 into two samples and calculated the RSRR for each hospital in each sample. The agreement of the two RSRRs across hospitals was quantified using the intra-class correlation ICC(2,1) as defined by Shrout and Fleiss (1979).
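As a concrete illustration, here is a minimal sketch of this split-sample procedure in Python. The DataFrame df and its columns hospital_id and outcome are hypothetical, and a raw per-hospital rate stands in for the risk-standardized rates that the source derives from a hierarchical model; the ICC(2,1) computation follows Shrout and Fleiss (1979).

```python
# Minimal sketch of the split-sample test-retest procedure described above.
# Assumed inputs (not from the source): a pandas DataFrame `df` with
# patient-level columns "hospital_id" and "outcome" (1 = event, 0 = none).
import numpy as np
import pandas as pd

def split_sample_icc21(df: pd.DataFrame, seed: int = 0) -> float:
    rng = np.random.default_rng(seed)

    # Randomly split patients in half *within* each hospital.
    half_a, half_b = [], []
    for _, group in df.groupby("hospital_id"):
        idx = rng.permutation(len(group))
        half_a.append(group.iloc[idx[: len(group) // 2]])
        half_b.append(group.iloc[idx[len(group) // 2:]])

    # Measure each hospital twice, once per disjoint half. A raw rate is a
    # placeholder for the hierarchical-model risk-standardized rate.
    rate_a = pd.concat(half_a).groupby("hospital_id")["outcome"].mean()
    rate_b = pd.concat(half_b).groupby("hospital_id")["outcome"].mean()
    paired = pd.concat([rate_a, rate_b], axis=1, keys=["a", "b"]).dropna()

    # ICC(2,1): two-way random effects, absolute agreement (Shrout & Fleiss).
    x = paired.to_numpy()
    n, k = x.shape                      # n hospitals, k = 2 measurements
    row_mean, col_mean, grand = x.mean(axis=1), x.mean(axis=0), x.mean()
    msr = k * ((row_mean - grand) ** 2).sum() / (n - 1)  # between hospitals
    msc = n * ((col_mean - grand) ** 2).sum() / (k - 1)  # between samples
    sse = ((x - row_mean[:, None] - col_mean[None, :] + grand) ** 2).sum()
    mse = sse / ((n - 1) * (k - 1))                      # residual
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
```

In the source's actual analysis, each half would first be run through the full risk-standardization model before the ICC is computed.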

Using two independent samples provides a stringent estimate of the measure's reliability, compared with using two random but potentially overlapping samples, which would exaggerate the agreement. Moreover, because our final measure is derived using hierarchical logistic regression, and a known property of hierarchical logistic regression models is that smaller-volume hospitals contribute less 'signal', a split sample using a single measurement period would introduce extra noise. This leads to an underestimate of the actual test-retest reliability that would be achieved if the measure were reported using the full measurement period, as evidenced by the Spearman-Brown prophecy formula (Spearman 1910; Brown 1910), which estimates the reliability of the measure if the whole cohort were used, based on an estimate from half the cohort.
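For reference, the Spearman-Brown prophecy formula cited here is the standard one: a reliability estimate \rho_{1/2} from half the cohort projects to a full-cohort reliability of

\rho_{\text{full}} = \frac{2\,\rho_{1/2}}{1 + \rho_{1/2}}.

As an illustration (our arithmetic, not a figure reported by the source), the split-half ICC of 0.55 reported below would project to 2(0.55)/(1.55) ≈ 0.71 for the full three-year sample.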

There were 991,007 admissions in the combined 3-year sample, with 494,297 in one sample and 496,710 in the other randomly selected sample. The agreement between the two RSRRs for each hospital was 0.55, which according to the conventional interpretation is "moderate" (Landis & Koch, 1977). Note that this analysis was limited to hospitals with 12 or more cases in each split sample. The intra-class correlation coefficient is based on a split sample of three years of data, resulting in a volume of patients in each sample equivalent to only 1.5 years of data, whereas the measure is reported with the full three years of data. The correlation coefficient is expected to be higher using the full three-year sample since it would include more patients.

Example 5

To test the reliability of facility-level risk-standardized readmission rates (RSRRs), we calculated the intra-class correlation coefficient (ICC) using a test-retest approach that examines the agreement between repeated measures of the same IPF for the same time period. The randomly sampled sets of admissions from a given hospital are assumed to reflect independent re-measurements of the readmission rate for that hospital. Good reliability is indicated if the risk-standardized measure rates calculated from the random datasets for the same IPF are similar. Higher ICC values indicate stronger agreement and, hence, better measure reliability.

We used two test-retest approaches to generate independent samples of patients within the same IPF: a split-half sampling design and bootstrapping. For split-half sampling, we randomly sampled half of all admissions within each IPF and calculated the measure separately for each half. To the extent that the calculated measures of these two subsets agree, we have evidence that the measure is assessing an attribute of the hospital, not of the patients. As a metric of agreement, we calculated the Pearson correlation between the performance rate estimates: the higher the correlation, the higher the reliability of the measure. In order to produce estimates that are as stable as possible, we repeated this approach 1,000 times, a technique known as bootstrapping [1].
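A minimal sketch of this repeated split-half correlation, under the same assumptions as the earlier sketch (a hypothetical DataFrame df with hospital_id and outcome columns, and a raw rate standing in for the risk-standardized estimate):

```python
# Minimal sketch of the repeated split-half ("bootstrap") correlation.
import numpy as np
import pandas as pd
from scipy.stats import pearsonr

def repeated_split_half(df: pd.DataFrame, n_reps: int = 1000,
                        min_patients: int = 60):
    # Keep hospitals with at least 60 patients (30 per half), mirroring the
    # reporting cutoff described in the next paragraph.
    counts = df.groupby("hospital_id")["outcome"].size()
    df = df[df["hospital_id"].isin(counts[counts >= min_patients].index)]

    corrs = []
    for rep in range(n_reps):
        rng = np.random.default_rng(rep)
        # Random within-hospital assignment of each patient to half A or B.
        in_a = df.groupby("hospital_id")["outcome"].transform(
            lambda g: rng.permutation(len(g)) < len(g) // 2
        ).astype(bool)
        rate_a = df[in_a].groupby("hospital_id")["outcome"].mean()
        rate_b = df[~in_a].groupby("hospital_id")["outcome"].mean()
        r, _ = pearsonr(rate_a, rate_b.loc[rate_a.index])
        corrs.append(r)

    corrs = np.asarray(corrs)
    # Point estimate and a 95% interval across the repeated splits.
    return corrs.mean(), np.percentile(corrs, [2.5, 97.5])
```

Summarizing the 1,000 correlations by their mean and a 95% interval mirrors how the results below report a point estimate with a 95% confidence interval.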

Because we expect hospitals with relatively few cases to have less reliable estimates, we only included scores for hospitals with at least 60 patients in the reliability calculation (i.e., with 30 patients in each of the split samples). This approach is consistent with a reporting strategy that includes smaller hospitals in the measure calculation, but does not publicly release the measure score for smaller hospitals (i.e., labels them in public reporting as having “too few cases” to support a reliable estimate). We note that the minimum sample size for public reporting is a policy choice that balances competing considerations such as the reliability of the measure score and transparency for consumers, and that the cutoff used for this analysis is one of many that might be reasonably used.

In addition, we conducted a second analysis of measure reliability using the intra-class correlation coefficient (ICC) signal-to-noise method to determine a recommended minimum case count to maintain a moderate level of reliability. The ICC is estimated from the random effects model that produces the risk-standardized hospital visit rates, as ICC = V / (V + σ²), where V is the between-hospital variance and σ² is the sampling variance of the estimated provider-level results. Because π²/3 is the sampling variance of the logistic distribution, the ICC of the measure, which is based on a logit model, is ICC = V / (V + π²/3).

We used the intercept variance from the hierarchical logit models used to estimate the measure (0. for inpatient admission, and 0.1108 for ED visits) as the estimate of the between variance. The ICC can be used to calculate the reliability (R) of individual hospitals using the formula R = N / (N + (1 − ICC)/ICC) [1]. The case size required for a given R is N = R(1 − ICC) / (ICC(1 − R)). We looked for the N required to maintain a reliability level of 0.4 or higher.
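A minimal sketch of this signal-to-noise calculation, using only the formulas and the 0.1108 ED-visit intercept variance stated above (the inpatient variance is truncated in the source, so it is not reproduced here):

```python
# Minimum case count for a target reliability, per the formulas above.
import math

def icc_from_variance(v_between: float) -> float:
    # Logit-model sampling variance is pi^2 / 3.
    return v_between / (v_between + math.pi ** 2 / 3)

def min_cases(icc: float, r_target: float = 0.4) -> int:
    # Smallest N with R = N / (N + (1 - ICC)/ICC) >= r_target.
    return math.ceil(r_target * (1 - icc) / (icc * (1 - r_target)))

# The ED-visit intercept variance reported in the text (0.1108) yields an
# ICC of ~0.033 and a minimum of 20 cases, matching the result below.
print(min_cases(icc_from_variance(0.1108)))  # -> 20
```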

There were 942 hospitals with ≥60 patients in their cohorts in the full sample. This sample was randomly split 1,000 times and the Pearson correlation was calculated each time. For the inpatient admission measure, on average, the agreement between the two hospital visit rates for each hospital was 0. (95% confidence interval (CI) = 0.37-0.45), which according to conventional interpretation is "moderate." For the ED visit measure, on average, the agreement between the two hospital visit rates for each hospital was 0.270 (95% CI = 0.22-0.33), which according to conventional interpretation is "moderate."

In addition, we found that to achieve a reliability (ICC) of 0.4, only 25 patients are required for the inpatient admission rate and 20 patients for the ED visit rate per performance period.

Citations

  1. Rousson V, Gasser T, Seifert B. Assessing intrarater, interrater and test–retest reliability of continuous measurements. Statistics in Medicine 2002; 21:3431-3446.