Can machine learning algorithms predict lab values?

Charna Albert

February 2020—At Massachusetts General Hospital, machine learning is being used in the laboratories to build next-level clinical decision support, and in the latest phase, it’s undergoing trial for use in predicting laboratory results.

“I think this is the new paradigm for cost-effective laboratory medicine. This is an important way we’re going to change how we do business,” says Anand Dighe, MD, PhD, who spoke about machine learning techniques for labs during a CAP19 presentation last fall and in a recent interview with CAP TODAY.

Dr. Baron

Dr. Dighe, director of clinical informatics and director of the core laboratory at MGH, has been working with other scientists and pathologists to make this vision a reality. He and colleague Jason Baron, MD, a pathologist and clinical informatician within the MGH core laboratory and an assistant professor of pathology at Harvard Medical School, enlisted the help of two computer scientists at Massachusetts Institute of Technology. Together they studied ways to use machine learning to predict laboratory values using the results from other lab tests in the patient’s medical record (Luo Y, et al. Am J Clin Pathol. 2016;145[6]:778–788; Luo Y, et al. J Am Med Inform Assoc. 2018; 25[6]:645–653).

The collaboration with MIT was “particularly fruitful,” Dr. Baron tells CAP TODAY, in integrating MGH clinical laboratory and clinical data science expertise with computer science from MIT. “Although many mature machine learning methods developed outside of health care were available for us to use, some were not well suited to clinical data.” Existing prediction models required finesse to handle important nuances of clinical data, he says. “For example, no outpatient has a CBC every day. It’s not like a stock market ticker.” (Finance drove the development of some machine learning algorithms.)

“We had to figure out novel algorithms that could provide useful information, even in the face of the missing data that is so common with laboratory results.” The development of these algorithms was a key contribution of their MIT collaborators Peter Szolovits, PhD, professor of computer science and engineering and head of the clinical decision-making group within the MIT computer science and artificial intelligence laboratory, and Yuan Luo, PhD, who is now chief AI scientist and associate professor of preventive medicine at Northwestern University Feinberg School of Medicine.

One target of their work was predicting ferritin results from other laboratory tests. The MIT researchers worked with Dr. Dighe, Dr. Baron, and colleagues to develop imputation algorithms—methods that allowed them to infer the missing lab test values needed to train the model. In stage one of the two-step process, they imputed the results for lab tests that hadn’t been performed (other than ferritin). In stage two, they took the measured and imputed values for the predictor tests and used those, in addition to basic patient characteristics, to predict ferritin results.
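In outline, the two-stage process can be sketched with standard tools. The following Python sketch is illustrative only, not the published model: it uses scikit-learn's IterativeImputer to fill in unmeasured predictor labs (stage one), then trains a gradient-boosted regressor on the measured-plus-imputed panel and basic patient characteristics to predict ferritin (stage two). The lab panel, the synthetic data, and the modeling choices are all assumptions.

```python
# Illustrative two-stage sketch: impute unmeasured labs, then predict ferritin.
# Data, panel composition, and model choices are hypothetical.
import numpy as np
import pandas as pd
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 500
labs = pd.DataFrame({
    "hemoglobin": rng.normal(13, 2, n),
    "mcv": rng.normal(88, 6, n),
    "transferrin_sat": rng.normal(25, 10, n),
    "crp": rng.lognormal(1.0, 1.0, n),
    "age": rng.integers(18, 90, n).astype(float),
    "female": rng.integers(0, 2, n).astype(float),
})
# Synthetic target; the relationship carries no clinical meaning.
ferritin = (40 + 8 * labs["crp"] - 1.5 * (labs["transferrin_sat"] - 25)
            + rng.normal(0, 20, n))

# Blank out ~30% of predictor values to mimic tests that were never ordered.
predictors = ["mcv", "transferrin_sat", "crp"]
labs[predictors] = labs[predictors].mask(rng.random((n, len(predictors))) < 0.3)

X_train, X_test, y_train, y_test = train_test_split(labs, ferritin, random_state=0)

# Stage one: impute the predictor labs that were not performed.
imputer = IterativeImputer(random_state=0)
X_train_imp = imputer.fit_transform(X_train)
X_test_imp = imputer.transform(X_test)

# Stage two: predict ferritin from measured + imputed values and demographics.
model = GradientBoostingRegressor(random_state=0)
model.fit(X_train_imp, y_train)
print("R^2 on held-out patients:", round(model.score(X_test_imp, y_test), 2))
```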

“When looked at in isolation, ferritin values can lead to misdiagnosis. Ferritin often increases from inflammation, so non-iron-deficient patients undergoing inflammatory responses may have elevated ferritin levels. And normal ferritin values can obscure when a patient is in fact iron-deficient,” Dr. Dighe says. One application of the ferritin algorithm is to look for discrepant results. When predicted and measured ferritin don’t agree, “that’s almost always an important signal for us.”

“In those cases,” Dr. Baron says, “the obvious thing to do would be to append a comment to the test result warning the clinician, ‘Don’t rule out iron deficiency on the basis of a normal ferritin alone.’”
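In code, such a check reduces to comparing measured and predicted values against a tolerance and, when they disagree, returning an interpretive comment. A minimal sketch, with an assumed two-fold tolerance and illustrative comment text, not MGH's rules:

```python
# Illustrative discrepancy check; the tolerance and comments are hypothetical.
from typing import Optional

DISCREPANCY_FACTOR = 2.0  # assumed tolerance: flag a more-than-two-fold gap

def ferritin_comment(measured: float, predicted: float) -> Optional[str]:
    """Return an interpretive comment when measured and predicted disagree."""
    ratio = max(measured, predicted) / max(min(measured, predicted), 1e-6)
    if ratio < DISCREPANCY_FACTOR:
        return None  # values agree; nothing appended
    if measured > predicted:
        # A "normal" measured value may be inflated, e.g. by inflammation.
        return ("Don't rule out iron deficiency on the basis of a normal "
                "ferritin alone.")
    return "Measured ferritin is lower than predicted; consider repeat testing."

print(ferritin_comment(measured=60.0, predicted=12.0))
```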

For now, implementation of the algorithm is on hold. “We didn’t have an obvious strategy for implementing it within our existing information systems,” Dr. Baron says.

Developing predictive models is only part of the solution, Dr. Dighe says. Many types of models will not be useful in improving patient care unless they are implemented as clinical decision support within existing workflows, processes, and health information systems, “and implementation can be challenging,” he says. Dr. Dighe and colleagues implemented a relatively straightforward, rule-based interpretive comment intended to flag substantially increasing creatinine values that may indicate acute kidney injury (Baron JM, et al. Am J Clin Pathol. 2015;143[1]:42–49).

Dr. Dighe

This AKI flag “was much more difficult to implement than we would have guessed,” Dr. Dighe says. Developing the flag required calculating a “baseline” creatinine for each patient and then flagging subsequent creatinine values that were increased from that baseline according to certain rules. “However, there was no straightforward way to calculate the baseline creatinine within the version of the lab information system we were using at the time. We had to develop a complex workaround.”
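The article does not spell out MGH's exact rules, but a baseline-and-delta creatinine flag of this general shape is straightforward to sketch. The thresholds below follow the widely used KDIGO criteria (a rise of at least 0.3 mg/dL within 48 hours, or a value at least 1.5 times baseline); the seven-day baseline window and the code itself are assumptions, not the MGH implementation:

```python
# Illustrative rule-based AKI flag; thresholds borrowed from KDIGO criteria,
# not from the MGH implementation described in the article.
from datetime import datetime, timedelta
from typing import List, Tuple

Creatinine = Tuple[datetime, float]  # (collection time, mg/dL)

def baseline_creatinine(history: List[Creatinine]) -> float:
    """Assumed baseline: the lowest creatinine over the prior seven days."""
    cutoff = history[-1][0] - timedelta(days=7)
    return min(v for t, v in history if t >= cutoff)

def aki_flag(history: List[Creatinine]) -> bool:
    """Flag a >=0.3 mg/dL rise within 48 h or a value >=1.5x baseline."""
    if len(history) < 2:
        return False
    t_new, v_new = history[-1]
    base = baseline_creatinine(history[:-1])
    rose_absolute = any(
        v_new - v >= 0.3
        for t, v in history[:-1]
        if t_new - t <= timedelta(hours=48)
    )
    return rose_absolute or v_new >= 1.5 * base

history = [
    (datetime(2020, 2, 1, 8, 0), 0.9),
    (datetime(2020, 2, 2, 8, 0), 1.1),
    (datetime(2020, 2, 3, 8, 0), 1.5),  # +0.4 mg/dL within 48 h
]
print(aki_flag(history))  # True
```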

The flagging rules provide a solution to the problem of overlooked AKI cases. While their current AKI flag identifies AKI only after the patient already has it, “the longer-term aim is to alert providers in advance that their patient is likely to develop AKI 24 hours or more into the future and perhaps even offer advice regarding actionable steps to take to reduce AKI risk,” Dr. Dighe says. One tack the team is taking involves extending their imputation work to forecast creatinine values into the future. “If future creatinine values are expected to increase, that could be a sign of AKI to come,” Dr. Dighe says.
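Forecasting creatinine "into the future" could take many forms; as one toy illustration (not the team's method), a linear trend fitted to a patient's recent values can be extrapolated 24 hours ahead and the projection fed to the same flagging rules:

```python
# Toy creatinine forecast: fit a linear trend and extrapolate 24 hours ahead.
# Sample times and values are hypothetical; illustrative only.
import numpy as np

hours = np.array([0.0, 12.0, 24.0, 36.0])     # hypothetical collection times
creatinine = np.array([0.9, 1.0, 1.1, 1.25])  # hypothetical results (mg/dL)

slope, intercept = np.polyfit(hours, creatinine, deg=1)
forecast = slope * (hours[-1] + 24.0) + intercept
print(f"Projected creatinine in 24 h: {forecast:.2f} mg/dL")  # ~1.47
# A projected rise could feed the same flagging rules as a measured value.
```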

The AKI algorithm was implemented at MGH more than five years ago, and provider feedback has been positive, with changes in treatment and decision-making resulting from the AKI flagging. “What we found from subsequent surveys one of our hospitalist colleagues did,” Dr. Dighe says, “was that more than 50 percent of clinicians had made a change in patient care based on the AKI flag.”

“Luckily, our LIS team here is very creative and they were able to implement it,” he says of the difficulty. “When you’re doing analysis for a paper, you can do all kinds of wonderful things, but you sometimes find yourself limited by technology when you try to implement them.”

It helped that the creatinine flag could be reduced to simple if/then rules and that acute kidney injury is a common health problem. “We had a lot of high-level clinical requests to make this go through,” Dr. Baron says, noting that the AKI flag affects roughly 10 percent of MGH’s inpatients. “As a result, we were willing to put a lot of resources in and spend a lot of IT time, and we had a lot of clinicians helping.”

If the AKI algorithm had been based on an artificial neural network or a more complex model, Dr. Baron says, it would have been much more difficult to put into clinical practice at MGH.

Whatever the model's complexity, a new algorithm ideally first runs silently in the background, recording when it would have fired, without triggering alerts to clinicians. “You need a system to test these things out and look into those patients to make sure if it would have fired that it would have been appropriate,” Dr. Dighe says.
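A silent trial of that kind can be as simple as scoring every result and logging, rather than sending, the alert. A minimal sketch, with hypothetical function names and logging scheme:

```python
# Minimal shadow-mode harness: record when an alert *would* have fired so
# the cases can be reviewed later; clinicians are never notified.
import csv
from datetime import datetime, timezone
from typing import Callable, Iterable, Tuple

def shadow_run(
    results: Iterable[Tuple[str, float]],
    would_fire: Callable[[float], bool],
    log_path: str = "shadow_alert_log.csv",
) -> None:
    """Append (timestamp, patient, value) rows for each would-have-fired alert."""
    with open(log_path, "a", newline="") as f:
        writer = csv.writer(f)
        for patient_id, value in results:
            if would_fire(value):
                writer.writerow(
                    [datetime.now(timezone.utc).isoformat(), patient_id, value]
                )

# Example: log creatinine results that exceed a placeholder threshold.
shadow_run([("pt-1", 1.8), ("pt-2", 0.9)], would_fire=lambda v: v >= 1.5)
```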

With the movement toward algorithms that are ever more complex, Dr. Baron says, “we need to think about how we’re going to leverage native LIS or EHR functionality, or how we’re going to build systems that can easily interface with existing health information systems.”

“If we had the full toolbox” for the AKI alert, Dr. Dighe says, “we could implement a very complex imputation method and a prediction algorithm. We would be able to look at not just the last or baseline creatinine but the whole picture of the patient.” Those approaches, however, would be almost impossible to implement within the current generations of LIS, he says, because none of the major companies permit external calculation engines.

“It isn’t hopeless, though,” Dr. Dighe adds. “You could potentially have a data repository and an external request to a clinical decision support engine, have all your computation occur somewhere else, and then bring the results back into your lab system or EHR.” Some EHRs now permit native machine learning implementation; algorithms that determine readmission risk and perform sepsis scoring in real time are examples. “That same approach can work for lab tests too.”
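The round trip Dr. Dighe describes, compute externally and bring the results back, can be sketched as a simple service call. Everything below is hypothetical: the endpoint URL, the payload shape, and the write-back step illustrate the architecture, not any vendor's API:

```python
# Hypothetical round trip to an external clinical decision support engine.
# The URL, payload fields, and write-back step are all assumptions.
import json
from urllib import request

def score_externally(patient_id: str, lab_panel: dict) -> dict:
    """POST the lab panel to an (assumed) external CDS engine and
    return its prediction payload."""
    body = json.dumps({"patient_id": patient_id, "labs": lab_panel}).encode()
    req = request.Request(
        "https://cds.example.org/predict",  # placeholder endpoint
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return json.load(resp)

def write_back_to_lis(patient_id: str, prediction: dict) -> None:
    """Placeholder for bringing the result back into the LIS/EHR,
    e.g. as an interpretive comment on the order."""
    print(f"LIS comment for {patient_id}: {prediction}")

# Usage (would fail without a real endpoint; shown for shape only):
# result = score_externally("12345", {"creatinine": 1.5, "bun": 28})
# write_back_to_lis("12345", result)
```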

“I think it’s very encouraging and a sign of recognition of the value of machine learning that EHRs have begun to create machine learning modules within the EHR build,” he says.

LIS and EHR functionality aren’t the only obstacles, Dr. Baron notes. Administrative and economic barriers also play a large role. One solution, he says, is to find a scalable model for shared clinical decision support. “It’s hard for an individual hospital to justify the resources to push these over the goal line by itself,” he says. “Let’s say it would take $2 million to build out a highly robust machine-learning–based solution for flagging of AKI. If you could build a solution that could be plugged into hospitals all over the country, then it could easily justify a few-million-dollar investment.”

Standards and the application of standards, like LOINC, SNOMED, and ICD-10, are holding things back too, Dr. Dighe says. In many organizations, “they’re typically not well applied, so even the basics like identifying a lab test can be a challenge. Now that we’re aggregating all our lab results from many EHRs in the New England area, we can build decision support inclusive of the entirety of the patient’s record, but we first have to manually and carefully map virtually all of those tests together for the decision support to work.”

“You can make this wonderful model that can look at all these parameters,” he continues, “but if you can’t identify and use a CBC result from an external organization that was deposited into your EHR, then it’s not as useful.”
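The mapping problem is concrete: the same analyte arrives from different EHRs under different local codes, and decision support works only once they resolve to one concept. A minimal sketch of such a crosswalk, with invented local codes (the LOINC codes shown are real):

```python
# Minimal crosswalk from site-local test codes to LOINC concepts.
# Local codes and site names are invented; the LOINC codes are real
# (718-7 = hemoglobin in blood, 2160-0 = serum/plasma creatinine).
LOCAL_TO_LOINC = {
    ("HOSPITAL_A", "HGB"): "718-7",
    ("HOSPITAL_B", "HB_BLD"): "718-7",
    ("HOSPITAL_A", "CREAT"): "2160-0",
    ("HOSPITAL_B", "SCR"): "2160-0",
}

def canonical_code(site: str, local_code: str) -> str:
    """Resolve a site-local lab code to its LOINC concept, or raise so
    unmapped tests are caught before they silently drop out of a model."""
    try:
        return LOCAL_TO_LOINC[(site, local_code)]
    except KeyError:
        raise KeyError(f"Unmapped test {local_code!r} from {site}; "
                       "map it before using it in decision support")

# Two different local codes resolve to the same concept:
assert canonical_code("HOSPITAL_A", "HGB") == canonical_code("HOSPITAL_B", "HB_BLD")
```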

Then, too, there is the tension around data sharing, Dr. Baron says. “In general, technology companies themselves don’t have direct access to patient data, so they try to partner with academic and nonacademic centers to collaborate on projects and get data.” Working with companies may be a solution, he says, to help future patients and build scalable models for decision support. “If we’re going to make this a reality, we’re going to need to develop these collaborations between health systems and industry.”

Charna Albert is CAP TODAY associate contributing editor.