Multiple Imputation: A Solution for Nonresponse

TriCollege Digital Repository

Title: Multiple Imputation: A Solution for Nonresponse
Author: Lo, Ethan
Advisor: Dougherty, John P.
Department: Haverford College. Dept. of Computer Science
Type: Thesis (B.S.)
Issue Date: 2011
Abstract: Missing data is found in virtually every large database, and can be a significant challenge for researchers who want to create analyses on the data. Several approaches have been developed to deal with missing data, but many of them are either ineffective or computationally expensive. However, an approach called Multiple Imputation (MI) is widely used due to its simplicity and effectiveness. MI creates several different sets of full databases (with missing data predicted from existing values of other variables), and then analyzes each one. These results are then combined to create one overall analysis. This seems very straightforward, but is not without problems. There are specific assumptions made when dealing with MI, and ignoring these assumptions can lead to invalid conclusions. Assumptions such as not having missing values in the predictors, not normalizing data, and not looking at missing data problems can all contribute to incorrect conclusions. However, if carefully implemented, MI is a powerful and simple tool that can be applied to almost any missing data problem.
Subject: Database searching -- Methodology
Subject: Multiple imputation (Statistics)
Terms of Use:
Permanent URL:

Files in this item

Files Description Size Format
2011LoE_thesis.pdf Thesis (Haverford Users Only) 109.2Kb PDF
2011LoE_release.pdf **Archive Staff Only** 79.78Kb PDF


Lo, Ethan. "Multiple Imputation: A Solution for Nonresponse". 2011. Available electronically from

This item appears in the following Collection(s) Except where otherwise noted, this item's license is described as


Advanced Search


My Account