Document Type : Original Article

Author

University of Guilan

Abstract

The issue of missing data is a pervasive challenge in research, posing a significant obstacle to the reliability and validity of study findings. To address this issue, researchers have developed numerous approaches for replacing missing values. In this study, we focus on one such method for imputing missing data. Specifically, our paper introduces a novel technique for addressing missing data by implementing a partitioning strategy for the data that contains these missing values. Subsequently, we utilize the Expectation-Maximization (EM) method to compensate for the missing values within each resulting partition. Our findings demonstrate the efficacy of segmenting data that includes missing values, revealing that employing a higher degree of segmentation leads to improved estimation accuracy. To evaluate the performance of our approach, we compared the results using two key indices, namely Mean Squared Error (MSE) and Standard Deviation (S.D), across complete data, missing data, and partitioned data scenarios. Notably, our analysis focused on situations where data loss completely at random within real-world datasets. In summary, this research contributes a new and effective method for addressing the challenge of missing data through data segmentation and the application of Expectation-Maximization techniques. Our results highlight the potential of this approach to enhance the accuracy and reliability of data analysis in the presence of missing values.

Keywords