Abstract:
For ages, actuaries, medical officers, and governments have used survival analysis to estimate life expectancies, medical treatment effectiveness, insurance premiums, and annuities. Censored data, the motivating factor for survival analysis, can be thought of as a special case of a more general statistical topic, missing data. Let X1, X2, …,Xn be the failure data from some absolutely continuous distribution function F (t, 𝜃); where t is the time, and let Y1, Y2, …, Yn be the censored data from some distribution function G(t). We further assume that two samples are mutually independent. In this study, we consider the right censoring data (Z1, δ1), (Z2, δ2), …,(Zn, δn), where Zi = min (Xi, Yi) and δi = I(Xi < Yi) and I(.) is the indicator function, and explain how to estimate the unknown parameter θ of relevant distribution, and hence obtained the parametric estimator for the survival distribution. In this study, we used the EM algorithm to estimate the unknown parameter θ because the EM algorithm is efficient for handling missing data. We conducted all simulation studies by using R software. In the simulations study, survival data are generated by using a one-parameter Exponential distribution and then a two-parameter Weibull distribution. In both cases, we compared the obtained parametric survival distributions with the nonparametric version, the Kaplan-Meier estimator. The results obtained by the three methods: Maximum likelihood method, Expectation Maximization algorithm, and Kaplan-Meier estimator, were the same. So, we can conclude that both estimation methods are suitable for parameter estimation in survival analysis.