PubAg

Main content area

A novel MM algorithm and the mode-sharing method in Bayesian computation for the analysis of general incomplete categorical data

Author:
Tian, Guo-Liang, Liu, Yin, Tang, Man-Lai, Li, Tao
Source:
Computational statistics & data analysis 2019 v.140 pp. 122-143
ISSN:
0167-9473
Subject:
Bayesian theory, algorithms, data collection, epidemiology, medicine, models, probability, psychology, sports, statistical analysis
Abstract:
Incomplete categorical data often occur in the fields such as biomedicine, epidemiology, psychology, sports and so on. In this paper, we first introduce a novel minorization–maximization (MM) algorithm to calculate the maximum likelihood estimates (MLEs) of parameters and the posterior modes for the analysis of general incomplete categorical data. Although the data augmentation (DA) algorithm and Gibbs sampling as the corresponding stochastic counterparts of the expectation–maximization (EM) and ECM algorithms are developed very well, up to now, little work has been done on creating stochastic versions to the existing MM algorithms. This is the first paper to propose a mode-sharing method in Bayesian computation for general incomplete categorical data by developing a new acceptance–rejection (AR) algorithm aided with the proposed MM algorithm. The key idea is to construct a class of envelope densities indexed by a working parameter and to identify a specific envelope density which can overcome the four drawbacks associated with the traditional AR algorithm. The proposed mode-sharing based AR algorithm has three significant characteristics: (I) it can automatically establish a family of envelope densities {gλ(⋅): λ∈Sλ} indexed by a working parameter λ, where each member in the family shares mode with the posterior density; (II) with the one-dimensional grid method searching over the finite interval Sλ, it can identify an optimal working parameter λopt by maximizing the theoretical acceptance probability, yielding a best easy-sampling envelope density gλopt(⋅), which is more dispersive than the posterior density; (III) it can obtain the optimal envelope constant copt by using the mode-sharing theorem (indicating that the high-dimensional optimization can be completely avoided) or by using the proposed MM algorithm again. Finally, a toy model and three real data sets are used to illustrate the proposed methodologies.
Agid:
6461406