Conway-Maxwell-Poisson (COM-Poisson) GLM family
The COM-Poisson family is a generalization of the Poisson family which can describe over-dispersed as well as under-dispersed count data. It is indexed by a parameter nu
that quantifies such dispersion. For nu
>1, the distribution is under-dispersed relative to the Poisson distribution with same mean. It includes the Poisson, geometric and Bernoulli as special (or limit) cases (see Details). The COM-Poisson family is here implemented as a family
object, so that it can be fitted by glm
, and further used to model conditional responses in mixed models fitted by this package's functions (see Examples). nu
is distinct from the dispersion parameter ν=1/φ considered elsewhere in this package and in the GLM literature, as ν affects in a more specific way the log-likelihood.
Several links are now allowed for this family, corresponding to different versions of the COMPoisson described in the literature (e.g., Sellers & Shmueli 2010; Huang 2017).
COMPoisson(nu = stop("COMPoisson's 'nu' must be specified"), link = "loglambda")
link |
GLM link function. The default is the canonical link |
nu |
Under-dispersion parameter. The |
The ith term of the distribution can be written q_i/Z where q_i=λ^i / (i!)^ν and Z=∑_(i=0)^∞ q_i, for λ=λ(μ) implied by its inverse relationship, the expectation formula μ=μ(λ)=∑_(i=0)^∞ i q_i(λ)/Z. The case nu=0
is the geometric distribution with parameter λ; nu=1
is the Poisson distribution with mean λ; and the limit as nu
-> ∞ is the Bernoulli distribution with expectation λ/(1+λ).
From this definition, this is an exponential family model with canonical parameters log(λ) and ν. When the linear predictor η specifies log(λ(μ)), the canonical link is used (e.g., Sellers & Shmueli 2010). It is here nicknamed "loglambda"
and does not have a known expression in terms of elementary functions. To obtain μ as the link inverse of the linear predictor η, one then first computes λ=e^η and then μ(λ) by the expectation formula. For other links (Huang 2017), one directly computes μ by the link inverse (e.g., μ=e^η for link "log"
), and then one may solve for λ= λ(μ) to obtain other features of the distribution.
The relationships between λ and μ or other moments of the distribution involve infinite summations. These sums can be easily approximated by a finite number of terms for large nu
but not when nu
approaches zero. For this reason, the code may fail to fit distributions with nu
approaching 0 (strong residual over-dispersion). The case nu=0
(the geometric distribution) is fitted by an ad hoc algorithm devoid of such problems. Otherwise, spaMM
truncates the sum, and uses numerical integrals to approximate missing terms (which slows down the fitting operation). In addition, it applies an ad hoc continuity correction to ensure continuity of the result in nu=1
(Poisson case). These corrections affect numerical results for the case of residual overdispersion but are negligible for the case of residual underdispersion. Alternatively, spaMM
uses Gaunt et al.'s (2017) approximations when the condition defined in spaMM.getOption("CMP_asympto_cond")
is satisfied. All approximations reduces the accuracy of computations, in a way that can impede the extended Levenberg-Marquardt algorithm sometimes needed by spaMM.
The name COMP_nu
should be used to set initial values or bounds on nu
in control arguments of the fitting functions (e.g., fitme(.,init=list(COMP_nu=1))
). Fixed values should be set by the family argument (COMPoisson(nu=.)
).
A family object.
Gaunt, Robert E. and Iyengar, Satish and Olde Daalhuis, Adri B. and Simsek, Burcin. (2017) An asymptotic expansion for the normalizing constant of the Conway–Maxwell–Poisson distribution. Ann Inst Stat Math doi: 10.1007/s10463-017-0629-6.
Huang, Alan (2017) Mean-parametrized Conway-Maxwell-Poisson regression models for dispersed counts. Stat. Modelling doi: 10.1177/1471082X17697749
G. Shmueli, T. P. Minka, J. B. Kadane, S. Borle and P. Boatwright (2005) A useful distribution for fitting discrete data: revival of the Conway-Maxwell-Poisson distribution. Appl. Statist. 54: 127-142.
Sellers KF, Shmueli G (2010) A Flexible Regression Model for Count Data. Ann. Appl. Stat. 4: 943–961
if (spaMM.getOption("example_maxtime")>0.9) { # Fitting COMPoisson model with estimated nu parameter: # data("freight") ## example from Sellers & Shmueli, Ann. Appl. Stat. 4: 943–961 (2010) fitme(broken ~ transfers, data=freight, family = COMPoisson()) fitme(broken ~ transfers, data=freight, family = COMPoisson(link="log")) # glm(), HLCor() and HLfit() handle spaMM::COMPoisson() with fixed overdispersion: # glm(broken ~ transfers, data=freight, family = COMPoisson(nu=10)) HLfit(broken ~ transfers+(1|id), data=freight, family = COMPoisson(nu=10),method="ML") # Equivalence of poisson() and COMPoisson(nu=1): # COMPglm <- glm(broken ~ transfers, data=freight, family = poisson()) coef(COMPglm) logLik(COMPglm) COMPglm <- glm(broken ~ transfers, data=freight, family = COMPoisson(nu=1)) coef(COMPglm) logLik(COMPglm) HLfit(broken ~ transfers, data=freight, family = COMPoisson(nu=1)) }
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.