Generalized Gamma distribution family function
Estimation of the 3-parameter generalized gamma distribution proposed by Stacy (1962).
gengamma.stacy(lscale = "loglink", ld = "loglink", lk = "loglink", iscale = NULL, id = NULL, ik = NULL, imethod = 1, gscale.mux = exp((-4:4)/2), gshape1.d = exp((-5:5)/2), gshape2.k = exp((-5:5)/2), probs.y = 0.3, zero = c("d", "k"))
lscale, ld, lk |
Parameter link function applied to each of the positive parameters
b, d and k, respectively.
See |
iscale, id, ik |
Initial value for b, d and k, respectively. The defaults mean an initial value is determined internally for each. |
gscale.mux, gshape1.d, gshape2.k |
See |
imethod, probs.y, zero |
See |
The probability density function can be written
f(y;b,d,k) = d * b^(-d*k) * y^(d*k-1) * exp(-(y/b)^d) / gamma(k)
for scale parameter b > 0, and Weibull-type shape parameter d > 0, gamma-type shape parameter k > 0, and y > 0. The mean of Y is b*gamma(k+1/d)/gamma(k) (returned as the fitted values), which equals b*k if d=1.
There are many special cases, as given in Table 1 of Stacey and Mihram (1965).
In the following, the parameters are in the order b,d,k.
The special cases are:
Exponential f(y;b,1,1),
Gamma f(y;b,1,k),
Weibull f(y;b,d,1),
Chi Squared f(y;2,1,a/2) with a degrees of freedom,
Chi f(y;sqrt(2),2,a/2) with a degrees of freedom,
Half-normal f(y;sqrt(2),2,1/2),
Circular normal f(y;sqrt(2),2,1),
Spherical normal f(y;sqrt(2),2,3/2),
Rayleigh f(y;c sqrt(2),2,1) where c>0.
Also the log-normal distribution corresponds to when k = Inf
.
An object of class "vglmff"
(see vglmff-class
).
The object is used by modelling functions such as vglm
,
and vgam
.
Several authors have considered maximum likelihood estimation for the
generalized gamma distribution and have found that the Newton-Raphson
algorithm does not work very well and that the existence of solutions
to the log-likelihood equations is sometimes in doubt.
Although Fisher scoring is used here, it is likely that the same
problems will be encountered.
It appears that large samples
are required, for example, the estimator of k became asymptotically
normal only with 400 or more observations.
It is not uncommon for maximum likelihood estimates to fail to converge
even with two or three hundred observations.
With covariates, even more observations are needed to increase the
chances of convergence.
Using covariates is not advised unless the sample size is at least
a few thousand, and even if so,
modelling 1 or 2 parameters as intercept-only is a very good idea
(e.g., zero = 2:3
).
Monitoring convergence is also a very good idea
(e.g., set trace = TRUE
).
Half-stepping is not uncommon, and if this occurs, then the
results should be viewed with more suspicion.
The notation used here differs from Stacy (1962) and Prentice (1974).
Poor initial values may result in failure to converge so
if there are covariates and there are convergence problems,
try using or checking the zero
argument (e.g., zero = 2:3
)
or the ik
argument
or the imethod
argument, etc.
T. W. Yee
Stacy, E. W. (1962). A generalization of the gamma distribution. Annals of Mathematical Statistics, 33(3), 1187–1192.
Stacy, E. W. and Mihram, G. A. (1965). Parameter estimation for a generalized gamma distribution. Technometrics, 7, 349–358.
Prentice, R. L. (1974). A log gamma model and its maximum likelihood estimation. Biometrika, 61, 539–544.
k <- exp(-1); Scale <- exp(1); dd <- exp(0.5); set.seed(1) gdata <- data.frame(y = rgamma(2000, shape = k, scale = Scale)) gfit <- vglm(y ~ 1, gengamma.stacy, data = gdata, trace = TRUE) coef(gfit, matrix = TRUE)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.