Generalized linear models (GLMs)

Stata fits multilevel mixed-effects generalized linear models (GLMs) with meglm. GLMs for cross-sectional data have been a workhorse of statistics because of their flexibility and ease of use. Stata’s xtgeecommand extends GLMs to the use of longitudinal/panel data by the method of generalized estimating equations.

Now you can use meglm to fit GLMs to hierarchical multilevel datasets with normally distributed random effects. Seven distributions for the response variable are supported (Gaussian, Bernoulli, binomial, gamma, negative binomial, ordinal, and Poisson); and five link functions are possible (identity, log, logit, probit, and complementary log-log).

Let’s fit a three-level model.

We have student-level data, where students are nested in classes, and classes are nested in schools. Our dependent variable thk is an ordered categorical variable that takes on the values 1, 2, 3, or 4; and we have three explanatory variables: prethk, cc, and tv. We will treat prethk as continuous. cc and tv are binary, and we want to include their interaction the model. Let’s fit an ordered logit model:

. webuse tvsfpors

. meglm thk prethk cc##tv || school: || class:, family(ordinal) link(logit)

Fitting fixed-effects model:

Iteration 0: log likelihood = -2212.775

Iteration 1: log likelihood = -2125.509

Iteration 2: log likelihood = -2125.1034

Iteration 3: log likelihood = -2125.1032

Refining starting values:

Grid node 0: log likelihood = -2152.1514

Fitting full model:

Iteration 0: log likelihood = -2152.1514 (not concave)

Iteration 1: log likelihood = -2125.9213 (not concave)

Iteration 2: log likelihood = -2120.1861

Iteration 3: log likelihood = -2115.6177

Iteration 4: log likelihood = -2114.5896

Iteration 5: log likelihood = -2114.5881

Iteration 6: log likelihood = -2114.5881

Mixed-effects GLM Number of obs = 1,600

Family: ordinal

Link: logit

Group Variable	No. of Groups	Observations per Group Minimum	Observations per Group Average	Observations per Group Maximum
school	28	18	57.1	137
class	135	1	11.9	28

Integration method: mvaghermite Integration points = 7

Wald chi2(4) = 124.39

Log likelihood = -2114.5881 Prob > chi2 = 0.0000

thk	Coef.	Std. Err.	z	P>\|z\|	[95% Conf. Interval]	[95% Conf. Interval]
prethk	.4085273	.039616	10.31	0.000	.3308814	.4861731
1.cc	.8844369	.2099124	4.21	0.000	.4730161	1.295858
1.tv	.236448	.2049065	1.15	0.249	-.1651614	.6380575
cc#tv 1 1	-.3717699	.2958887	-1.26	0.209	-.951701	.2081612
/cut1	-.0959459	.1688988	-0.57	0.570	-.4269815	.2350896
/cut2	1.177478	.1704946	6.91	0.000	.8433151	1.511642
/cut3	2.383672	.1786736	13.34	0.000	2.033478	2.733865
school var (_cons)	.0448735	.0425387			.0069997	.2876749
school>class var (_cons)	.1482157	.0637521			.063792	.3443674

LR test vs. ologit regression: chi2(2) = 21.03 Prob > chi2 = 0.0000

Note: LR test is conservative and provided only for reference.

Our model has two random-effects equations, separated by ||. Our first is a random intercept at the school level, and the second is a random intercept at the class level. The order we listed them matters: class comes after school, meaning that classes are nested within schools. Using the same logic, we could include more levels of nesting. meglm also allows crossed-effects models.

The output table includes the fixed-effect portion of our model, the estimated cutpoints (because this is an ordered logit model), and the estimated variance components.

This model can alternatively be fit with meologit, which is a convenient use for meglm with an ordinal family and a logit link.

Epidemiologia e Biostatistica

Scienze Sociali

Econometria

ECONOMETRIA FINANZIARIA

Corsi per l'utilizzo del software

CONVEGNO ITALIANO DEGLI UTENTI DI STATA

Analisi biostatistica, epidemiologica e ricerca medica

Software per ricerche operative

Analisi statistica generale

formazione multimediale

modelli gerarchici lineari e non lineari

Analisi di data mining

Trasferimento di archivi di dati

Analisi spaziale

Matematica e Ingegneria

word processing scientifico

Analisi statistica specialistica

Disegno di esperimenti e analisi della dimensione dei campioni

Analisi di serie temporali e la stima di modelli econometrici

analisi qualitativa

modelli di reti neurali

STATA PRESS

Altri testi relativi a Stata