Power analysis for linear regression

What’s this about?

Stata’s power command performs power and sample-size analysis (PSS). Its features now include PSS for linear regression.

As with all other power methods, the new methods allow you to specify multiple values of parameters and to automatically produce tabular and graphical results.

Linear regression

Stata’s power command provides three new PSS methods for linear regression.

power oneslope performs PSS for a slope test in a simple linear regression. It computes one of the sample size, power, or target slope given the other two and other study parameters. See [PSS] power oneslope.

power rsquared performs PSS for an R² test in a multiple linear regression. An R² test is an F test for the coefficient of determination (R²). The test can be used to test the significance of all the coefficients, or it can be used to test a subset of them. In both cases, power rsquared computes one of the sample size, power, or target R² given the other two and other study parameters. See [PSS] power rsquared.

power pcorr performs PSS for a partial-correlation test in a multiple linear regression. A partial-correlation test is an F test of the squared partial multiple correlation coefficient. The command computes one of the sample size, power, or target squared partial-correlation coefficient given the other two and other study parameters. See [PSS] power pcorr.

Here, we demonstrate PSS for an R² test of a subset of coefficients in a multiple linear regression.

Consider a test of the significance of two covariates in a multiple linear regression adjusting for three other covariates. We will call the two covariates the tested covariates and the three others control covariates. The reduced model with the control covariates has an R² of 0.1, and the full model with all five covariates has an R² of 0.2. We want to compute the required sample size for the two-sided R² test to achieve 80% power with a 5% significance level—power rsquared defaults.

. power rsquared 0.1 0.2, ntested(2) ncontrol(3)


Performing iteration ...

Estimated sample size for multiple linear regression
F test for R2 testing subset of coefficients
Ho: R2_F = R2_R  versus  Ha: R2_F != R2_R

Study parameters:

        alpha =    0.0500
        power =    0.8000
        delta =    0.1250
         R2_R =    0.1000
         R2_F =    0.2000
      R2_diff =    0.1000
     ncontrol =         3
      ntested =         2

Estimated sample size:

            N =        81

We need 81 observations.

Suppose that we want to investigate the impact of the effect size on the required sample size. We plot below the sample-size curve as a function of the R² values of the full model.

. power rsquared 0.1 (0.2(0.1)0.5), ntested(2) ncontrol(3) graph

graph

As the R² of the full model increases, the required sample size decreases. When the R² is closer to 0.2, the curve is steeper.

Tell me more

Learn more about Stata’s power and sample size features.

Read more about PSS for linear regression:

[PSS] power oneslope
[PSS] power rsquared
[PSS] power pcorr

Epidemiologia e Biostatistica

Scienze Sociali

Econometria

ECONOMETRIA FINANZIARIA

Corsi per l'utilizzo del software

Summer school

CONVEGNO ITALIANO DEGLI UTENTI DI STATA

Analisi biostatistica, epidemiologica e ricerca medica

Software per ricerche operative

Analisi statistica generale

formazione multimediale

modelli gerarchici lineari e non lineari

Analisi di data mining

Trasferimento di archivi di dati

Analisi spaziale

Matematica e Ingegneria

word processing scientifico

Analisi statistica specialistica

Disegno di esperimenti e analisi della dimensione dei campioni

Analisi di serie temporali e la stima di modelli econometrici

analisi qualitativa

modelli di reti neurali

STATA PRESS

Altri testi relativi a Stata