Whether your data require simple weighted adjustment because of differential sampling rates or you have data from a complex multistage survey, Stata’s survey features can provide you with correct standard errors and confidence intervals for your inferences. All you need to do is specify the relevant characteristics of your sampling design, including sampling weights (including weights at multiple stages), clustering (at one, two, or more stages), stratification, and poststratification. After that, most of Stata’s estimation commands can adjust their estimates to correct for your sampling design.


SURVEY REGRESSION MODELS

  • Descriptive statistics
  • Linear regression models
  • Structural equation models
  • Survival-data regression models
  • Binary-response regression models
  • Discrete-response regression models
  • Fractional-response regression models
  • Poisson regression models
  • Instrumental-variables regression models
  • Regression models with selection
  • Longitudinal/panel-data regression models
  • Multilevel mixed-effects models
  • Finite mixture models
  • Item response theory

SEE MULTILEVEL MODELS WITH SURVEY DATA

 

SEE GENERALIZED SEM MODELS FOR SURVEY DATA

 

VARIANCE AND STANDARD-ERROR ESTIMATES

  • Taylor-series linearization (Huber/White/sandwich)
  • Balanced and repeated replications (BRR)
  • Survey jackknife
  • Bootstrap (with bootstrap replicate weights)
  • Successive difference replication (SDR)

SAMPLING DESIGNS

  • Sampling (probability) weights
  • Stratification
  • Clustering
  • Multistage designs
  • Weights at each sampling stage
  • Finite population correction in all stages
  • Support for strata with one sampling unit

 

FEATURES

  • Poststratification
  • Weight calibration via the raking-ratio method
  • Weight calibration via the generalized regression (GREG) method
  • Design effects
  • Misspecification effects
  • Effects for linear combinations
  • Coefficient of variation
  • Estimate linear/nonlinear combinations of parameters
  • Hypotheses tests for survey data
  • Estimation with linear constraints
  • Goodness of fit for logistic and probit estimators
  • Multiple imputation

 

© Copyright 1996–2024 StataCorp LLC. All rights reserved.

MAXIMUM PSEUDOLIKELIHOOD ESTIMATION

  • User-defined likelihoods
  • Survey characteristics automatically handled

SUMMARY STATISTICS

  • Population and subpopulation means
  • Population and subpopulation standard deviations
  • Population and subpopulation proportions
  • Population and subpopulation ratios
  • Population and subpopulation totals
  • Provide full covariance estimates across subpopulations

SUMMARY TABLES

  • Two-way contingency tables with tests of independence
  • One-way tables
  • Table describing the sampling design of survey data

POSTESTIMATION SELECTOR

  • View and run all postestimation features for your command
  • Automatically updated as estimation commands are run

 

FACTOR VARIABLES

  • Automatically create indicators based on categorical variables
  • Form interactions among discrete and continuous variables
  • Include polynomial terms
  • Perform contrasts of categories/levels
Watch Introduction to Factor Variables in Stata tutorials

MARGINAL ANALYSIS

  • Estimated marginal means
  • Marginal and partial effects
  • Average marginal and partial effects
  • Least-squares means
  • Predictive margins
  • Adjusted predictions, means, and effects
  • Works with multiple outcomes simultaneously
  • Contrasts of margins
  • Pairwise comparisons of margins
  • Profile plots
  • Interaction plots
  • Graphs of margins and marginal effects
Watch Profile plots and interaction plots in Stata tutorials