Stata Is Widely Used In The Following Domains:

By discipline

  • Behavioral sciences
  • Biostatistics
  • Data Science
  • Economics
  • Education 
  • Epidemiology
  • Finance, business, and marketing
  • Institutional Research
  • Medicine
  • Political science
  • Public health
  • Public policy
  • Sociology

By category

regression  •  censored outcomes  •  endogenous regressors  •  bootstrap, jackknife, and robust and cluster–robust variance  •  instrumental variables  •  three-stage least squares  •  constraints  •  quantile regression  •  GLS

More

Kaplan–Meier and Nelson–Aalen estimators,  •  Cox regression (frailty)  •  parametric models (frailty, random effects)  •  competing risks  •  hazards  •  time-varying covariates  •  left-, right-, and interval-censoring  •  Weibull, exponential, and Gompertz models

More

interactive sessions  •  large-scale development projects  •  optimization  •  matrix inversions  •  decompositions  •  eigenvalues and eigenvectors  •  LAPACK engine  •  real and complex numbers  •  string matrices  •  interface to Stata datasets and matrices  •  numerical derivatives  •  object-oriented programming 

More

random and fixed effects with robust standard errors  •  linear mixed models  •  random-effects probit  •  GEE  •  random- and fixed-effects Poisson  •  dynamic panel-data models  •  instrumental variables  •  panel unit-root tests

More

thousands of built-in models  •  univariate and multivariate models  •  linear and nonlinear models  •  multilevel models  •  continuous, binary, ordinal, and count outcomes  •  bayes:prefix for 46 estimation commands  •  continuous univariate, multivariate, and discrete priors  •  add your own models  •  multiple chains  •  convergence diagnostics  •  posterior summaries  •  hypothesis testing  •  model fit  •  model comparison  •  predictions

More

menus and dialogs for all features  •  Data Editor  •  Variables Manager  •  Graph Editor  •  Project Manager  •  Do-file Editor  •  Clipboard Preview Tool  •  multiple preference sets 

More

continuous, binary, count, and survival outcomes  •  two-, three-, and higher-level models  •  generalized linear models  •  nonlinear models  •  random intercepts  •  random slopes  •  crossed random effects  •  BLUPs of effects and fitted values  •  hierarchical models  •  residual error structures  •  DDF adjustments  •  support for survey data

More 

effect sizes  •  common, fixed, and random effects  •  forest, funnel, and more plots  •  subgroup and cumulative analysis  •  meta-regression  •  small-study effects  •  publication bias

More 

35 manuals  •  18,000 pages  •  seamless navigation  •  thousands of worked examples  •  quick starts  •  methods and formulas  •  references

More

logistic, probit, tobit  •  Poisson and negative binomial  •  conditional, multinomial, nested, ordered, rank-ordered, and stereotype logistic  •  multinomial probit  •  zero-inflated and left-truncated count models  •  selection models  •  marginal effects

More

power  •  sample size  •  effect size  •  minimum detectable effect  •  CI width  •  means  •  proportions  •  variances  •  correlations  •  ANOVA  •  regression  •  cluster randomized designs  •  case–control studies  •  cohort studies  •  contingency tables  •  survival analysis  •  balanced or unbalanced designs  •  results in tables or graphs

More 

summaries  •  cross-tabulations  •  correlations  •  z and t tests  •  equality-of-variance tests  •  tests of proportions  •  confidence intervals  •  factor variables

More

discrete choice  •  rank-ordered alternatives  •  conditional logit  •  multinomial probit  •  nested logit  •  mixed logit  •  panel data  •  case-specific and alternative-specific predictors  •  interpret results—expected probabilities, covariate effects, comparisons across alternatives

More

inverse probability weight (IPW)  •  doubly robust methods  •  propensity-score matching  •  regression adjustment  •  covariate matching  •  multilevel treatments  •  endogenous treatments  •  average treatment effects (ATEs)  •  ATEs on the treated (ATETs)  •  potential-outcome means (POMs)  •  continuous, binary, count, fractional, and survival outcomes  •  panel data

More

nonparametric regression  •  Wilcoxon–Mann–Whitney, Wilcoxon signed ranks, and Kruskal–Wallis tests  •  Spearman and Kendall correlations  •  Kolmogorov–Smirnov tests  •  exact binomial CIs  •  survival data  •  ROC analysis  •  smoothing  •  bootstrapping 

More

endogenous covariates  •  sample selection  •  nonrandom treatment  •  panel data  •  account for problems alone or in combination  •  continuous, interval-censored, binary, and ordinal outcomes

More

lasso  •  elastic net  •  model selection  •  prediction  •  inference  •  continuous, binary, and count outcomes  •  cross-validation  •  adaptive lasso  •  double selection  •  partialing out  •  cross-fit partialing out  •  double machine learning  •  endogenous covariates

More

generalized method of moments (GMM)  •  nonlinear regression

More

ten link functions  •  user-defined links  •  seven distributions  •  ML and IRLS estimation  •  nine variance estimators  •  seven residuals 

More

graphical path diagram builder  •  standardized and unstandardized estimates  •  modification indices  •  direct and indirect effects  •  continuous, binary, count, ordinal, and survival outcomes  •  multilevel models  •  random slopes and intercepts  •  factor scores, empirical Bayes, and other predictions  •  groups and tests of invariance  •  goodness of fit  •  handles MAR data by FIML  •  correlated data  •  survey data

More

specify likelihood using simple expressions  •  no programming required  •  survey data  •  standard, robust, bootstrap, and jackknife SEs  •  matrix estimators

More

fmm: prefix for 17 estimators  •  mixtures of a single estimator  •  mixtures combining multiple estimators or distributions  •  continuous, binary, count, ordinal, categorical, censored, truncated, and survival outcomes

More

binary, ordinal, continuous, count, categorical, fractional, and survival items  •  add covariates to model class membership  •  combine with SEM path models  •  expected class proportions  •  goodness of fit  •  predictions of class membership

More

user-specified functions  •  NR, DFP, BFGS, BHHH  •  OIM, OPG, robust, bootstrap, and jackknife SEs  •  Wald tests  •  survey data  •  numeric or analytic derivatives

More 

spatial lags of dependent variable, independent variables, and autoregressive errors  •  fixed and random effects in panel data  •  endogenous covariates  •  analyze spillover effects

More

nine univariate imputation methods  •  multivariate normal imputation  •  chained equations  •  explore pattern of missingness  •  manage imputed datasets  •  fit model and pool results  •  transform parameters  •  joint tests of parameter estimates  •  predictions

More

kappa measure of interrater agreement  •  Cronbach’s alpha  •  stepwise regression  •  tests of normality

More

balanced and unbalanced designs  •  factorial, nested, and mixed designs  •  repeated measures  •  marginal means  •  contrasts 

More

multistage designs  •  bootstrap, BRR, jackknife, linearized, and SDR variance estimation  •  poststratification  •  raking  •  calibration  •  DEFF  •  predictive margins  •  means, proportions, ratios, totals  •  summary tables  •  almost all estimators supported

More 

statistical  •  random-number  •  mathematical  •  string  •  date and time  •  regular expressions  •  Unicode

More

exact logistic and Poisson regression  •  exact case–control statistics  •  binomial tests  •  Fisher’s exact test for r × c tables

More

hierarchical clustering  •  kmeans and kmedian nonhierarchical clustering  •  dendrograms  •  stopping rules  •  user-extensible analyses

More

ability to install new commands  •  web updating  •  web file sharing  •  latest Stata news 

More

standardization of rates  •  case–control  •  cohort  •  matched case–control  •  Mantel–Haenszel  •  pharmacokinetics  •  ROC analysis  •  ICD-10

More

binary (1PL, 2PL, 3PL), ordinal, and categorical response models  •  item characteristic curves  •  test characteristic curves  •  item information functions  •  test information functions  •  multiple-group models  •  differential item functioning (DIF) 

More

search and download thousands of free additions  •  discover new features in the Stata Journal  •  share commands by posting to the SSC  •  discuss community-contributed commands on Statalist 

More

specify models algebraically  •  solve models  •  estimate parameters  •  identification diagnostics  •  policy and transition matrices  •  IRFs  •  dynamic forecasts

More

factor analysis  •  principal components  •  discriminant analysis  •  rotation  •  multidimensional scaling  •  Procrustean analysis  •  correspondence analysis  •  biplots  •  dendrograms  •  user-extensible analyses

More

Wald tests  •  LR tests  •  linear and nonlinear combinations  •  predictions and generalized predictions  •  marginal means  •  least-squares means  •  adjusted means  •  marginal and partial effects  •  forecast models  •  Hausman tests 

More

data transformations  •  data frames  •  match-merge  •  import/export data  •  ODBC  •  SQL  •  Unicode  •  by-group processing  •  append files  •  sort  •  row–column transposition  •  labeling  •  save results

More

Q report for regulatory agencies such as the FDA  •  installation verification

compare means, intercepts, or slopes  •  compare with reference category, adjacent category, grand mean, etc.  •  orthogonal polynomials  •  multiple-comparison adjustments  •  graph estimated means and contrasts  •  interaction plots

More

reproducible reports  •  Word  •  Excel  •  PDF  •  HTML  •  dynamic documents  •  Markdown  •  Stata results and graphs  •  SVG  •  EPS  •  PNG  •  TIF  •  formatted text and tables 

More

Section 508 compliance, accessibility for persons with disabilities

bootstrap  •  jackknife  •  Monte Carlo simulation  •  permutation tests

More

lines  •  bars  •  areas  •  ranges  •  contours  •  confidence intervals  •  interaction plots  •  survival plots  •  publication quality  •  customize anything  •  Graph Editor 

More

Sample session

A sample session of Stata for MacUnix, or Windows.

ARIMA  •  ARFIMA  •  ARCH/GARCH  •  VAR  •  VECM  •  multivariate GARCH  •  unobserved-components model  •  dynamic factors  •  state-space models  •  Markov-switching models  •  business calendars  •  tests for structural breaks  •  threshold regression  •  forecasts  •  impulse–response functions  •  unit-root tests  •  filters and smoothers  •  rolling and recursive estimation 

More   

adding new commands  •  scripting  •  object-oriented programming  •  menu and dialog-box programming  •  dynamic documents  •  Markdown  •  Project Manager  •  Python integration  •  Java plugins  •  C/C++ plugins

More