Statistical software for data science.
With both a point-and-click interface and a powerful, intuitive command syntax, Stata is fast, accurate, and easy to use. For more than 30 years, Stata has provided everything you need for data analysis, data management, and publication-quality graphics.

Stata is a complete, integrated software package that covers all your data science needs: data manipulation, visualization, statistics, and reproducible reporting.

Complete data-management facilities
Stata's data-management features give you complete control of all types of data. You can combine and reshape datasets, manage variables, and collect statistics across groups or replicates. You can work with byte, integer, long, float, double, and string variables (including BLOBs and strings up to 2 billion characters). Stata also has advanced tools for managing specialized data such as survival/duration data, time-series data, panel/longitudinal data, categorical data, multiple-imputation data, and survey data.
All analyses can be reproduced and documented for publication and review. Version control ensures statistical programs will continue to produce the same results no matter when you wrote them.

Stata's most popular versions:

  • Stata/MP: The fastest version of Stata (for dual-core, quad-core, and other multicore/multiprocessor computers), able to analyze big data
  • Stata/SE: Stata for large datasets
  • Stata/IC: Stata for mid-sized datasets


Stata 15 | Features Documentation
Why Stata?

Stata 15 Data Management Features

Stata 15 Features Datasheet
Stata 15 Highlights Card

Publication-quality graphics

Stata makes it easy to generate publication-quality, distinctly styled graphs.

You can point and click to create a custom graph. Or you can write scripts to produce hundreds or thousands of graphs in a reproducible manner. Export graphs to EPS or TIFF for publication, to PNG or SVG for the web, or to PDF for viewing. With the integrated Graph Editor, you click to change anything about your graph or to add titles, notes, lines, arrows, and text.


Stata/MP is the fastest and largest version of Stata

Stata/MP lets you analyze data in one-half to two-thirds the time compared with Stata/SE on inexpensive dual-core laptops and in one-quarter to one-half the time on quad-core desktops and laptops. Stata/MP runs even faster on multiprocessor servers. Stata/MP supports up to 64 cores/processors.

Speed is often most crucial when performing computationally intense estimation procedures. A few of Stata's estimation procedures, including linear regression, are nearly perfectly parallelized, meaning they run twice as fast on two cores, four times as fast on four cores, eight times as fast on eight cores, and so on. Some estimation commands can be parallelized more than others. Taken at the median, estimation commands run 1.9 times faster on 2 cores, 3.1 times faster on 4 cores, and 4.3 times faster on 8 cores.
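To put the median figures in perspective, the short Python sketch below converts each quoted speedup into the share of single-core runtime that remains and into per-core efficiency. Only the three speedups (1.9, 3.1, 4.3) come from the text; the function names are illustrative and are not part of Stata.

```python
# Median speedups quoted in the text, keyed by number of cores.
MEDIAN_SPEEDUP = {2: 1.9, 4: 3.1, 8: 4.3}


def relative_runtime(speedup):
    """Fraction of the single-core runtime left after the given speedup."""
    return 1.0 / speedup


def parallel_efficiency(speedup, cores):
    """Speedup obtained per core; 1.0 would mean perfect scaling."""
    return speedup / cores


for cores, speedup in MEDIAN_SPEEDUP.items():
    runtime = relative_runtime(speedup)
    efficiency = parallel_efficiency(speedup, cores)
    print(f"{cores} cores: {runtime:.0%} of single-core time, "
          f"{efficiency:.0%} efficiency per core")
```

Read this way, the median command takes a little over half the single-core time on 2 cores and a little under a quarter on 8 cores, with per-core efficiency falling as cores are added.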

Speed can also be important when managing large datasets. Adding new variables is nearly 100 percent parallelized, and sorting is 75 percent parallelized.
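One way to interpret "75 percent parallelized" is through Amdahl's law, which bounds the speedup when only a fraction of the work can be spread across cores. The sketch below is written under that assumption. The text does not say that Stata's figures are defined this way, so treat the results as a rough upper-bound estimate and not as measured performance. The function name is illustrative.

```python
def amdahl_speedup(parallel_fraction, cores):
    """Upper bound on speedup under Amdahl's law.

    parallel_fraction: share of the work that can run in parallel (0 to 1).
    cores: number of cores sharing the parallel part.
    """
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


# ASSUMPTION: "75 percent parallelized" is read as parallel_fraction = 0.75.
for cores in (2, 4, 8):
    print(f"sorting on {cores} cores: at most "
          f"{amdahl_speedup(0.75, cores):.2f}x faster")
```

Under this reading, the serial quarter of the work caps the sorting speedup at 4x regardless of core count, whereas an operation that is close to 100 percent parallelized keeps scaling almost linearly.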

Because speed is very important when your problems are big, Stata/MP supports even larger datasets than Stata/SE.

Stata/SE can analyze up to 2 billion observations. Stata/MP can analyze 10 to 20 billion observations on the largest computers currently available and is ready to analyze up to 1 trillion observations once computer hardware catches up. Stata/MP also allows 120,000 variables compared to 32,767 variables allowed by Stata/SE.

Stata/MP Brochure

Stata for Colleges and Universities

Universities worldwide rely on Stata because it is a complete statistical package that provides a broad statistical base. Teaching your course with Stata provides your students with tools and skills that translate to their professional life.

Education - Teaching with Stata




Stata Interface Tour

Preview: A prefix for Bayesian regression in Stata

Introduction to Bayesian analysis: The basic concepts

PDF Documentation
