mode helix
NEWIntroducing Helix—the first instant, responsive data engine.Learn more

Mode Studio

The Collaborative Data Science Platform

Statsmodels

As its name implies, statsmodels is a Python library built specifically for statistics. Statsmodels is built on top of NumPy, SciPy, and matplotlib, but it contains more advanced functions for statistical testing and modeling that you won't find in numerical libraries like NumPy or SciPy.

Statsmodels tutorials

The tutorials below cover a variety of statsmodels' features.

Linear regression

Multiple regression

Logistic regression

Time series analysis

Statistical tests

Statsmodels resources

  • Chapter 11: Regression of Think Stats (Allen B. Downey) - This chapter covers aspects of multiple and logistic regression in statsmodels. It explains the concepts behind the code, but you'll still need familiarity with basic statistics before diving in.
  • The statsmodels section of Cross Validated - A question and answer site for people interested in statistics, machine learning, data analysis, data mining, and data visualization.
  • Logistic regression vs. multiple regression (CoolData) - Not Python related, but this provides a helpful breakdown of the differences between logistic and multiple regression.
  • Official statsmodels documentation

Looks like you've got a thing for cutting-edge data news.

Get the latest.

Get started