🎲 Bayesian Data Analysis
Reasoning about uncertainty with priors, likelihoods and posteriors
0.5.dev0+git.20260626.e137512 - June 26, 2026 18:41 UTC

Bayesian Data Analysis#

Bayesian analysis treats unknown quantities as probability distributions and updates them with data. Instead of a single “best” estimate, you get a full posterior — a principled account of what the data do and do not tell you. This hub builds from first principles up to the nonparametric models (mixtures, density estimation, Dirichlet processes) that the source corpus emphasises.

Three reading levels run through the page:

newcomers — the intuition of prior → likelihood → posterior;
practitioners — how to actually compute and check posteriors;
researchers — hierarchical and nonparametric (infinite-mixture) models.

Note

Open a dropdown for detail; follow See also links to related ideas. Code snippets use real scipy.stats / PyMC / ArviZ / scikit-learn calls. This page pairs with the Terminology reference (probability and distributions) and the Time Series hub (where Bayesian estimation also appears).

Discovery at a Glance#

🟢 Start Here — The Bayesian Idea

The one equation everything rests on.

🔁 Bayes’ Theorem

Posterior ∝ likelihood × prior — how belief is updated by evidence.

Bayes’ Theorem

🎯 Prior, Likelihood, Posterior

The three ingredients, what each encodes, and where they come from.

Prior, Likelihood & Posterior

📐 Credible Intervals

A 95 % interval you can read as “95 % probability” — unlike a confidence interval.

Credible Intervals (and how they differ from CIs)

🔵 Core — Computing Posteriors

From conjugate shortcuts to general-purpose sampling.

✨ Conjugacy

When prior and posterior share a family, the update is exact and closed-form.

Conjugacy (the exact, closed-form case)

⛓️ MCMC Sampling

Drawing from any posterior when no formula exists — the workhorse of modern Bayes.

MCMC Sampling

🔮 Posterior Predictive

Simulating new data to check the model and forecast.

Posterior Predictive Checks

🔴 Advanced — Hierarchies & Nonparametrics

Sharing strength across groups; letting complexity grow with data.

🏛️ Hierarchical Models

Partial pooling: groups borrow strength from each other.

Hierarchical Models (Partial Pooling)

🌗 Mixture Models

Sub-populations, label switching, and choosing the number of components.

Mixture Models & Label Switching

♾️ Dirichlet Processes

Nonparametric priors that let the number of clusters grow with the data.

Dirichlet Processes (Nonparametric Bayes)

Part 1 — The Bayesian Idea#

Part 2 — Computing Posteriors#

Part 3 — Hierarchies, Mixtures & Nonparametrics#

Map to scikit-plots & the Bayesian Stack#

scikit-plots’ role here is diagnostic and model-selection visual support; the heavy lifting is done by the probabilistic-programming stack.

Gaussian Mixture Models (AIC / BIC)

Choose the number of mixture components by information criteria.

https://scikit-plots.github.io/dev/auto_examples/stats/plot_gaussian_mixture_models.html

Residuals distribution

Distributional / Q–Q checks on fitted models.

https://scikit-plots.github.io/dev/auto_examples/stats/plot_residuals_distribution_script.html

PyMC

Probabilistic programming for building and sampling models.

https://www.pymc.io/

ArviZ

Diagnostics, summaries and plots for Bayesian inference.

https://python.arviz.org/

Sources#

Verified during preparation of this page; resolvable at build date.

Source context (framing only, re-expressed in our own words)

Bayesian Data Analysis category (144 posts): https://insightful-data-lab.com/category/bayesian-data-analysis/

Official documentation (API calls used above)

SciPy — scipy.stats distributions: https://docs.scipy.org/doc/scipy/reference/stats.html
scikit-learn — Gaussian mixture models: https://scikit-learn.org/stable/modules/mixture.html
PyMC — probabilistic programming: https://www.pymc.io/
ArviZ — exploratory analysis of Bayesian models: https://python.arviz.org/

scikit-plots (this project)

Example gallery: https://scikit-plots.github.io/dev/auto_examples/index.html
Terminology reference: terminology-index

Standard reference

Gelman, Carlin, Stern, Dunson, Vehtari & Rubin, Bayesian Data Analysis (3rd ed.): http://www.stat.columbia.edu/~gelman/book/

Tags: purpose: reference domain: statistics level: beginner level: intermediate level: advanced