Escolar Documentos
Profissional Documentos
Cultura Documentos
Probability distribution
In probability theory and statistics, a probability distribution identifies either the probability of each value of a
random variable (when the variable is discrete), or the probability of the value falling within a particular interval
(when the variable is continuous).[1] The probability distribution describes the range of possible values that a random
variable can attain and the probability that the value of the random variable is within any (measurable) subset of that
range.
When the random variable takes values
in the set of real numbers, the
probability distribution is completely
described by the cumulative
distribution function, whose value at
each real x is the probability that the
random variable is smaller than or
equal to x.
There are various probability distributions that show up in various different applications. Two of the most important
ones are the normal distribution and the categorical distribution. The normal distribution, also known as the Gaussian
distribution, has a familiar "bell curve" shape and approximates many different naturally occurring distributions over
real numbers. The categorical distribution describes the result of an experiment with a fixed, finite number of
outcomes. For example, the toss of a fair coin is a categorical distribution, where the possible outcomes are heads
and tails, each with probability 1/2.
Formal definition
In the measure-theoretic formalization of probability theory, a random variable is defined as a measurable function X
from a probability space to measurable space . A probability distribution is the pushforward
measure X*P = PX −1 on .
Discrete distributions and some continuous distributions (like the Cantor distribution) do not admit such a density.
Terminology
The support of a distribution is the smallest closed interval/set whose complement has probability zero. It may be
understood as the points or elements that are actual members of the distribution.
A discrete random variable is a random variable whose probability distribution is discrete. Similarly, a continuous
random variable is a random variable whose probability distribution is continuous.
Some properties
• The probability density function of the sum of two independent random variables is the convolution of each of
their density functions.
• The probability density function of the difference of two independent random variables is the cross-correlation
of their density functions.
• Probability distributions are not a vector space – they are not closed under linear combinations, as these do not
preserve non-negativity or total integral 1 – but they are closed under convex combination, thus forming a convex
subset of the space of functions (or measures).
Related to positive real-valued quantities that grow exponentially (e.g. prices, incomes,
populations)
• Log-normal distribution, for a single such quantity whose log is normally distributed
• Pareto distribution, for a single such quantity whose log is exponentially distributed; the prototypical power law
distribution
Basic distributions
• Bernoulli distribution, for the outcome of a single Bernoulli trial (e.g. success/failure, yes/no)
• Binomial distribution, for the number of "positive occurrences" (e.g. successes, yes votes, etc.) given a fixed total
number of independent occurrences
• Negative binomial distribution, for binomial-type observations but where the quantity of interest is the number of
failures before a given number of successes occurs
• Geometric distribution, for binomial-type observations but where the quantity of interest is the number of failures
before the first success; a special case of the negative binomial distribution
Related to categorical outcomes (events with K possible outcomes, with a given probability
for each outcome)
• Categorical distribution, for a single categorical outcome (e.g. yes/no/maybe in a survey); a generalization of the
Bernoulli distribution
• Multinomial distribution, for the number of each type of catergorical outcome, given a fixed number of total
outcomes; a generalization of the binomial distribution
• Multivariate hypergeometric distribution, similar to the multinomial distribution, but using sampling without
replacement; a generalization of the hypergeometric distribution
Related to events in a Poisson process (events that occur independently with a given rate)
• Poisson distribution, for the number of occurrences of a Poisson-type event in a given period of time
• Exponential distribution, for the time before the next Poisson-type event occurs
See also
• Copula (statistics) • Inverse transform sampling • Probability density function
• Cumulative distribution function • Likelihood function • Random variable
• List of probability distributions • List of statistical topics • Riemann–Stieltjes integral application to probability theory
• Histogram
Notes
[1] Everitt, B.S. (2006) The Cambridge Dictionary of Statistics, Third Edition. pp. 313–314. Cambridge University Press, Cambridge. ISBN
0521690277
Probability distribution 5
External links
• An 8-foot-tall (2.4 m) Probability Machine (named Sir Francis) comparing stock market returns to the
randomness of the beans dropping through the quincunx pattern. (http://www.youtube.com/
watch?v=AUSKTk9ENzg) from Index Funds Advisors IFA.com (http://www.ifa.com), youtube.com
• Interactive Discrete and Continuous Probability Distributions (http://www.socr.ucla.edu/htmls/
SOCR_Distributions.html), socr.ucla.edu
• A Compendium of Common Probability Distributions (http://www.causascientia.org/math_stat/Dists/
Compendium.pdf)
• A Compendium of Distributions (http://www.vosesoftware.com/content/ebook.pdf), vosesoftware.com
• Statistical Distributions - Overview (http://www.xycoon.com/contdistroverview.htm), xycoon.com
• Probability Distributions (http://www.sitmo.com/eqcat/8) in Quant Equation Archive, sitmo.com
• A Probability Distribution Calculator (http://www.covariable.com/continuous.html), covariable.com
• Sourceforge.net (http://sourceforge.net/projects/distexplorer/), Distribution Explorer: a mixed C++ and C#
Windows application that allows you to explore the properties of 20+ statistical distributions, and calculate CDF,
PDF & quantiles. Written using open-source C++ from the Boost.org (http://www.boost.org) Math Toolkit
library.
• Explore different probability distributions and fit your own dataset online - interactive tool (http://www.xjtek.
com/anylogic/demo_models/111/), xjtek.com
Article Sources and Contributors 6
License
Creative Commons Attribution-Share Alike 3.0 Unported
http:/ / creativecommons. org/ licenses/ by-sa/ 3. 0/