HOMEWORK 05

Arithmetic Mean: Derivations and Generalization

In mathematics and statistics, the arithmetic mean is the sum of a collection of numbers divided by the count of numbers in the collection¹. Its formula is: $(1/n)*\sum_{i=1}^{n }\ a_{i}$

The arithmetic mean is the most commonly used and readily understood measure of central tendency in a data set. In statistics, the term average refers to any of the measures of central tendency. Actually mean functions tend to represent the whole set of values with a single number…its about synthesis of the population. Arithmetic mean is just one of the many different function used to analyze distriubtion and with it follow some derivations:

Geometric Mean

The geometric mean is a mean or average which indicates a central tendency of a set of numbers by using the product of their values ² (as opposed to the arithmetic mean which uses their sum). The geometric mean is defined as the nth root of the product of n numbers, i.e., for a set of numbers x1, x2, …, xn, the geometric mean is defined as
$(\prod_{i=1}^{n} x_1)^{1/n}=\sqrt[n]{x_1 x_2 ... x_n}$

Harmonic Mean

It’s one of several kinds of average. It is sometimes appropriate for situations when the average rate is desired. The harmonic mean can be expressed as the reciprocal of the arithmetic mean of the reciprocals of the given set of observations³. $H = \frac{n}{\frac{1}{x_1}+\frac{1}{x_2}+...+\frac{1}{x_n}}$

Weighted Arithmetic Mean

The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. The notion of weighted mean plays a role in descriptive statistics and also occurs in a more general form in several other areas of mathematics⁴. $(1/n)*\sum_{i=1}^{n }\ w_{i}*a_{i}$

It can be thought as a generalization of the standard arithmetic mean: actually if all the weights are equal, then the weighted mean is the same as the arithmetic mean.

Generalized Mean

Also known as power mean or Hölder mean are a family of functions for aggregating sets of numbers. A power mean is a mean of the form $M_p(x_1,...,x_n)=(\frac{1}{n}\sum_{i=1}^{n}{x_i^p})^{1/p}$ Where $p$ is a non-zero real number, and $x_1,…,x_n$ are positive real numbers. The following table summarizes some common named means that are special cases of the generalized mean, where

Mathematical Convergence vs Convergence in Probability

Convergence, in mathematics, property (exhibited by certain infinite series and functions) of approaching a limit more and more closely as an argument (variable) of the function increases or decreases or as the number of terms of the series increases. For example, the function y(x) = 1/x converges to zero as x increases. Although no finite value of x will cause the value of y to actually become zero, the limiting value of y is zero because y can be made as small as desired by choosing x large enough⁵.

Convergence in probability is said to be non-deterministic, stablished by the weak law of large numbers.

Convergence in distribution is in some sense the weakest type of convergence ⁶. All it says is that the CDF (cumulative distribution function) of Xn’s converges to the CDF of $X$ as n goes to infinity. It does not require any dependence between the $X_n$ and $X$.

In particular, for a sequence $X_1, X_2, X_3$ ⋯ to converge to a random variable $X$, we must have that $P(Abs(Xn−X)≥ϵ)$ goes to 0 as $n→∞$, for any $ϵ>0$.

Convergence in probability is stronger than convergence in distribution ⁷. When we refer about converge in probability, we are referring to convergence of a set of random variables $X_1,X_2,…,X_N$ to a distribution $X$ This, as opposed to point-wise convergence, is a function-wise convergence.

Differences between Descriptive Statistics and Inferential Statistics, the role of probability and probability distributions

With Inferential Statistics the dataset we use to retrieve information is just a limmited subset of the whole population: we try to draw conclusion about an unknown theorical population from a sample. Actually this kind of statistics infers properties of a population, which is said to be unknown.

On the other hand, in descriptive statistics what what we retrieve from a dataset is an empirical distribution, that’s because the sample is actually representative of the whole population.

Application

Here’s an application that meets both features. Code can be found in Github or can be downloaded.

Wikipedia, Arithmetic Mean: https://en.wikipedia.org/wiki/Arithmetic_mean ↩
Wikipedia, Geometric Mean: https://en.wikipedia.org/wiki/Geometric_mean ↩
Wikipedia, Harmonic Mean: https://en.wikipedia.org/wiki/Harmonic_mean ↩
Wikipedia, Weighted Arithmetic Mean: https://en.wikipedia.org/wiki/Weighted_arithmetic_mean ↩
Britannica.com, convergence: https://www.britannica.com/science/convergence-mathematics ↩
Probabilitycourse: https://www.probabilitycourse.com/chapter7/7_2_4_convergence_in_distribution.php ↩
Probabilitycourse: https://www.probabilitycourse.com/chapter7/7_2_5_convergence_in_probability.php ↩