5. Expected Values of Functions of Random Vectors
Let $(X,Y)$ be a two-dimensional random variable and $D_{(X,Y)}$ the set of points of discontinuity of the joint cumulative distribution function $F_{X,Y}(x,y)$.
Definition: Let $g(X,Y)$ be a function of the two-dimensional random variable $(X,Y)$. Then, the expected value of $g(X,Y)$ is given by
If $(X,Y)$ is a two-dimensional discrete random variable: $$E[g(X,Y)]=\sum_{(x,y)\in D_{(X,Y)}} g(x,y)\, f_{X,Y}(x,y),$$ provided that $\sum_{(x,y)\in D_{(X,Y)}} |g(x,y)|\, f_{X,Y}(x,y)<+\infty$.
If $(X,Y)$ is a two-dimensional continuous random variable: $$E[g(X,Y)]=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} g(x,y)\, f_{X,Y}(x,y)\,dx\,dy,$$ provided that $\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} |g(x,y)|\, f_{X,Y}(x,y)\,dx\,dy<+\infty$.
Example 1: Let $(X,Y)$ be a continuous bidimensional random variable such that $$f_{X,Y}(x,y)=\begin{cases}x, & 0<x<1,\ 0<y<2\\ 0, & \text{otherwise}\end{cases}$$ Compute the expected value of $g(X,Y)=X+Y$.
Answer: Using the definition of expected value, one gets $$E(X+Y)=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}(x+y)\, f_{X,Y}(x,y)\,dx\,dy=\int_{0}^{2}\int_{0}^{1} x(x+y)\,dx\,dy=\int_{0}^{2}\left(\frac{1}{3}+\frac{y}{2}\right)dy=\frac{5}{3}$$
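For readers who want to double-check the integral, here is a small sketch using Python's sympy library (the density is the one from Example 1; the script itself is not part of the original notes):

```python
import sympy as sp

# Joint density of Example 1: f(x, y) = x on 0 < x < 1, 0 < y < 2.
x, y = sp.symbols('x y')
f = x

# E(X + Y) = double integral of (x + y) * f(x, y) over the support.
E_X_plus_Y = sp.integrate((x + y) * f, (x, 0, 1), (y, 0, 2))
print(E_X_plus_Y)  # 5/3
```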
Theorem: Let $(X,Y)$ be a discrete two-dimensional random variable with joint probability function $f_{X,Y}(x,y)$:
If $g(X,Y)=h(X)$, that is, $g(X,Y)$ only depends on $X$, then $$E[g(X,Y)]=E[h(X)]=\sum_{(x,y)\in D_{(X,Y)}} h(x)\, f_{X,Y}(x,y)=\sum_{x\in D_X} h(x)\sum_{y\in D_Y} f_{X,Y}(x,y)=\sum_{x\in D_X} h(x)\, f_X(x),$$ provided that $\sum_{(x,y)\in D_{(X,Y)}} |h(x)|\, f_{X,Y}(x,y)<+\infty$.
If $g(X,Y)=v(Y)$, that is, $g(X,Y)$ only depends on $Y$, then $$E[v(Y)]=\sum_{(x,y)\in D_{(X,Y)}} v(y)\, f_{X,Y}(x,y)=\sum_{y\in D_Y} v(y)\sum_{x\in D_X} f_{X,Y}(x,y)=\sum_{y\in D_Y} v(y)\, f_Y(y),$$ provided that $\sum_{(x,y)\in D_{(X,Y)}} |v(y)|\, f_{X,Y}(x,y)<+\infty$.
Example 2: Let $(X,Y)$ be a two-dimensional discrete random variable such that $$f_{X,Y}(x,y)=\begin{cases}1/5, & x=1,2,\ y=0,1,2,\ y\le x\\ 0, & \text{otherwise}\end{cases}$$ Compute the expected value of $X$.
Solution:
(i) By using the joint probability function: $$E(X)=\sum_{(x,y)\in D_{(X,Y)}} x\, f_{X,Y}(x,y)=\sum_{x=1}^{2} x\sum_{y=0}^{x}\frac{1}{5}=\frac{8}{5}$$
(ii) By using the marginal probability function: $$f_X(x)=\sum_{y=0}^{x} f_{X,Y}(x,y)=\begin{cases}2/5, & x=1\\ 3/5, & x=2\\ 0, & \text{otherwise}\end{cases}$$ Therefore, $E(X)=\sum_{x=1}^{2} x\, f_X(x)=1\times\frac{2}{5}+2\times\frac{3}{5}=\frac{8}{5}.$
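As a quick sanity check of both approaches, here is a short Python sketch (not part of the original notes) that enumerates the five support points of this joint pmf:

```python
from fractions import Fraction

# Joint pmf of Example 2: f(x, y) = 1/5 for x = 1, 2 and y = 0, ..., x.
f = {(x, y): Fraction(1, 5) for x in (1, 2) for y in range(3) if y <= x}

# (i) E(X) directly from the joint pmf.
E_X_joint = sum(x * p for (x, y), p in f.items())

# (ii) E(X) from the marginal pmf of X.
f_X = {x: sum(p for (xx, y), p in f.items() if xx == x) for x in (1, 2)}
E_X_marginal = sum(x * p for x, p in f_X.items())

print(E_X_joint, E_X_marginal)  # 8/5 8/5
```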
Theorem: Let $(X,Y)$ be a continuous two-dimensional random variable with joint probability density function $f_{X,Y}(x,y)$:
- If $g(X,Y)=h(X)$, that is, $g(X,Y)$ only depends on $X$, then
$$E[h(X)]=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} h(x)\, f_{X,Y}(x,y)\,dx\,dy=\int_{-\infty}^{+\infty} h(x)\left(\int_{-\infty}^{+\infty} f_{X,Y}(x,y)\,dy\right)dx=\int_{-\infty}^{+\infty} h(x)\, f_X(x)\,dx,$$ provided that $\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} |h(x)|\, f_{X,Y}(x,y)\,dx\,dy<+\infty$.
- If $g(X,Y)=v(Y)$, that is, $g(X,Y)$ only depends on $Y$, then
$$E[v(Y)]=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} v(y)\, f_{X,Y}(x,y)\,dx\,dy=\int_{-\infty}^{+\infty} v(y)\left(\int_{-\infty}^{+\infty} f_{X,Y}(x,y)\,dx\right)dy=\int_{-\infty}^{+\infty} v(y)\, f_Y(y)\,dy,$$ provided that $\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} |v(y)|\, f_{X,Y}(x,y)\,dx\,dy<+\infty$.
Example 3: Let $(X,Y)$ be a continuous bidimensional random variable such that $$f_{X,Y}(x,y)=\begin{cases}x, & 0<x<1,\ 0<y<2\\ 0, & \text{otherwise}\end{cases}$$ Compute the expected value of $3X+2$.
Answer:
(i) Using the joint density function.
Using the definition of expected value and the linearity of the expectation, one gets $$E(3X+2)=3E(X)+2=3\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} x\, f_{X,Y}(x,y)\,dx\,dy+2=3\int_{0}^{2}\int_{0}^{1} x^2\,dx\,dy+2=3\int_{0}^{2}\frac{1}{3}\,dy+2=4$$
(ii) Using the marginal density function.
The marginal density function of $X$ is given by $$f_X(x)=\int_{-\infty}^{\infty} f_{X,Y}(x,y)\,dy=\begin{cases}2x, & 0<x<1\\ 0, & \text{otherwise}\end{cases}$$ Therefore, $E(3X+2)=3E(X)+2=4$, because $E(X)=\int_{-\infty}^{+\infty} x\, f_X(x)\,dx=\int_{0}^{1} 2x^2\,dx=\frac{2}{3}$.
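Both routes can again be verified symbolically; the following sympy sketch (not part of the original notes) reproduces the two computations of $E(3X+2)$:

```python
import sympy as sp

# Joint density of Example 3: f(x, y) = x on 0 < x < 1, 0 < y < 2.
x, y = sp.symbols('x y')
f = x

# (i) Using the joint density directly.
E_from_joint = sp.integrate((3 * x + 2) * f, (x, 0, 1), (y, 0, 2))

# (ii) Using the marginal density of X: f_X(x) = 2x on (0, 1).
f_X = sp.integrate(f, (y, 0, 2))
E_from_marginal = sp.integrate((3 * x + 2) * f_X, (x, 0, 1))

print(E_from_joint, E_from_marginal)  # 4 4
```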
Properties:
$E[h(X)+v(Y)]=E[h(X)]+E[v(Y)]$, provided that $E[|h(X)|]<+\infty$ and $E[|v(Y)|]<+\infty$.
$E\left[\sum_{i=1}^{N} X_i\right]=\sum_{i=1}^{N} E[X_i]$, where $N$ is a finite integer, provided that $E[|X_i|]<+\infty$ for $i=1,2,\dots,N$.
Example 4: Let $(X,Y)$ be a continuous bidimensional random variable such that $$f_{X,Y}(x,y)=\begin{cases}x, & 0<x<1,\ 0<y<2\\ 0, & \text{otherwise}\end{cases}$$ Compute the expected value of $Y$.
Answer: We know that $E(X+Y)=E(X)+E(Y)=\frac{5}{3}$. Since $E(X)=\frac{2}{3}$, we get that $E(Y)=1$.
Definition: The $r$th and $s$th moment of products about the origin of the random variables $X$ and $Y$, denoted by $\mu'_{r,s}$, is the expected value of $X^r Y^s$, for $r=1,2,\dots$; $s=1,2,\dots$, which is given by
if $X$ and $Y$ are discrete random variables: $\mu'_{r,s}=E[X^r Y^s]=\sum_{(x,y)\in D_{(X,Y)}} x^r y^s\, f_{X,Y}(x,y)$
if $X$ and $Y$ are continuous random variables: $\mu'_{r,s}=E[X^r Y^s]=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} x^r y^s\, f_{X,Y}(x,y)\,dx\,dy$
Remarks:
If $r=s=1$, we have $\mu'_{1,1}=E[XY]$.
Cauchy-Schwarz Inequality: For any two random variables $X$ and $Y$, we have $|E[XY]|\le E[X^2]^{1/2}\, E[Y^2]^{1/2}$, provided that $E[|XY|]$ is finite.
If $X$ and $Y$ are independent random variables, $E[h(X)v(Y)]=E[h(X)]\,E[v(Y)]$ for any two functions $h(X)$ and $v(Y)$.
[Warning: The converse is not true.]
If $X_1,X_2,\dots,X_n$ are independent random variables, $E[X_1 X_2\cdots X_n]=E(X_1)E(X_2)\cdots E(X_n)$.
[Warning: The converse is not true; see the sketch below.]
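To see why the converse fails, consider a classic counterexample (added here for illustration, not taken from the notes): $X$ uniform on $\{-1,0,1\}$ and $Y=X^2$. Then $E[XY]=E[X]E[Y]$ even though $Y$ is a deterministic function of $X$, hence not independent of it. A short numerical check:

```python
import numpy as np

# X uniform on {-1, 0, 1}; Y = X**2 is completely determined by X.
x_vals = np.array([-1.0, 0.0, 1.0])
p = np.array([1/3, 1/3, 1/3])

E_X = np.sum(x_vals * p)          # 0
E_Y = np.sum(x_vals**2 * p)       # 2/3
E_XY = np.sum(x_vals**3 * p)      # E[X * Y] = E[X**3] = 0

# E[XY] = E[X]E[Y] holds, yet X and Y are clearly dependent.
print(np.isclose(E_XY, E_X * E_Y))  # True
```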
Definition: The $r$th and $s$th moment of products about the mean of the discrete random variables $X$ and $Y$, denoted by $\mu_{r,s}$, is the expected value of $(X-\mu_X)^r (Y-\mu_Y)^s$, for $r=1,2,\dots$; $s=1,2,\dots$, which is given by $$\mu_{r,s}=E[(X-\mu_X)^r (Y-\mu_Y)^s]=\sum_{(x,y)\in D_{(X,Y)}} (x-\mu_X)^r (y-\mu_Y)^s\, f_{X,Y}(x,y)$$
Definition: The $r$th and $s$th moment of products about the mean of the continuous random variables $X$ and $Y$, denoted by $\mu_{r,s}$, for $r=1,2,\dots$; $s=1,2,\dots$, is given by $$\mu_{r,s}=E[(X-\mu_X)^r (Y-\mu_Y)^s]=\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} (x-\mu_X)^r (y-\mu_Y)^s\, f_{X,Y}(x,y)\,dx\,dy$$
The covariance is a measure of the joint variability of two random variables. Formally, it is defined as $$Cov(X,Y)=\sigma_{XY}=\mu_{1,1}=E[(X-\mu_X)(Y-\mu_Y)]$$
How can we interpret the covariance?
When the variables tend to show similar behavior, the covariance is positive:
- If high (low) values of one variable mainly correspond to high (low) values of the other variable;
When the variables tend to show opposite behavior, the covariance is negative:
- When high (low) values of one variable mainly correspond to low (high) values of the other;
If there is no linear association, then the covariance will be zero.
Properties:
Cov(X,Y)=E(XY)−E(X)E(Y).
If X and Y are independent, Cov(X,Y)=0.
If Y=bZ, where b is constant, Cov(X,Y)=bCov(X,Z).
If Y=V+W, Cov(X,Y)=Cov(X,V)+Cov(X,W).
If Y=b, where b is constant, Cov(X,Y)=0.
It follows from the Cauchy-Schwarz Inequality that $|Cov(X,Y)|\le\sqrt{Var(X)\,Var(Y)}$.
The covariance has the drawback of depending on the scale of both random variables. For what values of the covariance can we say that there is a strong association between the two random variables?
The correlation coefficient is a measure of the joint variability of two random variables that does not depend on the scale: $$\rho_{X,Y}=\frac{Cov(X,Y)}{\sqrt{Var(X)\,Var(Y)}}.$$
Properties:
- It follows from the Cauchy-Schwarz Inequality that $-1\le\rho_{X,Y}\le 1$.
If Y=bX+a, where b and a are constants
ρX,Y=1 if b>0.
ρX,Y=−1 if b<0.
If b=0, it is not defined.
Summary of important results:
If Y=V±W, Var(Y)=Var(V)+Var(W)±2Cov(V,W).
If $X_1,\dots,X_n$ are random variables, $a_1,\dots,a_n$ are constants and $Y=\sum_{i=1}^{n} a_i X_i$, then $$Var(Y)=\sum_{i=1}^{n} a_i^2\, Var(X_i)+2\sum_{i=1}^{n}\sum_{\substack{j=1\\ j<i}}^{n} a_i a_j \underbrace{Cov(X_i,X_j)}_{=0,\ \text{if } X_i, X_j \text{ are independent}}$$
If $X_1,\dots,X_n$ are random variables, $a_1,\dots,a_n$ and $b_1,\dots,b_n$ are constants, $Y_1=\sum_{i=1}^{n} a_i X_i$ and $Y_2=\sum_{i=1}^{n} b_i X_i$, then $$Cov(Y_1,Y_2)=\sum_{i=1}^{n} a_i b_i\, Var(X_i)+\sum_{i=1}^{n}\sum_{\substack{j=1\\ j<i}}^{n} (a_i b_j+a_j b_i)\underbrace{Cov(X_i,X_j)}_{=0,\ \text{if } X_i, X_j \text{ are independent}}$$
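These two formulas are simply the term-by-term expansions of $a'\Sigma a$ and $a'\Sigma b$, where $\Sigma$ is the covariance matrix of $(X_1,\dots,X_n)$. The following numpy sketch (with an arbitrary, hypothetical covariance matrix, not taken from the notes) checks the expansions against the matrix form:

```python
import numpy as np

# Hypothetical covariance matrix of (X1, X2, X3) and coefficient vectors.
Sigma = np.array([[2.0, 0.3, -0.5],
                  [0.3, 1.0,  0.2],
                  [-0.5, 0.2, 1.5]])
a = np.array([1.0, -2.0, 0.5])
b = np.array([0.5, 1.0, 1.0])
n = len(a)

# Var(Y1), Y1 = sum_i a_i X_i, expanded term by term as in the formula above.
var_Y1 = sum(a[i]**2 * Sigma[i, i] for i in range(n)) + \
         2 * sum(a[i] * a[j] * Sigma[i, j] for i in range(n) for j in range(i))

# Cov(Y1, Y2), Y2 = sum_i b_i X_i.
cov_Y1Y2 = sum(a[i] * b[i] * Sigma[i, i] for i in range(n)) + \
           sum((a[i] * b[j] + a[j] * b[i]) * Sigma[i, j] for i in range(n) for j in range(i))

# Same quantities in matrix form: a' Sigma a and a' Sigma b.
print(np.isclose(var_Y1, a @ Sigma @ a))    # True
print(np.isclose(cov_Y1Y2, a @ Sigma @ b))  # True
```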
Definition: Let (X,Y) be a two dimensional random variable and u(Y,X) a function of Y and X. Then, the conditional expectation of u(Y,X) given X=x, is given by
if $X$ and $Y$ are discrete random variables: $$E[u(Y,X)\mid X=x]=\sum_{y\in D_Y} u(y,x)\, f_{Y|X=x}(y)$$ where $D_Y$ is the set of discontinuity points of $F_Y(y)$ and $f_{Y|X=x}(y)$ is the value of the conditional probability function of $Y$ given $X=x$ at $y$;
if $X$ and $Y$ are continuous random variables: $$E[u(Y,X)\mid X=x]=\int_{-\infty}^{+\infty} u(y,x)\, f_{Y|X=x}(y)\,dy$$ where $f_{Y|X=x}(y)$ is the value of the conditional probability density function of $Y$ given $X=x$ at $y$,
provided that the expected values exist and are finite.
Remarks:
If $u(Y,X)=Y$, then we have the conditional mean of $Y$: $E[u(Y,X)\mid X=x]=E[Y\mid X=x]=\mu_{Y|x}$ (notice that this is a function of $x$).
If $u(Y,X)=(Y-\mu_{Y|x})^2$, then we have the conditional variance of $Y$: $$E[u(Y,X)\mid X=x]=E[(Y-\mu_{Y|x})^2\mid X=x]=E[(Y-E[Y\mid X=x])^2\mid X=x]=Var[Y\mid X=x]$$
As usual, $Var[Y\mid X=x]=E[Y^2\mid X=x]-\left(E[Y\mid X=x]\right)^2$.
If $Y$ and $X$ are independent, $E(Y\mid X=x)=E(Y)$.
Of course, we can reverse the roles of $Y$ and $X$, that is, we can compute $E(u(X,Y)\mid Y=y)$, using definitions similar to those above.
Example: Let $(X,Y)$ be a two-dimensional random variable such that $$f_{X,Y}(x,y)=\begin{cases}1/2, & 0<x<2,\ 0<y<x\\ 0, & \text{otherwise}\end{cases}$$ Then the conditional density function of $Y\mid X=1$ is given by $$f_{Y|X=1}(y)=\begin{cases}\dfrac{f_{X,Y}(1,y)}{f_X(1)}, & 0<y<1\\ 0, & \text{otherwise}\end{cases}=\begin{cases}\dfrac{1/2}{1/2}, & 0<y<1\\ 0, & \text{otherwise}\end{cases}=\begin{cases}1, & 0<y<1\\ 0, & \text{otherwise}\end{cases}$$ where $$f_X(x)=\begin{cases}\int_0^x f_{X,Y}(x,y)\,dy, & 0<x<2\\ 0, & \text{otherwise}\end{cases}=\begin{cases}\dfrac{x}{2}, & 0<x<2\\ 0, & \text{otherwise}\end{cases}$$
The conditional expected value can then be computed as follows:
$$E(Y\mid X=1)=\int_{0}^{1} y\, f_{Y|X=1}(y)\,dy=\int_{0}^{1} y\,dy=\frac{1}{2}.$$ To compute the conditional variance, one may start by computing the following conditional expected value: $$E(Y^2\mid X=1)=\int_{0}^{1} y^2\, f_{Y|X=1}(y)\,dy=\int_{0}^{1} y^2\,dy=\frac{1}{3}.$$ Therefore, $$Var(Y\mid X=1)=E(Y^2\mid X=1)-\left(E(Y\mid X=1)\right)^2=\frac{1}{3}-\frac{1}{4}=\frac{1}{12}$$
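A quick symbolic check of these three conditional quantities (a sympy sketch, assuming the conditional density $f_{Y|X=1}(y)=1$ on $(0,1)$ derived above):

```python
import sympy as sp

y = sp.symbols('y')
f_cond = sp.Integer(1)   # f_{Y|X=1}(y) = 1 for 0 < y < 1

E_Y = sp.integrate(y * f_cond, (y, 0, 1))        # conditional mean: 1/2
E_Y2 = sp.integrate(y**2 * f_cond, (y, 0, 1))    # conditional second moment: 1/3
Var_Y = E_Y2 - E_Y**2                            # conditional variance: 1/12

print(E_Y, E_Y2, Var_Y)
```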
Example: Let $X$ and $Y$ be two random variables such that $f_{X,Y}(x,y)=\frac{1}{9}$, for $x=1,2,3$, $y=0,1,2,3$, $y\le x$.
To compute the conditional expected value, one has to compute the conditional probability function:
$$f_{Y|X=1}(y)=\begin{cases}\dfrac{f_{X,Y}(1,y)}{f_X(1)}, & y=0,1\\ 0, & \text{otherwise}\end{cases}=\begin{cases}\dfrac{1}{2}, & y=0,1\\ 0, & \text{otherwise}\end{cases}$$ where $$f_X(1)=\sum_{y=0}^{1} f_{X,Y}(1,y)=\sum_{y=0}^{1}\frac{1}{9}=\frac{2}{9}$$ Therefore, $E(Y\mid X=1)=\sum_{y\in D_Y} y\, f_{Y|X=1}(y)=0\times\frac{1}{2}+1\times\frac{1}{2}=\frac{1}{2}.$
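The same conditional expectation can be obtained by enumerating the support of the pmf; here is a short Python check (not part of the original notes):

```python
from fractions import Fraction

# Joint pmf: f(x, y) = 1/9 for x = 1, 2, 3 and y = 0, ..., 3 with y <= x.
f = {(x, y): Fraction(1, 9) for x in (1, 2, 3) for y in range(4) if y <= x}

f_X1 = sum(p for (x, y), p in f.items() if x == 1)                    # f_X(1) = 2/9
E_Y_given_X1 = sum(y * p / f_X1 for (x, y), p in f.items() if x == 1)

print(f_X1, E_Y_given_X1)  # 2/9 1/2
```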
Notice that $g(y)=E(X\mid Y=y)$ is indeed a function of $y$. Therefore, $g(Y)$ is a random variable, because $Y$ can take different values according to its distribution; i.e., if $Y$ can take the value $y$, then $g(Y)$ can take the value $g(y)$ with probability $P(Y=y)>0$.
- Discrete random variables
The random variable $Z=g(Y)=E(X\mid Y)$ takes the values $g(y)=E(X\mid Y=y)$. Assume that all values of $g(y)$ are different. Then, $Z$ takes the value $g(y)$ with probability $P(Y=y)$.
In general, the probability function of $Z=g(Y)=E(X\mid Y)$ can be computed in the following way: $$P(Z=z)=P(g(Y)=z)=P(Y\in\{y: g(y)=z\})$$
Example: Let (X,Y) be a discrete random variable such that fX,Y(x,y) is represented in the following table
| X/Y | 1 | 2 | 3 |
|-----|------|------|------|
| 0 | 0.20 | 0.10 | 0.15 |
| 1 | 0.05 | 0.35 | 0.15 |
One may compute the following conditional probability functions: $$f_{Y|X=0}(y)=\begin{cases}4/9, & y=1\\ 2/9, & y=2\\ 3/9, & y=3\\ 0, & \text{otherwise}\end{cases}\qquad\text{and}\qquad f_{Y|X=1}(y)=\begin{cases}1/11, & y=1\\ 7/11, & y=2\\ 3/11, & y=3\\ 0, & \text{otherwise}\end{cases}$$ Consequently, $E(Y\mid X=0)=17/9$ and $E(Y\mid X=1)=24/11$. Therefore, the random variable $Z=E(Y\mid X)$ has the following probability function: $$P(Z=z)=\begin{cases}P(X=0), & z=17/9\\ P(X=1), & z=24/11\\ 0, & \text{otherwise}\end{cases}=\begin{cases}0.45, & z=17/9\\ 0.55, & z=24/11\\ 0, & \text{otherwise}\end{cases}$$
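The whole example can be reproduced with a few lines of Python (a sketch added here, starting from the table of joint probabilities):

```python
from fractions import Fraction

# Joint pmf of (X, Y) from the table above.
f = {(0, 1): Fraction(20, 100), (0, 2): Fraction(10, 100), (0, 3): Fraction(15, 100),
     (1, 1): Fraction(5, 100),  (1, 2): Fraction(35, 100), (1, 3): Fraction(15, 100)}

# For each x: marginal P(X = x) and conditional mean E(Y | X = x).
for x in (0, 1):
    p_x = sum(p for (xx, y), p in f.items() if xx == x)
    e_y = sum(y * p / p_x for (xx, y), p in f.items() if xx == x)
    print(x, p_x, e_y)
# 0 9/20 17/9
# 1 11/20 24/11
```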
- Continuous random variables
The cumulative distribution function of $Z=g(Y)=E(X\mid Y)$ is, indeed, $$F_Z(z)=P(Z\le z)=P(g(Y)\le z)=P(Y\in\{y: g(y)\le z\})$$ When $g$ is a strictly increasing function, we get that $F_Z(z)=F_Y(g^{-1}(z))$, or equivalently $F_Z(g(y))=F_Y(y)$.
Therefore, we can calculate all the quantities that we know (the expected value, variance, ...) for $E(X\mid Y)$ or $E(Y\mid X)$.
Theorem (Law of Iterated Expectations): Let $(X,Y)$ be a two-dimensional random variable. Then, $E(Y)=E(E[Y\mid X])$ provided that $E(|Y|)$ is finite, and $E(X)=E(E[X\mid Y])$ provided that $E(|X|)$ is finite.
Remark: This theorem shows that there are two ways to compute E(Y) (resp., E(X)). The first is the direct way. The second way is to consider the following steps:
- compute $E[Y\mid X=x]$ and notice that this is a function solely of $x$; that is, we can write $g(x)=E[Y\mid X=x]$;
- according to the theorem, replacing $g(x)$ by $g(X)$ and taking the mean, we obtain $E[g(X)]=E[Y]$ for this specific form of $g(X)$.
This theorem is useful in practice in the calculation of $E(Y)$ if we know $f_{Y|X=x}(y)$ or $E[Y\mid X=x]$ and $f_X(x)$ (or some moments of $X$), but not $f_{X,Y}(x,y)$.
Remarks: The results presented can be generalized for functions of X and Y, i.e., E(u(X,Y))=E(E(u(X,Y)|X)), if E(u(X,Y)) exists.
Example: Let $(X,Y)$ be a bi-dimensional continuous random variable such that $$E(X\mid Y=y)=\frac{3y-1}{3}\qquad\text{and}\qquad f_Y(y)=\begin{cases}1/2, & 0<y<2\\ 0, & \text{otherwise}\end{cases}$$ Taking into account the previous theorem, $E(X)=E(E(X\mid Y))=E\!\left(\frac{3Y-1}{3}\right)=\int_{0}^{2}\frac{3y-1}{6}\,dy=2/3.$
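A symbolic check of this application of the law of iterated expectations (a sympy sketch, not part of the original notes):

```python
import sympy as sp

y = sp.symbols('y')
g = (3 * y - 1) / 3      # E(X | Y = y)
f_Y = sp.Rational(1, 2)  # density of Y on (0, 2)

# E(X) = E(E(X | Y)) = integral of g(y) * f_Y(y) over (0, 2).
E_X = sp.integrate(g * f_Y, (y, 0, 2))
print(E_X)  # 2/3
```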
Theorem: Assuming that $E(Y^2)$ exists, $Var(Y)=Var[E(Y\mid X)]+E[Var[Y\mid X]]$.
Theorem: Let $X$ and $Y$ be two random variables. Then $Cov(X,Y)=Cov(X,E(Y\mid X))$.
Example: Let $(X,Y)$ be a bidimensional random variable such that $$f_{X|Y=y}(x)=\frac{1}{y},\ 0<x<y\ (\text{for a fixed } y>1)\qquad\text{and}\qquad f_Y(y)=3y^{-4},\ y>1$$ Compute $Var(X)$ using the previous theorem.
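One possible way to carry out this computation is sketched below in sympy, using the facts that $X\mid Y=y$ is uniform on $(0,y)$, so $E(X\mid Y)=Y/2$ and $Var(X\mid Y)=Y^2/12$ (the script is an added illustration, not part of the original notes):

```python
import sympy as sp

y = sp.symbols('y', positive=True)
f_Y = 3 * y**(-4)   # density of Y on (1, +oo)

E_Y = sp.integrate(y * f_Y, (y, 1, sp.oo))        # 3/2
E_Y2 = sp.integrate(y**2 * f_Y, (y, 1, sp.oo))    # 3
var_Y = E_Y2 - E_Y**2                             # 3/4

# Law of total variance: Var(X) = Var[E(X|Y)] + E[Var(X|Y)]
#                                = Var(Y/2)    + E(Y**2/12)
var_X = var_Y / 4 + E_Y2 / 12
print(var_X)  # 7/16
```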
Exam question: Let X and Y be two random variables such that E(X|Y=y)=y, for all y such that fY(y)>0. Prove that Cov(X,Y)=Var(Y). Are the random variables independent? Justify your answer.