Question
The mean and standard deviation of a set of n1 observations are `barx_1` and s1, respectively, while the mean and standard deviation of another set of n2 observations are `barx_2` and s2, respectively. Show that the standard deviation of the combined set of (n1 + n2) observations is given by
S.D. = `sqrt((n_1(s_1)^2 + n_2(s_2)^2)/(n_1 + n_2) + (n_1n_2 (barx_1 - barx_2)^2)/(n_1 + n_2)^2)`
Solution
Let `x_i`, i = 1, 2, 3, ..., n1, be the observations of the first set
and `y_j`, j = 1, 2, 3, ..., n2, be the observations of the second set.
Then `barx_1 = 1/n_1 sum_(i = 1)^(n_1) x_i`
And `barx_2 = 1/n_2 sum_(j = 1)^(n_2) y_j`
⇒ `s_1^2 = 1/n_1 sum_(i = 1)^(n_1) (x_i - barx_1)^2`
And `s_2^2 = 1/n_2 sum_(j = 1)^(n_2) (y_j - barx_2)^2`
Now mean of the combined series is given by
`barx = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) x_i + sum_(j = 1)^(n_2) y_j]`
= `(n_1 barx_1 + n_2 barx_2)/(n_1 + n_2)`
Therefore, `sigma^2` of the combined series is
`sigma^2 = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) (x_i - barx)^2 + sum_(j = 1)^(n_2) (y_j - barx)^2]`
Now, `sum_(i = 1)^(n_1) (x_i - barx)^2 = sum_(i = 1)^(n_1) (x_i - barx_1 + barx_1 - barx)^2`
= `sum_(i = 1)^(n_1) (x_i - barx_1)^2 + n_1 (barx_1 - barx)^2 + 2(barx_1 - barx) sum_(i = 1)^(n_1) (x_i - barx_1)`
But `sum_(i = 1)^(n_1) (x_i - barx_1)` = 0
∵ The algebraic sum of the deviations of the values of the first series from their mean is zero.
Therefore, `sum_(i = 1)^(n_1) (x_i - barx)^2 = n_1s_1^2 + n_1(barx_1 - barx)^2`
= `n_1s_1^2 + n_1d_1^2`
Where `d_1 = (barx_1 - barx)`
Similarly, we have
`sum_(j = 1)^(n_2) (y_j - barx)^2 = sum_(j = 1)^(n_2) (y_j - barx_2 + barx_2 - barx)^2`
= `n_2s_2^2 + n_2d_2^2`
Where `d_2 = (barx_2 - barx)`
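The decomposition used for each series can be checked numerically. The following is only a rough sketch: the two data lists are made-up illustrative values, not part of the question, and the variable names are chosen here for readability.

```python
import statistics

# Illustrative data (assumed for this sketch, not taken from the question).
xs = [2, 4, 4, 6]      # first series
ys = [10, 12, 14]      # second series

n1 = len(xs)
mean1 = statistics.fmean(xs)               # mean of the first series
s1_sq = statistics.pvariance(xs)           # population variance of the first series
combined_mean = statistics.fmean(xs + ys)  # mean of the combined series
d1 = mean1 - combined_mean

# Left side: squared deviations of the first series from the combined mean.
lhs = sum((x - combined_mean) ** 2 for x in xs)
# Right side: n1 * s1^2 + n1 * d1^2, as in the derivation.
rhs = n1 * s1_sq + n1 * d1 ** 2

print(lhs, rhs)  # the two values should agree up to rounding
```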
Now, the combined standard deviation (S.D.) is
`sigma = sqrt((n_1(s_1^2 + d_1^2) + n_2(s_2^2 + d_2^2))/(n_1 + n_2))`
Where `d_1 = barx_1 - barx`
= `barx_1 - ((n_1barx_1 + n_2 barx_2)/(n_1 + n_2))`
= `(n_2(barx_1 - barx_2))/(n_1 + n_2)`
And `d_2 = barx_2 - barx`
= `barx_2 - ((n_1barx_1 + n_2barx_2)/(n_1 + n_2))`
= `(n_1(barx_2 - barx_1))/(n_1 + n_2)`
∴ `sigma^2 = 1/(n_1 + n_2)[n_1s_1^2 + n_2s_2^2 + (n_1n_2^2(barx_1 - barx_2)^2)/(n_1 + n_2)^2 + (n_2n_1^2(barx_2 - barx_1)^2)/(n_1 + n_2)^2]`
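= `(n_1s_1^2 + n_2s_2^2)/(n_1 + n_2) + (n_1n_2(n_1 + n_2)(barx_1 - barx_2)^2)/(n_1 + n_2)^3`
= `(n_1s_1^2 + n_2s_2^2)/(n_1 + n_2) + (n_1n_2(barx_1 - barx_2)^2)/(n_1 + n_2)^2`
So, `sigma = sqrt((n_1s_1^2 + n_2s_2^2)/(n_1 + n_2) + (n_1n_2(barx_1 - barx_2)^2)/(n_1 + n_2)^2)`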
Hence proved.
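As a sanity check, the result can be compared against a direct computation on pooled data. This is a minimal sketch: the helper name `combined_sd` and the sample lists are assumptions made for illustration, and the standard deviations are population (divide by n) values, matching the definitions used in the solution.

```python
import math
import statistics

def combined_sd(n1, mean1, s1, n2, mean2, s2):
    """Population S.D. of the combined set, using the formula proved above."""
    n = n1 + n2
    within = (n1 * s1 ** 2 + n2 * s2 ** 2) / n         # weighted average of the variances
    between = n1 * n2 * (mean1 - mean2) ** 2 / n ** 2  # term due to the gap between the means
    return math.sqrt(within + between)

# Illustrative data (assumed for this sketch, not taken from the question).
xs = [2, 4, 4, 6]
ys = [10, 12, 14]

from_formula = combined_sd(
    len(xs), statistics.fmean(xs), statistics.pstdev(xs),
    len(ys), statistics.fmean(ys), statistics.pstdev(ys),
)
direct = statistics.pstdev(xs + ys)  # S.D. computed directly on the pooled data

print(from_formula, direct)  # the two values should agree up to rounding
```

When the two means are equal, the second term vanishes and the combined variance reduces to the weighted average of the two variances.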
RELATED QUESTIONS
Find the mean and variance for the data.
6, 7, 10, 12, 13, 4, 8, 12
Find the mean and variance for the first n natural numbers.
The sum and sum of squares corresponding to length x (in cm) and weight y (in gm) of 50 plant products are given below:
`sum_(i=1)^50 x_i = 212, sum_(i=1)^50 x_i^2 = 902.8, sum_(i=1)^50 y_i = 261, sum_(i=1)^50 y_i^2 = 1457.6`
Which is more varying, the length or weight?
The mean and variance of eight observations are 9 and 9.25, respectively. If six of the observations are 6, 7, 10, 12, 12 and 13, find the remaining two observations.
The mean and variance of 7 observations are 8 and 16, respectively. If five of the observations are 2, 4, 10, 12 and 14, find the remaining two observations.
Given that `barx` is the mean and σ² is the variance of n observations x1, x2, …, xn. Prove that the mean and variance of the observations ax1, ax2, ax3, …, axn are `abarx` and a²σ², respectively (a ≠ 0).
The mean and standard deviation of a group of 100 observations were found to be 20 and 3, respectively. Later on it was found that three observations were incorrect, which were recorded as 21, 21 and 18. Find the mean and standard deviation if the incorrect observations are omitted.
Find the mean, variance and standard deviation for the data:
2, 4, 5, 6, 8, 17.
Find the mean, variance and standard deviation for the data:
6, 7, 10, 12, 13, 4, 8, 12.
Find the mean, variance and standard deviation for the data:
227, 235, 255, 269, 292, 299, 312, 321, 333, 348.
Show that the two formulae for the standard deviation of ungrouped data
\[\sigma = \sqrt{\frac{1}{n} \sum \left( x_i - X \right)^2}\] and
\[\sigma' = \sqrt{\frac{1}{n} \sum x_i^2 - X^2}\] are equivalent, where \[X = \frac{1}{n}\sum x_i\]
Find the standard deviation for the following data:
| x : | 3 | 8 | 13 | 18 | 23 |
| f : | 7 | 10 | 15 | 10 | 6 |
Calculate the A.M. and S.D. for the following distribution:
| Class: | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 |
| Frequency: | 18 | 16 | 15 | 12 | 10 | 5 | 2 | 1 |
Calculate the mean, median and standard deviation of the following distribution:
| Class-interval: | 31-35 | 36-40 | 41-45 | 46-50 | 51-55 | 56-60 | 61-65 | 66-70 |
| Frequency: | 2 | 3 | 8 | 12 | 16 | 5 | 2 | 3 |
Mean and standard deviation of 100 observations were found to be 40 and 10 respectively. If at the time of calculation two observations were wrongly taken as 30 and 70 in place of 3 and 27 respectively, find the correct standard deviation.
Two plants A and B of a factory show the following results about the number of workers and the wages paid to them:
| | Plant A | Plant B |
| No. of workers | 5000 | 6000 |
| Average monthly wages | Rs 2500 | Rs 2500 |
| Variance of distribution of wages | 81 | 100 |
In which plant, A or B, is there greater variability in individual wages?
The means and standard deviations of heights and weights of 50 students of a class are as follows:
| | Weights | Heights |
| Mean | 63.2 kg | 63.2 inch |
| Standard deviation | 5.6 kg | 11.5 inch |
Which shows more variability, heights or weights?
Coefficients of variation of two distributions are 60% and 70% and their standard deviations are 21 and 16, respectively. What are their arithmetic means?
The mean and standard deviation of marks obtained by 50 students of a class in three subjects, mathematics, physics and chemistry are given below:
| Subject | Mathematics | Physics | Chemistry |
| Mean | 42 | 32 | 40.9 |
| Standard Deviation | 12 | 15 | 20 |
Which of the three subjects shows the highest variability in marks and which shows the lowest?
If v is the variance and σ is the standard deviation, then
The standard deviation of the data:
| x: | 1 | a | `a^2` | .... | `a^n` |
| f: | `""^nC_0` | `""^nC_1` | `""^nC_2` | .... | `""^nC_n` |
is
Let a, b, c, d, e be the observations with mean m and standard deviation s. The standard deviation of the observations a + k, b + k, c + k, d + k, e + k is
The standard deviation of first 10 natural numbers is
A set of n values x1, x2, ..., xn has standard deviation 6. The standard deviation of n values x1 + k, x2 + k, ..., xn + k will be ______.
Two sets, each of 20 observations, have the same standard deviation 5. The first set has a mean 17 and the second a mean 22. Determine the standard deviation of the set obtained by combining the given two sets.
The mean life of a sample of 60 bulbs was 650 hours and the standard deviation was 8 hours. A second sample of 80 bulbs has a mean life of 660 hours and standard deviation 7 hours. Find the overall standard deviation.
If for a distribution `sum(x - 5)` = 3, `sum(x - 5)^2` = 43 and the total number of items is 18, find the mean and standard deviation.
Let x1, x2, x3, x4, x5 be the observations with mean m and standard deviation s. The standard deviation of the observations kx1, kx2, kx3, kx4, kx5 is ______.
Let x1, x2, ... xn be n observations. Let wi = lxi + k for i = 1, 2, ...n, where l and k are constants. If the mean of xi’s is 48 and their standard deviation is 12, the mean of wi’s is 55 and standard deviation of wi’s is 15, the values of l and k should be ______.
The standard deviation of the first 10 natural numbers is ______.
Coefficients of variation of two distributions are 50 and 60, and their arithmetic means are 30 and 25, respectively. The difference of their standard deviations is ______.
If the variance of a data is 121, then the standard deviation of the data is ______.
The standard deviation is ______ to the mean deviation taken from the arithmetic mean.
The mean and standard deviation of six observations are 8 and 4, respectively. If each observation is multiplied by 3, find the new mean and new standard deviation of the resulting observations.
