Advertisements
Advertisements
Question
The mean and standard deviation of a set of n1 observations are `barx_1` and s1, respectively while the mean and standard deviation of another set of n2 observations are `barx_2` and s2, respectively. Show that the standard deviation of the combined set of (n1 + n2) observations is given by
S.D. = `sqrt((n_1(s_1)^2 + n_2(s_2)^2)/(n_1 + n_2) + (n_1n_2 (barx_1 - barx_2)^2)/(n_1 + n_2)^2)`
Advertisements
Solution
Let xi' = 1, 2, 3, 4, ..., n1
And yj' = 1, 2, 3, 4, ..., n2
∴ `barx_1 = 1/n_1 sum_(i = 1)^n x_i`
And `barx_2 = 1/n_2 sum_(j = 1)^n y_j`
⇒ `sigma_1^2 = 1/n_1 sum_(i = 1)^(n_1) (x_i - barx_1)^2`
And `sigma_2^2 = 1/n_2 sum_(j = 1)^(n_2) (y_i - barx_2)^2`
Now mean of the combined series is given by
`barx = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) + sum_(j = 1)^(n_2) y_j]`
= `(n_1 barx_1 + n_2 x_2)/(n_1 + n_2)`
Therefore, `sigma^2` of the combined series is
`sigma^2 = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) (x_i - barx)^2 + sum_(j = 1)^(n_2) (y_j - barx)^2]`
Now, `sum_(i = 1)^(n_1) (x_i - barx)^2 = sum_(i = 1)^(n_1) (x_i - barx_j + bar_j - barx)^2`
= `sum_(i = 1)^(n_1) (x_i - x_j)^2 + n_1 (barx_j - barx)^2 + 2(barx_j - barx) sum_(i = 1)^(n_1) (x_i - barx_j)^2`
But `sum_(i = 1)^n (x_i - barx_i)` = 0
∵ The algebraic sum of the deviation of values of first series from their mean is zero
Also `sum_(i = 1)^(n_1) (x_i - barx)^2 = n_1s_1^2 + n_1(barx_1 - barx)^2`
= `n_1s_1^2 + n_1d_1^2`
Where `d_1 = (barx_1 - barx)`
Similarly, we have
`sum_(j = 1)^(n_2) (y_j - barx)^2 = sum_(j = 1)^(n_2) (y_j - barx_i + barx_i - barx)^2`
= `n_2s_2^2 + n_2d_2^2`
Where `d_2 = (barx_2 - barx)`
Now combined Standard Deviation (S.D.)
`sigma = sqrt((n_1(s_1^2 + d_1^2) + n_2(s_2^2 + d_2^2))/(n_1 + n_2))`
Where `d_1 = barx_1 - barx`
= `barx_1 - ((n_1barx_1 + n_2 barx_2)/(n_1 + n_2))`
= `(n_2(barx_1 - barx_2))/(n_1 + n_2)`
And `d_2 = barx_2 - barx`
= `barx_2 - ((n_1barx_1 + b_2barx_2)/(n_1 + n_2))`
= `(n_1(barx_2 - barx_1))/(n_1 + n_2)`
∴ `sigma^2 = 1/(n_1 + n_2)[n_1s_1^2 + n_2s_2^2 + (n_1n_2^2(barx_1 - barx_2)^2)/(n_1 + n_2)^2 + (n_2n_1^2(barx_2 - barx_1)^2)/(n_1 + n_2)^2]`
So, `sigma = sqrt((n_1s_1^2 + n_2s_2^2)/(n_1 + n_2) + (n_1n_2(barx_1 - barx_2)^2)/(n_1 + n_2)^2`
Hence proved.
APPEARS IN
RELATED QUESTIONS
Find the mean and variance for the data.
6, 7, 10, 12, 13, 4, 8, 12
Find the mean and variance for the first n natural numbers.
The mean and variance of 7 observations are 8 and 16, respectively. If five of the observations are 2, 4, 10, 12 and 14. Find the remaining two observations.
Find the mean, variance and standard deviation for the data:
2, 4, 5, 6, 8, 17.
The variance of 15 observations is 4. If each observation is increased by 9, find the variance of the resulting observations.
The mean and standard deviation of 6 observations are 8 and 4 respectively. If each observation is multiplied by 3, find the new mean and new standard deviation of the resulting observations.
The mean and standard deviation of 100 observations were calculated as 40 and 5.1 respectively by a student who took by mistake 50 instead of 40 for one observation. What are the correct mean and standard deviation?
Show that the two formulae for the standard deviation of ungrouped data
\[\sigma = \sqrt{\frac{1}{n} \sum \left( x_i - X \right)^2_{}}\] and
\[\sigma' = \sqrt{\frac{1}{n} \sum x_i^2 - X^2_{}}\] are equivalent, where \[X = \frac{1}{n}\sum_{} x_i\]
Calculate the mean and S.D. for the following data:
| Expenditure in Rs: | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |
| Frequency: | 14 | 13 | 27 | 21 | 15 |
Calculate the standard deviation for the following data:
| Class: | 0-30 | 30-60 | 60-90 | 90-120 | 120-150 | 150-180 | 180-210 |
| Frequency: | 9 | 17 | 43 | 82 | 81 | 44 | 24 |
A student obtained the mean and standard deviation of 100 observations as 40 and 5.1 respectively. It was later found that one observation was wrongly copied as 50, the correct figure being 40. Find the correct mean and S.D.
Calculate the mean, median and standard deviation of the following distribution:
| Class-interval: | 31-35 | 36-40 | 41-45 | 46-50 | 51-55 | 56-60 | 61-65 | 66-70 |
| Frequency: | 2 | 3 | 8 | 12 | 16 | 5 | 2 | 3 |
Find the mean and variance of frequency distribution given below:
| xi: | 1 ≤ x < 3 | 3 ≤ x < 5 | 5 ≤ x < 7 | 7 ≤ x < 10 |
| fi: | 6 | 4 | 5 | 1 |
Coefficient of variation of two distributions are 60% and 70% and their standard deviations are 21 and 16 respectively. What are their arithmetic means?
From the data given below state which group is more variable, G1 or G2?
| Marks | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 |
| Group G1 | 9 | 17 | 32 | 33 | 40 | 10 | 9 |
| Group G2 | 10 | 20 | 30 | 25 | 43 | 15 | 7 |
Find the coefficient of variation for the following data:
| Size (in cms): | 10-15 | 15-20 | 20-25 | 25-30 | 30-35 | 35-40 |
| No. of items: | 2 | 8 | 20 | 35 | 20 | 15 |
If X and Y are two variates connected by the relation
In a series of 20 observations, 10 observations are each equal to k and each of the remaining half is equal to − k. If the standard deviation of the observations is 2, then write the value of k.
If each observation of a raw data whose standard deviation is σ is multiplied by a, then write the S.D. of the new set of observations.
If v is the variance and σ is the standard deviation, then
If the standard deviation of a variable X is σ, then the standard deviation of variable \[\frac{a X + b}{c}\] is
If the S.D. of a set of observations is 8 and if each observation is divided by −2, the S.D. of the new set of observations will be
Let a, b, c, d, e be the observations with mean m and standard deviation s. The standard deviation of the observations a + k, b + k, c + k, d + k, e + k is
The standard deviation of first 10 natural numbers is
The mean of 100 observations is 50 and their standard deviation is 5. The sum of all squares of all the observations is
The standard deviation of the observations 6, 5, 9, 13, 12, 8, 10 is
Find the standard deviation of the first n natural numbers.
The mean and standard deviation of some data for the time taken to complete a test are calculated with the following results:
Number of observations = 25, mean = 18.2 seconds, standard deviation = 3.25 seconds. Further, another set of 15 observations x1, x2, ..., x15, also in seconds, is now available and we have `sum_(i = 1)^15 x_i` = 279 and `sum_(i = 1)^15 x^2` = 5524. Calculate the standard derivation based on all 40 observations.
The mean life of a sample of 60 bulbs was 650 hours and the standard deviation was 8 hours. A second sample of 80 bulbs has a mean life of 660 hours and standard deviation 7 hours. Find the overall standard deviation.
If for distribution `sum(x - 5)` = 3, `sum(x - 5)^2` = 43 and total number of items is 18. Find the mean and standard deviation.
Let x1, x2, ..., xn be n observations and `barx` be their arithmetic mean. The formula for the standard deviation is given by ______.
Let x1, x2, ... xn be n observations. Let wi = lxi + k for i = 1, 2, ...n, where l and k are constants. If the mean of xi’s is 48 and their standard deviation is 12, the mean of wi’s is 55 and standard deviation of wi’s is 15, the values of l and k should be ______.
The standard deviation is ______to the mean deviation taken from the arithmetic mean.
