Submission is due on Tuesday in Week 3.
Table 0.8 gives the consumer price index (CPI) for the UK, US, France and Germany from 2001 to 2010. CPI is a measure of the change in time of the price paid by the customer for a basket of fixed goods and services; here it has been standardised to be 100 in 2005. The data were provided by the World Bank International Monetary Fund and were obtained from http://datamarket.com/.
| Year | France | Germany | United Kingdom | United States | 
|---|---|---|---|---|
| 2001 | 92.5 | 94.5 | 94.2 | 90.7 | 
| 2002 | 94.3 | 95.8 | 95.4 | 92.1 | 
| 2003 | 96.2 | 96.8 | 96.7 | 94.2 | 
| 2004 | 98.3 | 98.5 | 98.0 | 96.7 | 
| 2005 | 100.0 | 100.0 | 100.0 | 100.0 | 
| 2006 | 101.7 | 101.6 | 102.3 | 103.2 | 
| 2007 | 103.2 | 103.9 | 104.7 | 106.2 | 
| 2008 | 106.1 | 106.6 | 108.5 | 110.2 | 
| 2009 | 106.2 | 107.0 | 110.8 | 109.9 | 
| 2010 | 107.8 | 108.2 | 114.5 | 111.7 | 
Assume that the data for the US are an IID sample from a Normal distribution and that the data for Germany are an IID sample from a Normal distribution. We wish to test whether or not the mean CPI for the US is the same as the mean CPI for Germany.
Explain why a paired approach is appropriate for this test.
[marks: 1]
Calculate the differences .
[marks: 1]
Using your answer to part (b) or otherwise, test at the 10% level whether or not there is evidence that the mean CPI for the US is the same as the mean CPI for Germany. You should state your -value.
[marks: 3]
Table 0.6 shows UK interest rates for January split into two decades, 1960–1969 and 1970–1979. Is there evidence that the mean interest rate in the 1960’s is lower than the mean interest rate in the 1970’s?
| 1960–1969 | 1970–1979 | 
|---|---|
| 5.0 | 8.0 | 
| 5.0 | 7.0 | 
| 6.0 | 5.0 | 
| 4.0 | 8.8 | 
| 4.0 | 12.8 | 
| 7.0 | 11.0 | 
| 6.0 | 10.5 | 
| 6.5 | 12.2 | 
| 8.0 | 6.5 | 
| 7.0 | 12.5 | 
Calculate the pooled sample variance for the data.
[marks: 2]
Carry out an appropriate -test at the 5% level. What do you conclude about the difference between the mean interest rates in the two decades?
[marks: 3]
Challenge
The geometric distribution gives probabilities for the number of Bernoulli trials required before the first success. The distribution has a single parameter  which can be interpreted as the probability of success for each trial. In the game of ludo a ‘six’ must be thrown on a six-sided dice before a player can start. A player recorded the number of throws that they made before obtaining a six in each of ten games. The same dice was used throughout all games, and the results are as follows:
Assuming that these data come from a Geometric distribution, is there evidence that is greater than , i.e. that the dice is loaded?
Given that the expectation of a random variable is , explain why is a sensible estimator for . Obtain an estimate of using the above sample.
[marks: 2]
By writing appropriate code in R, carry out a non-parametric bootstrap routine to obtain a 90% confidence interval for . Hint you might want to adapt the code given in workshop question WS2.1.
[marks: 2]
Is there evidence at the 5% level that is greater than 1/6?
[marks: 1]