Escolar Documentos
Profissional Documentos
Cultura Documentos
Problem Set 1
2. Plot histograms for the samples generated in Question 1 and match them with the shapes of
the original sampling distributions.
3. Generate a random sample of size of n = 10 from the exponential distribution with mean 3.
Calculate the mean of the generated sample. Repeat this process 100 times and in each case
record the sample mean. Plot the histogram of these sample means. Does this plot resemble
to normal distribution? If the size of the sample is changed from n = 10 to n = 150, what
is your observation. Can you think of a theoretical result that supports this observation.
Further, perform the same exercise by replacing the exponential distribution with Poisson
distribution with mean equal to 31 .
5. Data about the caret size of diamonds and their corresponding price is given in the file
diamond.csv. Fit a linear regression model to predict the price of a diamond given its
caret size.
6. Global warming is an important environment issue in the contemporary world. Data about
the cover of ice on earth and its corresponding year is provided in the file ice data.csv.
Fit a linear regression model to predict the cover of ice in the year 2017.
7. Load the data given in the file data 1.csv. Fit a linear regression model for this data and
compute the residuals. Can you say the residuals are normally distributed?
8. Load the data given in the file data 2.csv. Fit a linear regression model for this data and
compute the residuals. Can you say the residuals are normally distributed?
1
9. Consider the following simple linear regression model
y = 0 + 1 x +