Descriptive Statistics and Introduction to R

Overview and Rationale

It is important for you to be able to describe data numerically and graphically and using

multiple regression to predict influential variables. In this assignment you will use R in a

hands-on experience on data analytics as a review.

Course Outcomes

This assignment is directly linked to the following key learning outcomes from the course

syllabus:

? Describe data numerically and graphically and predict influential variables for real world

business problems

Assignment Summary

Use R functions to characterize data numerically by using descriptive statistics and

graphically.

You will then report your work and findings in a paper of appropriate length accommodate

the content .

Use the following supporting materials for R syntax, data sets and tools:

? Using R for Data Analysis and Graphics by J H Maindonald.

? Quick R

Follow the instructions below for each part of the assignment:

Submission Guidelines

Please submit your R code as a commented R-script (with R-Studio project file if available)

or R-notebook, the output, graphs and figures (if any), and your interpretation. Providing

alternative ways to calculate the same data in R (there are usually multiple ways of doing

same thing in R), compare, reason why you prefer one over the other is a plus.

Task 1

Use the “Trees” data or another data set that is part of R. Then, use the functions in sections

2.5, 3.5 and 3.6 of “Using R for Data Analysis and Graphics” to describe data numerically

and construct the graphs to describe data graphically. Follow the steps below.

a. Invoke R and use the “Tree” dataset

b. Find the 5 summary numbers in the data

c. Create Histograms and density plots

d. Create Boxplots

e. Normal probability plots

Follow the submission guidelines, see above.

Task 2

According to CDCP, in 2010 the average height of adult women/men in the USA was 63.8

inches, with the standard deviation of 2.7 inches. Calculate the standardized height (zscore)

of a U.S. woman who is 71 inches tall. Feel free to find the data for adult women/men

in the country of your choice and calculate their standardized height. Explain what the

findings mean. Follow the submission guidelines, see above.

Task 3

Using “weight-height.csv” data file (https://www.kaggle.com/mustafaali96/weight-height), plot a

histogram of men’s and women’s height (or weight) distributions and overlay them so you

can see both distributions at the same time. Follow the submission guidelines, see above.

Task 4

Using R, generate z-score data. Using that data, calculate SD. Follow the submission

guidelines, see above.

Report

Follow the submission guidelines, see above combined in a document that has your name,

course information, instructor name, and the assignment number.

版权所有：留学生编程辅导网 2018 All Rights Reserved 联系方式：QQ:99515681 电子信箱：99515681@qq.com

免责声明：本站部分内容从网络整理而来，只供参考！如有版权问题可联系本站删除。