LIS 4273 Adv Statistics: Module 2 Descriptive Statistics

For the second module of LIS 4273 there were 2 sets of data given, each to have their central tendency and variation described using the R Language. The finding of the various statistics is relatively easy, the important aspect though is to understand what is being described. Below is the data that was given and then the results from the R script. The R script that I wrote is available through my GitHub linked below or just click here.

Set 110232425
Set 220121312141215
Table 1: The 2 given data sets
Set 1Set 2
Mean414
Median313
Mode212
Range88
Interquartile Range2.52.5
Variance8.3333338.333333
Standard Deviation2.8867512.886751
Table 2: Descriptive Statistics results after running my R script

What is very interesting about these two data sets is that their measures of variation, range, Interquartile Range, Variance, and Standard Deviation are exactly the same, almost. However the mean, median, and mode differ between the two sets. In this data I have noticed an interesting pattern, means of 4 and 14, medians of 3 and 13, etcetera. The second data set seems to add 10 to every value. It can be seen through this data that the combination of central tendency and variation are important in descriptive statistics.

GitHub Link:

https://github.com/SimonLiles/LIS4273AdvStatistics/blob/master/LIS4273Mod2.R