In this set of exercises, we are going to explore some applications of probability functions and how to plot some density functions. The package
MASS will be used in this set.
Note: We are going to use random numbers functions and random processes functions in R such as
runif. A problem with these functions is that every time you run them you will obtain a different value. To make your results reproducible, you can specify the value of the seed using
set.seed(‘any number’) before calling a random function. (If you are not familiar with seeds, think of them as the tracking number of your random number process.) For this set of exercises, we will use
set.seed(1). Don’t forget to specify it before every exercise that includes random numbers.
Answers to the exercises are available here. If you obtained a different (correct) answer than those listed on the solutions page, please feel free to post your answer as a comment on that page.
Generating dice rolls Using the functions
round, simulate the results of 100 dice rolls.
Let’s assume that we want to simulate a game in which we throw an unfair coin (success probability is 0.48) 10 times and you win $10 every time the result is tails and lose $10 when the result is heads. Simulate this game1000 time using
rbinom, and find the expected amount of money you will gain or lose in this game using the simulated values.
Simulate an experiment of throwing one dice 30 times using the function
rmultinom, and find out how many 6’s are in the simulated sample.
Obtain a vector that shows how many 1’s, 2’s,….6’s were obtained in the previous simulation.
Simulate normal distribution values. Imagine a population in which the average height is 1.70 m with a standard deviation of 0.1. Use
rnorm to simulate the height of 1000 people and save it in an object called
a) Plot the density of the simulated values.
b) Generate 10000 values with the same parameters and plot the respective density function on top of the previous plot in red to differentiate it.
This plot will show you how much a sample with 10000 simulations approximate to the real normal distribution.
Find the 90% interval of a population with mean = 1.70 and standard deviation = .1
Simulate 100000 people with height (cm) and weight (kg) using the function
mu = c(170, 60) and
Sigma = matrix(c(10,17,17,100), nrow = 2), and save it in an object called
Apply the function
population to get an idea of the values created.
Plotting bivariate distribution. Use the function
kde2d to generate a two-dimensional kernel density of the matrix
population and plot the values using
Simulating with a Bayesian approach. Unlike the frequentist statistics approach, Bayesian statistics assume the parameters of a distribution are a random variable with its own distribution. Let’s simulate a poisson variable.
a) Simulate a gamma variable with shape = 20 and scale = 0.5
b) Simulate using the previous value a poisson random variable
Simulating one variable doesn’t make sense if you want to know the properties of a certain distribution. Repeat the previous simulation but create 100 poisson variables and plot the distribution.
Leave a Reply