Data science enhances people’s decision making. Doctors and researchers make critical decisions every day, so a basic knowledge of data science is essential for them. This series aims to help people who work in or around the medical field improve their data science skills. This is the third part of the […]

# probability distributions

## Generating Data Exercises

Let’s make data. R is good at making simulated data sets, which are useful for learning programming. Instead of spending all your time cleaning up your data, you have data ready to use for learning how to program. The data that will be generated here will be about cats, toys and […]

## Creating Sample Datasets – Exercises

Creating sample data is a common task performed in many different scenarios. R has several base functions that make the sampling process quite easy and fast. Below is an explanation of the main functions used in the current set of exercises: 1. set.seed() – Although R executes a random mechanism of sample creation, set.seed() function […]
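As a rough illustration of the reproducibility idea mentioned above, here is a minimal sketch using base R only. The seed value 42, the population `1:20`, and the variable names are arbitrary choices made for this example, not something taken from the exercise set.

```r
# Fixing the seed makes the pseudo-random draws repeatable.
set.seed(42)
draw_a <- sample(1:20, size = 5)   # 5 values drawn without replacement

# Resetting the same seed reproduces exactly the same draw.
set.seed(42)
draw_b <- sample(1:20, size = 5)

same_draw <- identical(draw_a, draw_b)
print(same_draw)

# Sampling with replacement, e.g. ten simulated coin flips.
coin <- sample(c("H", "T"), size = 10, replace = TRUE)
print(coin)
```

Without the second `set.seed()` call, `draw_b` would almost certainly differ from `draw_a`.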

## Combinations Exercises

When doing data analysis, we often have a set of values and want to obtain the various possible combinations of them. For example, consider taking 5 random samples from a dataset of 20. How many possible 5-sample sets are there, and how can we obtain all of them? R has a bunch of functions that […]
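The question in the example above can be answered with two base R functions, `choose()` and `combn()`. This is only a sketch: it assumes the 20 observations are labelled 1 to 20, and the variable names are made up for illustration.

```r
# Number of distinct 5-element subsets of 20 items: 20! / (5! * 15!)
n_sets <- choose(20, 5)
print(n_sets)  # 15504

# Enumerate all of them; each column of the result is one 5-sample set.
all_sets <- combn(20, 5)
print(dim(all_sets))  # 5 rows, 15504 columns

# The first combination in the enumeration order used by combn().
first_set <- all_sets[, 1]
print(first_set)  # 1 2 3 4 5
```

Enumerating every combination grows quickly with the size of the dataset, so `choose()` is a cheap way to check the count before calling `combn()`.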

## Let’s Begin with something sample

The following R-exercises constitute the first set in a series of posts aimed at reviewing fundamental probability and (bio)statistics concepts while learning to use R. Today we will focus on generating random numbers from some of the built-in distributions in R as well as using the sample() function to obtain random samples from a given […]
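To give a flavour of what such exercises involve, the sketch below draws from three of R's built-in distributions and then takes a random sample from the result. All parameter values (a mean of 170, a standard deviation of 10, and so on) and the variable names are hypothetical, chosen only for illustration.

```r
set.seed(1)  # arbitrary seed so the example is repeatable

# 100 draws from a normal distribution (hypothetical heights in cm).
heights <- rnorm(100, mean = 170, sd = 10)

# 5 draws from a uniform distribution on [0, 1].
u <- runif(5)

# 10 draws from a binomial distribution: successes in 20 trials, p = 0.3.
successes <- rbinom(10, size = 20, prob = 0.3)

# A random sample of 10 of the simulated heights, without replacement.
subset_heights <- sample(heights, size = 10)

print(summary(heights))
print(successes)
```

The same `r*` naming pattern covers the other base distributions as well, for example `rpois()` and `rexp()`.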