This exercise is the last in the series on basic analysis of hydrological data. We will use the precipitation dataset from the previous exercise here and additional flow data at Saugeen here. This time, we will explore seasonal analysis and visualization, including creating a hydrograph, a hydrograph with baseflow, an additional plot, and flow duration […]

## Introduction to Statistical Testing and Sampling Exercises (Part 2)

This is part 2 in a series on statistical theory using R. For part 1, go here. This tutorial covers MLE calculations and bootstrapping. Answers to the exercises are available here. Exercise 1: Set the seed to 123 and create the following data frame: lifespans = data.frame(index = 1:200, lifespans = rgamma(200, shape = 2, […]

## Spatial Data Analysis: Introduction to Raster Processing (Part 2)

Background: In the second part of this tutorial series on spatial data analysis using the raster package, we will explore new functionalities, namely raster algebra, cropping, and reprojection and resampling. We will also introduce a new type of object named RasterStack, which, in essence, is a collection of RasterLayer objects with the same spatial extent, […]

## Bonus: Using geo mapping with leaflet

## Easy Web Scraping With Rvest: Exercises

The Internet is full of interesting data; there’s no doubt about it. Some sites, such as Twitter, provide users with systematic access (an API), around which some neat R packages have been built. In this exercise set, we practice much more general techniques of extracting/scraping data from the web directly, using the rvest package. Note […]

## TensorFlow – Basics Part 1: Exercises

TensorFlow is an open-source software library for numerical computation using data flow graphs. Nodes in the graph are called ops (short for operations), while the graph edges represent the multidimensional data arrays (tensors) communicated between them. An op takes zero or more Tensors, performs some computation, and produces zero or more Tensors. In […]

## Introduction to Statistical Testing and Sampling: Exercises (Part 1)

For a majority of users, the primary use of R is for statistical testing and analysis. At the heart of this, within the frequentist world, lies hypothesis testing and distribution sampling. The skill in conducting this sort of work is being able to identify an appropriate distribution on which to model the question and test […]

## Predicting Housing Prices with Linear Regression Exercises

Regression techniques are a crucial skill in any data scientist's or statistician's toolkit. For those unfamiliar with regression modeling, a simple linear model is a nice way to get introduced to the topic. A linear model is an explanation of how a continuous response variable behaves, dependent […]