#### COVID-19 in Belgium

A classic epidemiological model: the SIR model. Before diving into the real-life application, we first introduce the model that will be used. There are many epidemiological models, but we will use one of the simplest, the SIR model. Tim Churches' explanation of this model and how to fit it using R is so clear that I reproduce it here with a few minor changes. The basic idea behind the SIR model (Susceptible – Infectious – Recovered) of communicable disease outbreaks is that there are three groups (also called compartments) of people: those ...
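As a rough illustration of the three compartments, here is a minimal sketch of the standard textbook SIR equations, integrated with a simple fixed-step Euler scheme in base R. The excerpt above stops before giving any equations, so this is not necessarily the formulation or fitting method used in the article. The function name `simulate_sir` and all parameter values (`N`, `I0`, `beta`, `gamma`) are hypothetical and chosen only for illustration.

```r
# Minimal SIR sketch (base R only, fixed-step Euler integration).
# Assumed standard equations:
#   dS/dt = -beta * S * I / N
#   dI/dt =  beta * S * I / N - gamma * I
#   dR/dt =  gamma * I
# All parameter values used below are hypothetical.
simulate_sir <- function(N, I0, beta, gamma, days, dt = 0.1) {
  steps <- round(days / dt)
  S <- numeric(steps + 1)
  I <- numeric(steps + 1)
  R <- numeric(steps + 1)
  S[1] <- N - I0   # everyone except the initial cases is susceptible
  I[1] <- I0
  R[1] <- 0
  for (k in seq_len(steps)) {
    new_infections <- beta * S[k] * I[k] / N * dt
    new_recoveries <- gamma * I[k] * dt
    S[k + 1] <- S[k] - new_infections
    I[k + 1] <- I[k] + new_infections - new_recoveries
    R[k + 1] <- R[k] + new_recoveries
  }
  data.frame(time = (0:steps) * dt, S = S, I = I, R = R)
}

# Hypothetical outbreak: 1000 people, 1 initial case
sir <- simulate_sir(N = 1000, I0 = 1, beta = 0.5, gamma = 0.2, days = 100)
head(sir)
```

The three columns always sum to `N`, since people only move from one compartment to the next. For serious work, a dedicated ODE solver would be a better choice than this hand-rolled loop.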

#### How to create a simple Coronavirus dashboard specific to your country in R

Coronavirus dashboard: the case of Belgium. The novel COVID-19 coronavirus is the hottest topic right now. Every day, the media and newspapers share the number of new cases and deaths in several countries, try to measure the impact of the virus on citizens and remind us ...

#### How to do a t-test or ANOVA for many variables at once in R and communicate the results in a better way

T-test. Although it was working quite well and was applicable to different projects with only minor changes, I was still unsatisfied with another point. Someone who is proficient in statistics and R can read and interpret the output of a t-test without any difficulty. However, as you may have noticed with your own statistical projects, most people do not know what to look for in the results and are sometimes a bit confused when they see so many graphs, so much code and output, and so many results and numeric values in a document. They are quite easily overwhelmed ...

#### Top 5 R resources on COVID-19 Coronavirus

The coronavirus is a serious concern around the globe. As it spreads, more and more online resources about it are appearing. This article presents a selection of the best R resources on the COVID-19 virus. This list is by no means exhaustive ...

#### How to perform a one sample t-test by hand and in R: test on one mean

After writing an article on the Student's t-test for two samples (independent and paired samples), I believe it is time to explain in detail how to perform one-sample t-tests by hand and in R. The one-sample t-test is an important part ...

#### The 9 concepts and formulas in probability that every data scientist should know

Probability is the likelihood of an event occurring; it is a mathematical model to describe random phenomena. In other words, probability is a branch of mathematics that provides models to describe random processes. These mathematical tools make it possible to establish theoretical models for ...

#### Student’s t-test in R and by hand: how to compare two groups under different scenarios

One of the most important tests within the branch of inferential statistics is the Student's t-test. The Student's t-test for two samples is used to test whether two groups (two populations) are different in terms of a quantitative variable, based on the comparison of two samples drawn from these two groups. In other words, a Student's t-test for two samples makes it possible to determine whether the two populations from which your two samples are drawn are different (with the two samples being measured on a quantitative continuous variable). The reasoning behind this statistical ...
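To make the comparison of two groups concrete, here is a small sketch of one scenario only: two independent samples with variances assumed equal. It runs base R's `t.test()` with `var.equal = TRUE` and recomputes the same statistic by hand from the pooled variance. The data in `group_a` and `group_b` are made up for illustration and do not come from the article.

```r
# Hypothetical measurements of one quantitative variable in two groups
group_a <- c(5.1, 4.9, 6.2, 5.8, 6.0)
group_b <- c(6.8, 7.1, 6.5, 7.4, 6.9)

# Student's t-test for two independent samples, assuming equal variances
# (without var.equal = TRUE, t.test() runs Welch's test instead)
res <- t.test(group_a, group_b, var.equal = TRUE)

# The same statistic computed by hand from the pooled variance
n_a <- length(group_a)
n_b <- length(group_b)
pooled_var <- ((n_a - 1) * var(group_a) + (n_b - 1) * var(group_b)) /
  (n_a + n_b - 2)
t_by_hand <- (mean(group_a) - mean(group_b)) /
  sqrt(pooled_var * (1 / n_a + 1 / n_b))

print(res$statistic)  # t statistic reported by t.test()
print(t_by_hand)      # should match the value above
print(res$p.value)    # two-sided p-value
```

Paired samples or unequal variances call for different arguments (`paired = TRUE`, or the default Welch correction), so the choice of scenario matters before interpreting the p-value.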

#### Correlogram in R: how to highlight the most correlated variables in a dataset

Correlation, often computed as part of descriptive statistics, is a statistical tool used to study the relationship between two variables, that is, whether and how strongly pairs of variables are associated. Correlations are measured between only 2 variables at a time. Therefore, ...

#### Getting started in R markdown

Code chunks. Below the YAML header, there is a first code chunk which is used for the setup options of your entire document. It is best to leave it as is for now; we can change it later if needed. Code chunks in R Markdown documents are used to write R code. Every time you want to include R code, you will need to enclose it with three backticks. For instance, to compute the mean of the values 1, 7 and 11, we first need to insert an R code ...
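In an R Markdown file, a chunk opens with three backticks followed by `{r}` and closes with three backticks. As a sketch, the R code placed inside such a chunk for the values mentioned above could look like this (the variable names `values` and `mean_value` are illustrative):

```r
# Body of an R chunk: mean of the values 1, 7 and 11
values <- c(1, 7, 11)
mean_value <- mean(values)
print(mean_value)
```

When the document is knitted, both the code and its printed result appear in the output by default.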

#### The complete guide to clustering analysis: k-means and hierarchical clustering by hand and in R

Perform the k-means algorithm by hand for the points shown in the graph below, with k = 2 and with the points i = 5 and i = 6 as initial centers. Compute the quality of the partition you just found and then check your answers in R. Assume that the variables have the same units so there is no need to scale the data. Solution by hand. Step 1. Here are the coordinates of the 6 points:

| point | x | y |
|-------|---|---|
| 1     | 7 | 3 |
| 2     | 4 | 5 |
| 3     | 2 | 4 |
| 4     | 0 | 1 |

...

