Posts by Gary Hutson
Author: Gary Hutson
Automating and downloading Google Chrome images with Selenium
Feed: R-bloggers. Author: Gary Hutson.
Fork
Star
Watch
Download I love Nottingham Forest and have been trying to find a way to include them in one of my tutorials, as they are in the play-offs to go into the top flight leagues. This tutorial allows you to download images from Selenium and automate Google Chrome. Can I get a live tutorial? The live tutorial is here: Creating the bones of the project The first stage would be to define the dependencies: Downloading the web driver The next step would be to download the ChromeDriver for your ... Read More
CRAN package development is addictive
Feed: R-bloggers. Author: Gary Hutson. During my R career I have had the opportunity to work on some really exciting projects, and with some really clever dude and dudettes. Through sitting in lectures and webinars and package development tutorials I thought perhaps I should make some of my code, that was sitting there in a dusty private GitHub repo, open and accessible to all. This started me on the journey to develop R packages. OddsPlotty – the first package I created The first package I created was called OddsPlotty and I created this as a way to easily explain the ... Read More
SangerTools – a R package for working with population health datasets has arrived
Feed: R-bloggers. Author: Gary Hutson. Asif Laldin was the main developer of this package, but I contributed a few functions and offered overall package development advice. This is a great effort and is crammed full of functions for working with population health datasets, standardised rates, standard visuals, etc. It contains a smorgasbord of delightful functions to wet any R developers appetite. About the package SangerTools was born out of necessity. It is the formal aggregation of a set of coding solutions employed regularly to carry out Population Health Management Analytics; rapidly and consistently. It has has been created specifically for ... Read More
A package of Machine Learning datasets has arrived for R – MLDataR
Feed: R-bloggers. Author: Gary Hutson. I am proud to announce my seventh package for you lovely R folks. This time it is a collection of datasets aimed at giving people in healthcare, and beyond, some solid examples for using with R. There remains a few excellent packages for this, such as mlbench, but they are limited in the number of datasets you can derive. Using the package To use the package I have compiled a YouTube video to show you how the datasets could be used, with a thorough example in TidyModels of how to implement the thyroid disease dataset ... Read More
TidyModels hands on for the NHS-R community – building a supervised ML classification model from scratch
Feed: R-bloggers. Author: Gary Hutson. The inclusive NHS-R community have welcomed me to complete my webinar hat trick for this year. In this webinar I focus on building a ML model from scratch. The overview of the webinar is: Build your first Machine Learning classification model with tidymodelsUnderstanding data processing for machine learningEvaluate your machine learning models with ROC curves and Confusion MatricesUnderstand the tidy models process for model creationUnderstand sampling methods in machine learning model creationWork with packages such as recipes, yardstick, rsample, tune, parsnip and caretUsing the ConfusionTableR package to flatten confusion matrix outputs for storing in databasesSerialise ... Read More
What a NHS-R Community Conference it was – simply wow!
Feed: R-bloggers. Author: Gary Hutson. The NHS-R Virtual Conference concluded this week and we have had a number of excellent speakers from the Health & Social Care sector, alongside working with key partners of interest. It kicked off with a full day jammed packed of speakers from all over the NHS. The second day we had international speakers from the US, Australia and other European countries. The keynotes this year were provided by Professor Frank Harrell who did an excellent talk on “statistical mistakes to avoid” and Julia Silge @RStudio discussing “Preparing and processing text for Machine Learning”. I was ... Read More
TidyModels or CARET – how they compare?
Feed: R-bloggers. Author: Gary Hutson. People have been telling me not to teach caret anymore, albeit I do teach TidyModels as well. Their argument is that TidyModels is new and shiny and we should be using the most up to date tools. I get this justification, but is caret really dead? My answer is “not by a long shot!“. Hats off to Max Kuhn and the team for rolling out these awesome packages and without Max, modelling in R would not be as easy and fun. How they stack up? This is not really a fair comparison, as CARET is ... Read More
Crash Course in R Model Deployment with Docker and friends
Feed: R-bloggers. Author: Gary Hutson. I have put together a complete guide to model training, docker file creation and then consuming your API in R. This has arisen as part of a workshop the NHS-R community are doing around R in Production: show and tell, but instead of just making it local, I thought I would open up my part of the tutorial to everyone. The process follows this generic process for ML model deployment: The first tutorial focuses on the training component and model building. The second tutorial focuses on Deployment via Docker and the third tutorial is the ... Read More
Roll up, roll up the NHS-R Community Conference 2021 is coming to town
Feed: R-bloggers. Author: Gary Hutson. The conference will be held virtually and will kick-off : Monday 8th of November – Wednesday 10th November Featuring: Main Conference Events. This will include more workshops, lightning talks and plenary sessions. And the week before we will have lots of hands-on workshops: Monday 1st November – Friday 5th November Series of workshops for all R levels, including Introduction to R training, package usage and tips and tricks. Further details and how to register will be released shortly. Key note speakers These two influential R contributors: Julia Silge – co-creator of tidytext and now works ... Read More
Common mistakes we Data Scientists make
Feed: R-bloggers. Author: Gary Hutson. DISCLAIMER I am a data scientist and have made all these mistakes, but I have had the privilege of sitting on the managerial, project lead and developer side of the fence, and here are some tips to getting your stakeholders (i.e. anyone involved in the project team or has an interest in the success of the project) on board and delivering a successful data science project. #1 Technical Language is a no no Often when I have managed data scientists, and have been a data scientist, I have seen statisticians being too technical about their ... Read More
Recent Comments