Posts by Benjamin Smith
Author: Benjamin Smith
RObservations #34: Using NLP with keras to understand market sentiment with LSTM networks
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Natural Language Processing (NLP) is a powerful tool in the Machine Learning landscape that can (among other things) allow users to classify sentiment and predict text. Many of recent my blogs have been about data manipulation and data engineering, so I decided change things up to look into showing some applications ... Read More
RObservations #33: Merging Excel Spreadsheets with Base R and openxlsx
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. I was recently asked as part of a larger task to combine multiple sheets from an excel workbook into a into a single sheet. When approached about the problem I immediately was asked if I was going to use VBA to do it. While I know my way around VBA, since ... Read More
RObservations #32: Creating an Instant Answer Oracle with httr and Shiny
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Knowing how to write API requests and handle their responses is a valuable skill that a developer, dataengineer or data analyst/scientist needs to know. In this short blog I share how its possible leverage DuckDuckGo’s instant answer API to create a oracle which can answer (some) of your questions using the ... Read More
RObservations #31: Using the magick and tesseract packages to examine asterisks within the Noam Elimelech
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Since my last blog on Tesseract-OCR I have been playing around casually with it to see what it is possible of doing. Tesseract supports optical character recognition for over 100 languages. That together with straight forward usage for implementing it in R inspired me to try using it for Hebrew text ... Read More
RObservations #30: Fixing R’s “messy string concatenation” with a special function
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Recently I discovered stackshare.io’s stackups which offers comparisons of different programming languages as well as their pros and cons. While looking at the all too classic comparison available between R and Python I noticed that one of the cons listed was: Messy syntax for string concatenation While it is possible to ... Read More
RObservations #29 – Classifying and Filtering Coordinates By Using the sf Library
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Geo-spatial analysis and visualizations is a powerful tool for providing insight bringing an idea or a result in a more tangible manner. Oftentimes, we are only interested in a specific points or we wan to classify the data we have by a larger location it belongs to. In this blog I ... Read More
RObservations #28 Canada’s Political Leadership and Inflation (Another Kaggle Contribution)
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. In my last blog I shared a basic dataset listing the Prime Minister’s of Canada, the start and end of their terms and the political party they associated themselves with during their tenure. In this blog I share my second dataset contribution that compliments this- Canadian inflation rate data. Note: This ... Read More
RObservations #27: Canadian Prime Minister’s Dataset (my “first” Kaggle submission)
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Kaggle has been a platform that has piqued my interest for some time. Based on how I see it, its an open-source driven data science “social-network”. Recently, I have decided check out the platform and contribute to it. In this blog I share and describe the data I collected on Canadian ... Read More
BensstatsTalks#3: 5 Tips for Landing a Data Professional Role
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Disclaimer: This was originally written on my Medium blog here, so the formatting is a little different from my usual style. If you just got started or have been working a while in a data role, the jargon thrown around can sometimes get overwhelming with all the things to need to learn ... Read More
BensstatsTalks #2: Two hard truths you need to know before breaking into break into tech
Feed: R-bloggers. Author: Benjamin Smith. [This article was first published on r – bensstats, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Disclaimer: This was originally written on my Medium blog here, so the formatting is a little different from my usual style. I have been blogging for nearly two years and with it I got to learn a lot about writing good code, challenges that arise (particularly with data science) and how ... Read More
Recent Comments