- Home
- Tag: Anaconda
Posts tagged Anaconda
Tag: Anaconda
AWK — a Blast from Wrangling Past.
Feed: Featured Blog Posts - Data Science Central. Author: steve miller. I recently came across an interesting account by a practical data scientist on how to munge 25 TB of data. What caught my eye at first was the article's title: "Using AWK and R to parse 25tb". I'm a big R user now and made a living with AWK 30 years ago as a budding data analyst. I also empathized with the author's recountings of his painful but steady education on working with that volume of data: "I didn’t fail a thousand times, I just discovered a thousand ways not ... Read More
Using Python to run jobs in your SAS Grid

Feed: SAS Blogs. Author: Ajmal Farzam. Recent updates to SAS Grid Manager introduced many interesting new features, including the ability to handle more diverse workloads. In this post, we'll take a look at the steps required to get your SAS Grid Manager environment set up accept jobs from outside of traditional SAS clients. We'll demonstrate the process of submitting Python code for execution in the SAS Grid. Preparing your SAS Grid Obviously, we need a SAS Grid Manager (SAS 9.4 Maintenance 6 or later) environment to be installed and configured. Once the grid is deployed, there's not a whole lot ... Read More
Jobs, Unemployment and 45’s Performance.
Feed: Featured Blog Posts - Data Science Central. Author: steve miller. Despite the consuming controversy surrounding his presidency, POTUS 45 has been able to secure solid ratings on the performance of the economy over his so-far 30-month administration. And he certainly isn't bashful about taking credit for the successes, opining loudly and often that his tax cuts and de-regulation initiatives have significantly goosed the economy. 45 has a point, sort of. Job growth has been steady, and the stock market has mostly prospered under his watch, though not as much as he's claimed -- and not as much as it ... Read More
Pros and Cons of Top Data Science Online Courses
Feed: R-bloggers. Author: data technik. [This article was first published on R – data technik, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. There are a variety of data science courses online, but which one is the best? Find out the pros and cons of each! Coursera, EdX, etc These MOOCs have been around for several years now and continue to grow. But are they really the best option for learning online? Pros: ... Read More
The tools you should know for the Machine Learning projects
Feed: Featured Blog Posts - Data Science Central. Author: Damian Widera. I have been frequently asked about the tools for the Machine Learnign projects There are lot of them on the market so in my newest post you will find my view on them. I would like to start my first Machine Learning project. But I do not have tools. What should I do? What are the tools I could use? I will give you some hints and advices based on the toolbox I use. Of course there are more great tools but you should pick the ones you like ... Read More
Using Python and R to Load Relational Database Tables, Part I
Feed: Featured Blog Posts - Data Science Central. Author: steve miller. I enjoy data prep munging for analyses with computational platforms such as R, Python-Pandas, Julia, Apache Spark, and even relational databases. The wrangling cycle provides the opportunity to get a feel for and preliminarily explore data that are to be later analyzed/modeled. A critical task I prefer handling in computation over database is data loading. This is because databases generally demand the table be created before records are inserted, while computational platforms can create/load data structures simultaneously, inferring attribute data types on the fly (these datatypes can also be ... Read More
Grades Aren’t Normal
Feed: R-bloggers. Author: ntguardian. This article is also available in PDF form. A while back someone posted on Reddit about the grading policies of their academic department. Specifically, the department chair made a statement claiming that grades should be Normally distributed with a C average. I responded, claiming that no statistician would ever take the idea that grades follow a Normal distribution seriously. Some asked for context, and I wrote a long response explaining my position. I repeat that argument here, and also give some R code demonstrations showing what curving grades does. A cheap shot would be to say ... Read More
A Comment on Data Science Integrated Development Environments
Feed: R-bloggers. Author: John Mount. A point that differs from our experience struck us in the recent note: A development environment specifically tailored to the data science sector on the level of RStudio, for example, does not (yet) exist. “What’s the Best Statistical Software? A Comparison of R, Python, SAS, SPSS and STATA” Amit Ghosh Actually, Python has a large number of very capable integrated development environments, some of which are specifically tailored for data science. Please read on for a small list of tools, and my recommendations for a specific data science in Python toolchain. Off the top of ... Read More
How to Run PHP 5 Applications with MySQL 8.0 on CentOS 7

Feed: Planet MySQL; Author: Severalnines; Despite the fact that PHP 5 has reached end-of-life, there are still legacy applications built on top of it that need to run in production or test environments. If you are installing PHP packages via operating system repository, there is still a chance you will end up with PHP 5 packages, e.g. CentOS 7 operating system. Having said that, there is always a way to make your legacy applications run with the newer database versions, and thus take advantage of new features. In this blog post, we’ll walk you through how we can run PHP 5 ... Read More
CPAT and the Rényi-Type Statistic; End-of-Sample Change Point Detection in R
Feed: R-bloggers. Author: ntguardian. This article is also available in PDF form. I started my first research project as a graduate student when I was only in the MSTAT program at the University of Utah, at the very end of 2015 (or very beginning of 2016; not sure exactly when) with my current advisor, Lajos Horváth. While I am disappointed it took this long, I am glad to say that the project is finished and I am finally published. Our article, entitled “A new class of change point test statstics of Rényi type”, is available online, published by the Journal ... Read More
Recent Comments