- Home
- Tag: university
Posts tagged university
Tag: university
BI and Data Science: The Best of Both Worlds
Feed: R-bloggers. Author: Lou Bajuk. [This article was first published on RStudio Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. Photo by Stillness InMotion on Unsplash In previous posts, we’ve talked about the critical importance of interoperability, and how it helps organizations and data science teams get the most out of their analytic investments. We’ve focused recently on the ways that R and Python can be used together, and how RStudio’s products ... Read More
A Plethora of Machine Learning Articles: Part 2
Feed: Featured Blog Posts - Data Science Central. Author: Vincent Granville. Source: see here Part 1 of this short series focused on the business analytics / BI / operational research aspects, see here. In this Part 2, you will find the most interesting machine learning and statistics articles that I wrote in the last few years, focusing on core technical aspects. The whole series will feature articles related to the following aspects of machine learning: Mathematics, simulations, benchmarking algorithms based on synthetic data (in short, experimental data science) Opinions, for instance about the value of a PhD in our field, or ... Read More
cary huang: How PostgreSQL Handles Sub Transaction Visibility In Streaming Replication

Feed: Planet PostgreSQL. 1. Introduction As an experienced PostgreSQL user, you may have a lot of experience in setting up streaming replication in your database clusters to make multiple backups of your data. But have you wondered how the standby is able to correctly determine if a tuple sent from the primary should be visible to the user or not. In the case where a transaction contains multiple subtransactions, how does the standby determine the visibility in this case? You might say… well it is PostgreSQL so it will just work… This is true. If you are someone who is ... Read More
Using AWS for on-premises WordPress site continuity

Feed: AWS Public Sector Blog. Author: Farhad Jahangirov. Applications running on LAMP (Linux, Apache, MySQL, and PHP) stack are ubiquitous—WordPress alone represents 38% of all content management systems. Other popular CMS applications such as Drupal and Joomla also run on LAMP as well as Moodle, a widely used learning management system (LMS). Because of the popularity of these applications, public sector organisations such as educational institutions should protect their business continuity by implementing disaster recovery (DR) solutions: policies, tools, and procedures to help the recovery or continuation of technology infrastructure and systems following a disaster. Amazon Web Services (AWS) Professional ... Read More
Monitor data quality in your data lake using PyDeequ and AWS Glue

Feed: AWS Big Data Blog. In our previous post, we introduced PyDeequ, an open-source Python wrapper over Deequ, which enables you to write unit tests on your data to ensure data quality. The use case we ran through was on static, historical data, but most datasets are dynamic, so how can you quantify how your data is changing and detect anomalous changes over time? At Amazon, we’ve leveraged PyDeequ on AWS Glue to address this problem. AWS Glue is a serverless data integration service that allows you to easily prepare and combine your data for analytics, machine learning (ML), and ... Read More
AWS High Performance Computing (HPC) Competency Partners Are Fueling Fast, Secure, and Cost-Effective Innovation

Feed: AWS Partner Network (APN) Blog. Author: AWS Admin. We are excited to introduce the new AWS High Performance Computing (HPC) Competency featuring AWS Partners with deep technical expertise and proven customer success. These trusted AWS Partners provide technology offerings in areas such as high performance engineering simulation solvers, HPC workload management, high throughput computing, and foundational HPC technology. To help customers run the most compute-intensive workloads across industry verticals, Amazon Web Services (AWS) delivers an integrated suite of services that provide everything needed to quickly and easily build and manage HPC clusters in the cloud. These workloads span the ... Read More
Andreas ‘ads’ Scherbaum: David E. Wheeler
Feed: Planet PostgreSQL. Date: 01.03.2021 Tags: postgresql pgxn sqitch pgtap citext Category: Interviews Interviewed by: Andreas Scherbaum PostgreSQL is the World’s most advanced Open Source Relational Database. The interview series “PostgreSQL Person of the Week” presents the people who make the project what it is today. Read all interviews here. Please tell us about yourself, and where you are from. I’m a long-time open-source hacker in his early fifties. I’ve lived and worked in San Francisco, Portland OR, and now New York City. David E Wheeler How do you spend your free ... Read More
Compiling Book Exercises to pdf | html | Moodle | Blackboard
Feed: R-bloggers. Author: R | msperlin. How it works? All book exercises in the exams format: each exercise is a .Rmd file containing code, exercise text and solution. The files themselves can be found in the installation directory of the package: eoc_dir Each folder will have several exercises. Let’s try one out: eoc_files We can also read one of the files to show the strucuture of the exercise in code and text: readLines(eoc_files[1])
## [1] "```{r datageneration, echo = FALSE, results = "hide"}" ## [2] "my_answers Basically, we define all sections of a question – text, solution, alternatives – using ... Read More
Is the Centralized Data Warehouse Dead?
Feed: Teradata Blog. In the past I’ve heard some criticisms around Teradata’s founding “vision” -- criticisms that have been bolstered by the misperception that Teradata has admitted that its original vision of a “single store of business data” is no longer correct. Allow me to clear a few things up… Teradata started with the vision of “bring data together” so users could ask “any question of any data at any time.” At our heart we started as an analytic software company. We revolutionized a new paradigm of MPP software that was able to scale to (at the time) unfathomable amounts ... Read More
The Time-Series Ecosystem

Feed: Featured Blog Posts - Data Science Central. Author: Luis. The Time-Series ecosystem Time-series analysis has been studied for more than a hundred years, however, the extraordinary growth of data available from numerous sources and more frequent growth of data alongside the growth of computer power (GPU & Multicore) makes the analysis of large-scale time-series data possible today in a way that was not previously practical. The use of time-series data has been traditionally linked to sectors where time is not just a metric but a primary axis, such as in finance, Industrial IoT, and energy. However, in the last ... Read More
Recent Comments