- Home
- Tag: statistics
Posts tagged statistics
Tag: statistics
SSMS Encryption Wizard – Enabling Always Encrypted in a Few Easy Steps
Feed: Comments for SQL Server Security Blog. Author: viresh. As we explained in the previous articles, Always Encrypted is a client-side encryption technology – the database system (SQL Server or Azure SQL Database) does not have access to plaintext encryption keys and cannot encrypt or decrypt data protected with Always Encrypted. Consequently, enabling Always Encrypted in a database requires the use of client-side tools to provision Always Encrypted keys and to download encrypted and upload encrypted data back to the database. The new Encrypt Columns wizard in the latest version of SQL Server Management Studio makes it possible to enable Always ... Read More
In case you missed it: March 2017 roundup

Feed: Planet big data. Author: David Smith. In case you missed them, here are some articles from March of particular interest to R users. A tutorial and comparison of the SparkR, sparklyr, rsparkling, and RevoScaleR packages for using R with Spark. An analysis of Scrabble games between AI players. The doAzureParallel package, a backend to "foreach" for parallel computations on Azure-based clusters. The UK government project to automate reporting of official statistics with R. Data science languages R and Python rank highly in the latest Redmonk popularity rankings. FiveThirtyEight used R to find clusters of similar subreddits. RTVS 1.0, which provides ... Read More
Integrate SparkR and R for Better Data Science Workflow – Hortonworks

Feed: Hortonworks Blog – Hortonworks. Author: Yanbo Liang. R is one of the primary programming languages for data science with more than 10,000 packages. R is an open source software that is widely taught in colleges and universities as part of statistics and computer science curriculum. R uses data frame as the API which makes data manipulation convenient. R has powerful visualization infrastructure, which lets data scientists interpret data efficiently. However, data analysis using R is limited by the amount of memory available on a single machine and further as R is single threaded it is often impractical to use ... Read More
The MySQL 8.0.1 Milestone Release is available

Feed: Planet MySQL. Author: Geir Hoydalsvik. The MySQL Development team is happy to announce our 8.0.1 development milestone release (DMR), now available for download at dev.mysql.com. (8.0.1 adds features to 8.0.0). The source code is available at GitHub. You can find the full list of changes and bug fixes in the 8.0.1 Release Notes. Here are the highlights. Enjoy! Character Sets We are continuing our work on UTF8MB4 support for MySQL 8.0. In 8.0.1 we make UTF8MB4 the default character set, we add case and accent sensitive collations, and we add our first Japanese collation for UTF8MB4. Switch to new ... Read More
The Most Popular Languages for Data Scientists/Engineers

Feed: Planet big data. Author: David Smith. The results of the 2017 StackOverflow Survey of nearly 65,000 developers were published recently, and includes lots of interesting insights about their work, lives and preferences. The results include a cross-tabulation of the most popular languages amongst the "Data Scientist/Engineer" subset, and the results were ... well, surprising: When thinking about data scientists, it certainly makes sense to see SQL, Python and R in this list. (I've only included the top 10 above.) But it's a real surprise to see JavaScript at the top of the list, and the presence of PHP is just ... Read More
The Future of Data Science in One Picture
Feed: Featured Blog Posts - Data Science Central. Author: Vincent Granville. We have published various "one picture" articles about data science topics. Our readers find them very useful, as they convey the same amount of information in one visual, as a long article. The picture below is from the Data Science Field Guide published by Booz Allen Hamilton. You can download the guide here. The picture below is from page 117. Other "One Picture" Articles 15 Amazing Infographics and Other Visual Tutorials R for Big Data in One Picture Python for Big Data in One Picture Data Science Summarized in One ... Read More
PolyBase use cases clarified
Feed: James Serra's Blog. Author: James Serra. I previously talked about PolyBase and its enhancements (see PASS Summit Announcements: PolyBase enhancements). There is some confusion on PolyBase use cases as they are different depending on whether you are using PolyBase with Azure SQL Data Warehouse (SQL DW) or SQL Server 2016, as well as the sources you are using it against. The three main use cases for using PolyBase are: Loading data, federating querying, and aging out data. Here is the support for those three uses cases in SQL DW and SQL Server 2016: PolyBase in: Parallelize Data Load (Blob and ... Read More
Factoring Massive Numbers with Machine Learning Techniques
Feed: Featured Blog Posts - Data Science Central. Author: Vincent Granville. We are interested here in factoring numbers that are a product of two very large primes. Such numbers are used by encryption algorithms such as RSA, and the prime factors represent the keys (public and private) of the encryption code. Here you will also learn how data science techniques are applied to big data, including visualization, to derive insights. This article is good reading for the data scientist in training, who might not necessarily have easy access to interesting data: here the dataset is the set of all real ... Read More
New Features Coming in PostgreSQL 10

Feed: Planet PostgreSQL. The list of new features coming in PostgreSQL 10 is extremely impressive. I've been involved in the PostgreSQL project since the 8.4 release cycle (2008-2009), and I've never seen anything like this. Many people have already blogged about these features elsewhere; my purpose here is just to bring together a list of the features that, in my opinion, are the biggest new things that we can expect to see in PostgreSQL 10. [Disclaimers: (1) Other people may have different opinions. (2) It is not impossible that some patches could be reverted prior to release. (3) The list ... Read More
The Changing Face of Data Science Education

Feed: Database Trends and Applications : All Articles. By now we are all in agreement: The business of data is changing. Business users are more empowered to work with data; IT is becoming less about control and more about enablement. New data science job descriptions—such as the data scientist—are springing up as companies everywhere look for the right people with the right skill sets to squeeze more value from their data. Data itself is getting bigger, hardware more economical, and analytical software more “self-service.” We’ve embraced the paradigm shift from traditional BI to iterative data discovery. It’s a new era.Naturally, ... Read More
Recent Comments