- Home
- Tag: statistics
Posts tagged statistics
Tag: statistics
Factoring Massive Numbers: Machine Learning Approach – Why and How
Feed: Featured Blog Posts - AnalyticBridge. Author: Vincent Granville. We are interested here in factoring numbers that are a product of two very large primes. Such numbers are used by encryption algorithms such as RSA, and the prime factors represent the keys (public and private) of the encryption code. Here you will also learn how data science techniques are applied to big data, including visualization, to derive insights. This article is good reading for the data scientist in training, who might not necessarily have easy access to interesting data: here the dataset is the set of all real numbers -- ... Read More
How to Become a Data Scientist

Feed: BDU Author: Candi Halbert “You can have data without information, but you cannot have information without data” – Daniel Keys Moran The question of how to become a data scientist comes up often and many seem to have the answer. However, before you take their advice make sure that you’re speaking the same language, because there’s Paris, France and there’s Paris, Texas. Both lovely, of course, but what a surprise if you meant one and got directions to the other. Likewise, there are multiple interpretations of what a data scientist is and does. So, let’s begin with some definitions ... Read More
My First Steps with MariaDB 10.2 and RocksDB Storage Engine

Feed: Planet MySQL. Author: Valeriy Kravchuk. Last year I started to explore MyRocks, that is, RocksDB used as a storage engine with MySQL. So far I had to use Facebook's MySQL 5.6 to do this. I could try to use some specific development branches of MariaDB (or maybe even Percona Server) for this, but I preferred to wait until the engine is included into a main branch by the company I work for. Recently this happened, and now you can get RocksDB loaded and working in main MariaDB 10.2 branch. In this blog post I am going to explain how ... Read More
Learning Scrabble strategy from robots, using R

Feed: Planet big data. Author: David Smith. While you might think of Scrabble as that game you play with your grandparents on a rainy Sunday, some people take it very seriously. There's an international competition devoted to Scrabble, and no end of guides and strategies for competitive play. James Curley, a psychology professor at Columbia University, has used an interesting method to collect data about what plays are most effective in Scrabble: by having robots play against each other, thousands of times. The data were generated with a Visual Basic script that automated two AI players completing a game in Quackle. Quackle ... Read More
Basics of MySQL Administration and best practices

Feed: Planet MySQL. Author: Lalit. Following are the few best practices and basic commands for MySQL Administration. MySQL Access and credential security shell> mysql -u testuser -pMyP@ss0rd mysql: [Warning] Using a password on the command line interface can be insecure. By looking at OS cmd’s history using history cmd other os users can see/get MySQL user password easily. It always good to not use a password on the command line interface. Another option for securing password while automating MySQL scripts is a use of mysql_config_editor. For more info on this check out my blog post about credential security. Consider of having following implementation for Strong ... Read More
Weekly Digest, April 3
Feed: Featured Blog Posts - Data Science Central. Author: Vincent Granville. Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. Announcement Register today for the Wharton Customer Analytics Conference - May 10-11, 2017 in Philadelphia, PA. This conference is dedicated to real-world applications that exemplify a balance of high-level rigor and business know-how, as well as elevating the role of analytics in an organization’s strategic decision-making. Join us for technical workshops and on May 10, and a day filled with practical examples of customer ... Read More
UK government using R to modernize reporting of official statistics

Feed: Planet big data. Author: David Smith. Like all governments, the UK government is responsible for producing reports of official statistics on an ongoing basis. That process has traditionally been a highly manual one: extract data from government systems, load it into a mainframe statistical analysis tool and run models and forecasts, extract the results to a spreadsheet to prepare data for presentation, and ultimately combine it all in a manual document editing tool to produce the final report. The process in the UK looks much like this today: Matt Upson, a Data Scientist at the UK Government Digital Service, is ... Read More
Finding “Gems” in Big Data
Feed: Featured Blog Posts - Data Science Central. Author: Peter Bruce. In 1945, Count ,Richard Taaffe* a Dublin gem collector, was sorting through a set of spinel gems that he had bought, and found one that refracted light differently - instead of simply bending light rays, it split them into two rays (“double refraction”). The anomalous gem was named after him and earned a place on the “world’s rarest gems” list. In analytics, it sometimes not the rule (i.e. the model) that is of interest, but rather the exception. Detecting anomalous cases in large datasets is critical in conducting surveillance, ... Read More
Sfera Provides Professional Internet Services With the Help of DataStax Enterprise

Feed: Blog Post – Corporate – DataStax. Author: Diego Ferreira. This post is one in a series of quick-hit interviews with companies using DataStax Enterprise (DSE) for key parts of their business. In this interview, we talked with Enrico Gaspani, Software Engineer at Sfera. DataStax: Hello Enrico, thanks a lot for your time today. Could you please tell us a bit about Sfera and Surfree? What is your role there? Sfera: Of course. Sfera is an Italian integrated service provider. Surfree is an advanced Wifi network data analysis software for cities, malls and public places, and I’m the Software Development ... Read More
DBMS 2 : DataBase Management System Services

Feed: Planet big data. Author: Curt Monash. March 26, 2017 A huge fraction of analytics is about monitoring. People rarely want to frame things in those terms; evidently they think “monitoring” sounds boring or uncool. One cost of that silence is that it’s hard to get good discussions going about how monitoring should be done. But I’m going to try anyway, yet again. Business intelligence is largely about monitoring, and the same was true of predecessor technologies such as green paper reports or even pre-computer techniques. Two of the top uses of reporting technology can be squarely described as monitoring, ... Read More
Recent Comments