- Home
- Tag: R
Posts tagged R
Tag: R
Microsoft & Hortonworks – Q&A with Hans Weiser & Mark Mason – Hortonworks

Feed: Hortonworks Blog – Hortonworks Author: Louise Matthews Needless to say, partners are a crucial part of the Hadoop community and that’s evidenced by the number of organisations Hortonworks is connected with as part of our PartnerWorks Programme. Back in October 2011, we announced our partnership with Microsoft to collaborate on a Hadoop distribution for Microsoft Azure and Windows Server. Since that announcement, our partnership has gone from strength to strength. Earlier today, I spent some time talking to Hans Wieser who is responsible for Hortonworks’ relationship with Microsoft in Europe, Middle East and Africa and Mark Mason, our director ... Read More
The democratization of big data is a big win for democracy | Google Cloud Big Data and Machine Learning Blog

Feed: Google Cloud Big Data and Machine Learning Blog Author: Google Cloud Big Data and Machine Learning Blog Team Innovation in data processing and machine learning technology Monday, June 20, 2016 Posted by Felipe Hoffa, Developer Advocate “Big data.” “Machine learning.” “Data visualization.” For people outside of industries that use them on a regular basis (and even people within them), technologies like these often seem abstract—as if they only apply to large-scale financial modeling or genomics research. It’s difficult to grasp how they impact the real world. We figured one of the best ways to demonstrate how powerful these technologies ... Read More
Why Apache Beam? A Google Perspective | Google Cloud Big Data and Machine Learning Blog

Feed: Google Cloud Big Data and Machine Learning Blog Author: Google Cloud Big Data and Machine Learning Blog Team Innovation in data processing and machine learning technology Tuesday, May 3, 2016 - Posted by Tyler Akidau, Staff Software Engineer & Apache Beam PPMC When we made the decision (in partnership with data Artisans, Cloudera, Talend, and a few other companies) to move the Google Cloud Dataflow SDK and runners into the Apache Beam incubator project, we did so with the following goal in mind: provide the world with an easy-to-use, but powerful model for data-parallel processing, both streaming and batch, ... Read More
[Data Mining] Association Rules in R (diapers and beer)
Feed: Featured Blog Posts - Data Science Central Author: Gregory Choi [Introduction of Association Rules]Sometimes, the anecdotal story helps you understand the new concept. But, this story is real. About 15 years ago, in Walmart, a sales guy made efforts to boost sales in his store. His idea was simple. He bundled the products together and applied some discounts to the bundled products. (Now, it became common practices in marketing) For example, this guy bundled bread with jam, so that customers easily found them together. Moreover, customers could afford to buy them together as the bundled product was discounted. In ... Read More
Microsoft R Open 3.3.1 now available for Windows, Mac and Linux

Feed: Planet big data Author: David Smith Microsoft R Open 3.3.1, our enhanced disstribution of open source R, is now available for download for Windows, Mac, and Linux. This update upgrades the R langauge engine to version 3.3.1, streamlines the installation process, and bundles some additional packages for parallel programming. R version 3.3.1 fixes a few rarely-encountered bugs, for example to generate Gamma random numbers with zero or infinite rate parameters, and correctly match text that only differed in the encoding. (See here for a complete list of fixes.) There are no user-visible changes in the language itself, which means that ... Read More
Key tools of Big Data for Transformation: Review & Case Study
Feed: Featured Posts - Hadoop360 Author: Andrei Macsin Guest blog post by Syed Danish Ali Review The challenges of big data can be captured succinctly as follows[1],[2]: Volume; ever increasing volume which breaks down traditional data-holding capacity Variety; more and more heterogeneous data from many formats and types are bombarding the data environment Velocity; more and more data is time sensitive now; frequent updates are taking place instead of relying on historical old data and data in real time is being generated now by the internet of things, amongst others. Veracity; how valid and reliable is the data? Since now we ... Read More
Learning from Imbalanced Classes – Silicon Valley Data Science

Feed: Planet big data Author: Meg Blanchette August 25th, 2016 If you’re fresh from a machine learning course, chances are most of the datasets you used were fairly easy. Among other things, when you built classifiers, the example classes were balanced, meaning there were approximately the same number of examples of each class. Instructors usually employ cleaned up datasets so as to concentrate on teaching specific algorithms or techniques without getting distracted by other issues. Usually you’re shown examples like the figure below in two dimensions, with points representing examples and different colors (or shapes) of the points representing the ... Read More
R with Power BI: Import, Transform, Visualize and Share

Feed: Planet big data Author: David Smith Power BI, Microsoft's data visualization and reporting platform, has made great strides in the past year integrating the R language. This Computerworld article describes the recent advances with Power BI and R. In short, you can: Power BI desktop is completely free to download and use, and includes all the features you need to create visualizations, reports and dashboards. (Publishing to Power BI online requires a subscription, though.) Power BI desktop and R are both included in the Data Science Virtual Machine, so that's another easy way to get started. Sharon Laivand from the Power ... Read More
Where to Study Data Science in Scotland?

Feed: Planet big data Author: admin Posted on August 24th, 2016 in To help narrow down the plethora of Data Science and related MSc and BSc courses in Scotland, we have created a simple web application that details what each course has to offer. What is it? Our web application allows you to search for a specific degree by the modules and programming languages that are taught. It allows you to highlight the courses that you think look particularly interesting. Information about the entry requirements and course details such as duration and part time study are also given. Furthermore, ... Read More
This Week in Data Science (August 23, 2016)

Feed: Planet big data Author: cora Posted on August 23, 2016 by cora Here’s this week’s news in Data Science and Big Data. Don’t forget to subscribe if you find this useful! Interesting Data Science Articles and News This is What It Takes To Connect a Volcano to the Internet – Explorers, volcanologists, GE, and the Nicaraguan government have banded together to bring the Masaya volcano in Nicaragua into the 21st century, by installing sensors that can compile huge amounts of data. Uber’s First Self-Driving Fleet Arrives in Pittsburgh This Month – Starting later this month, Uber will allow customers ... Read More
Recent Comments