Category: AWS
YCSB 0.10.0 Now in Cloudera Labs
Feed: Cloudera Engineering Blog » Cloudera Labs. Author: Cy Jervis. Since the last blog post announcing the release of YCSB 0.6.0 in Cloudera Labs, users of Cloudera CDH and EDH will have noticed regular updates to the Labs version, keeping it in lockstep with the upstream release. This should help assure users of a consistent and easy mechanism to deploy the current version of YCSB (which at the moment is v.0.10.0 in CLABS) to evaluate the performance of the NoSQL stores employed within their clusters such as HBase, Kudu and Accumulo. New Bindings There have been several updates and improvements ... Read More
VPC Subnet Zoning Patterns for SAP on AWS, Part 1: Internal-Only Access

Feed: AWS for SAP. Author: Somckit Khemmanivanh. This post is by Harpreet Singh and Derek Ewell, Solutions Architects at Amazon Web Services (AWS).SAP landscapes that need to reside within the corporate firewall are comparatively easy to architect, but that’s not the case for SAP applications that need to be accessed both internally and externally. In these scenarios, there is often confusion regarding which components are required and where they should be placed. In this series of blog posts, we’ll introduce Amazon Virtual Private Cloud (Amazon VPC) subnet zoning patterns for SAP applications, and demonstrate their use through examples. We’ll show ... Read More
Redis Cloud Private

Feed: Redis Labs. Author: Aviad Abutbul. We are very excited to announce the preview release of the new simplified Redise Cloud Private (RCP) managed DBaaS. Redise Cloud Private delivers a fully managed, cost effective, stable high performance Redis databases in dedicated clusters within your cloud account, using your own instances, inside your VPC, with the option to run Redis databases on RAM or RAM+Flash (Redise Flash) as an extension of memory, using High IOPS NVMe-based SSD instances. While Redise Cloud Private (RCP) is a long standing option for customers running production clusters, there were manual aspects to the setup. Today ... Read More
VPC Subnet Zoning Patterns for SAP on AWS, Part 2: Network Zoning

Feed: AWS for SAP. Author: Somckit Khemmanivanh. This post is by Harpreet Singh and Derek Ewell, Solutions Architects at Amazon Web Services (AWS).In part one of this article series on VPC subnet zoning patterns, we described possible ways in which SAP applications may be accessed, and then discussed Amazon Virtual Private Cloud (Amazon VPC) subnet zoning patterns for internal-only access in detail. In this second article in the series, we’ll discuss how traditional application network zoning can be mapped to AWS.In a traditional on-premises deployment model, applications are segregated into various network zones: Restricted zone: This is the most secure ... Read More
Getting to Know APN Genomics Partners BioTeam, DNAnexus, Illumina, and Seven Bridges

Feed: AWS Partner Network (APN) Blog. Author: Aaron Friedman. Aaron Friedman is a Healthcare and Life Sciences Partner Solutions Architect at AWSThis past week, my colleague, Angel Pizarro, and I published a four-part blog series on the AWS Compute Blog that describes how you can build batch genomics workflows on AWS. This approach can be generalized to any type of batch workflow, such as post-trade analytics or fraud surveillance in financial services, or rendering and transcoding in media and entertainment. You can read these posts here: In healthcare and life sciences, high-throughput workflows end up being only one part of ... Read More
Join Teresa Carlson, Werner Vogels, and More at the AWS Public Sector Summit

Feed: AWS Government, Education, & Nonprofits Blog. Author: publicsector. We are excited to announce our lineup of customer keynote speakers joining us on the main stage for the eighth annual AWS Public Sector Summit June 12-14th at the Washington DC Convention Center. Hear these technology leaders from around the world share their firsthand stories of innovation for the public good and how digital transformation is changing the public sector.John G. Edwards, Chief Information Officer, CIA Craig Fox, Assistant Commissioner, Australian Taxation Office Jeffrey D. Armstrong, President, California Polytechnic State University Melinda Rogers, Chief Information Security Officer, Department of Justice Dr ... Read More
Amazon RDS for PostgreSQL Supports PostgreSQL Minor Versions 9.6.2, 9.5.6, 9.4.11 and 9.3.16

Feed: What's New. Highlights include fixes to prevent data corruption issues in index builds and in certain write-ahead-log replay situations, support for the pg_hint_plan, log_fdw and pg_freespacemap extensions, and many bug fixes and improvements. You can create a new Amazon RDS for PostgreSQL 9.6.2 database instance with just a few clicks in the AWS Management Console, or upgrade an existing PostgreSQL 9.5 database instance using a point-and-click upgrade. Upgrading from version 9.3 and 9.4 requires you to perform a point-and-click upgrade to the next major version, reaching version 9.5 before upgrading to 9.6.2. Each upgrade operation involves a short period ... Read More
Seven Tips for Using S3DistCp on Amazon EMR to Move Data Efficiently Between HDFS and Amazon S3
Feed: AWS Big Data Blog. Have you ever needed to move a large amount of data between Amazon S3 and Hadoop Distributed File System (HDFS) but found that the data set was too large for a simple copy operation? EMR can help you with this. In addition to processing and analyzing petabytes of data, EMR can move large amounts of data.In the Hadoop ecosystem, DistCp is often used to move data. DistCp provides a distributed copy capability built on top of a MapReduce framework. S3DistCp is an extension to DistCp that is optimized to work with S3 and that adds several ... Read More
Top Data Analytics Trends for Retailers of 2017
Feed: Featured Blog Posts - Data Science Central. Author: Robert Morris. It’s not easy for a retailer to face ongoing economic challenges. The power of the customers is rising. They have the right to choose the best, and they are not happy with anything less. The competition in every single industry is brutal. We’re not exaggerating when we say that every business battles for survival. In this war of competitors, data analytics are the most effective weapon. In February 2017, JDA Software Group and PwC (PricewaterhouseCoopers) released the results of a survey of retail CEOs. 69% of CEOs said they planned ... Read More
Progress Report: Hive-on-Spark Nears Production Readiness
Feed: Cloudera Engineering Blog » Cloudera Labs. Author: Justin Kestelyn. Contributors from Intel, Cloudera, and the rest of the community have been making strong progress on the Hive-on-Spark initiative. This post provides an update. [Editor’s note (April 20, 2016): Hive-on-Spark is now GA/shipping starting in CDH 5.7.] Since its inception about one year ago, the community initiative to make Apache Spark a data processing engine for Apache Hive (HIVE-7292) has attracted widespread interest from developers around the world and gone through phases of rapid development, testing, and early deployment. (For example, based on downloads data and user questions about the ... Read More
Recent Comments