- Home
- Tag: S3
Posts tagged S3
See Amazon Simple Storage Service (Amazon S3).
Tag: S3
Snowflake Computing Makes its Data Warehousing Platform More Accessible to Users

Feed: Database Trends and Applications : All Articles. Snowflake Computing is making its platform more accessible to users with Snowflake On Demand—a sign-up process for data users to get immediate insight from Snowflake’s data warehouse.According to the vendor, with a virtual swipe of a credit card on Snowflake’s website, data users can access the only data warehouse built for the cloud. They can store and analyze their data without relying on their own IT group to get up and running quickly.The offer gives customers a secure sign-up process; access to all Snowflake Standard Edition features, including Time Travel, automatic data ... Read More
Paxata Unveils New Platform

Feed: Database Trends and Applications : All Articles. Paxata is releasing Paxata Connect to extend the Paxata Platform with a connectivity framework that creates a nexus to acquire, shape, and publish meaningful data for faster time to value.“The idea behind Paxata Connect is creating a self-service information platform,” said Nenshad Bardoliwalla. “We’re extending the use cases and capabilities our platform provides. Connect is an interesting payoff on that information platform vision.”With Connect, information architects and developers can take advantage of out-of-the-box connectors, build their own repeatable data services and pipelines, and maintain transparency and oversight to ensure data provides a ... Read More
Running sparklyr – RStudio’s R Interface to Spark on Amazon EMR

Feed: AWS Big Data Blog. Tom Zeng is a Solutions Architect for Amazon EMRThe recently released sparklyr package by RStudio has made processing big data in R a lot easier. sparklyr is an R interface to Spark that allows users to use Spark as the backend for dplyr, one of the most popular data manipulation packages. sparklyr provides interfaces to Spark packages and also allows users to query data in Spark using SQL and develop extensions for the full Spark API. Amazon EMR is a popular, hosted big data processing service on AWS that provides the latest version of Spark ... Read More
Streaming Messages from Kafka into Redshift in near Real-Time

Feed: Planet MySQL. Author: Yelp Engineering. This is the sixth post in a series covering Yelp's real-time streaming data infrastructure. Our series explores in-depth how we stream MySQL updates in real-time with an exactly-once guarantee, how we automatically track & migrate schemas, how we process and transform streams, and finally how we connect all of this into datastores like Redshift and Salesforce. Read the posts in the series: The Yelp Data Pipeline gives developers a suite of tools to easily move data around the company. We have outlined three main components of the core Data Pipeline infrastructure so far. First, ... Read More
Holy Easy PostgreSQL deployment

Feed: Planet PostgreSQL. Holy Easy PostgreSQL deployment!In case you missed it, the BigSQL team released an awesome package manager for installing and configuring PostgreSQL and many related, useful components. The package manager can be found here: https://www.bigsql.org/package-manager.jsp. Playfully named pgc, for ‘pretty good command line’, pgc is a utility similar to yum or apt-get that allows you to install, configure, update and manage Postgres related components including foreign data wrappers, stored procedure languages, connectors, devops tools, HA tools and monitoring tools. Common uses: Provision Postgres (9.2 through 9.6, including multiple versions on same server) Installing pgBouncer, Backrest, and other community projects ... Read More
Whiteboard with an SA: AWS Direct Connect

Feed: AWS Government, Education, & Nonprofits Blog. Author: publicsector. In this brief whiteboarding video, learn how to establish a dedicated network connection from your premises to AWS with AWS Direct Connect. Todd Gagorik, AWS Solutions Architect, shows you how you can establish private connectivity between AWS and your datacenter, office, or colocation environment with AWS Direct Connect. In many cases, this can reduce your network costs, increase bandwidth throughput, and provide a more consistent network experience than Internet-based connections.Todd will walk you through how to establish a dedicated network connection between your network and one of the AWS Direct Connect ... Read More
How Eliza Corporation Moved Healthcare Data to the Cloud

Feed: AWS Big Data Blog.
This is a guest post by Laxmikanth Malladi, Chief Architect at NorthBay. NorthBay is an AWS Advanced Consulting Partner and an AWS Big Data Competency Partner
"Pay-for-performance" in healthcare pays providers more to keep the people under their care healthier. This is a departure from fee-for-service where payments are for each service used. Pay-for-performance arrangements provide financial incentives to hospitals, physicians, and other healthcare providers to carry out improvements and achieve optimal outcomes for patients.
Eliza Corporation, a company that focuses on health engagement management, acts on behalf of healthcare organizations such as ... Read More
Orchestrating GPU-Accelerated Workloads on Amazon ECS

Feed: AWS Compute Blog. Author: Chris Barclay. My colleagues Brandon Chavis, Chad Schmutzer, and Pierre Steckmeyer sent a nice guest post that describes how to run GPU workloads on Amazon ECS.— It’s interesting to note that many workloads on Amazon ECS fit into three primary categories that have obvious synergy with containers: PaaS Batch workloads Long-running services While these are the most common workloads, we also see ECS used for a wide variety of applications. One new and interesting class of workload is GPU-accelerated workloads or, more specifically, workloads that need to leverage large amounts of GPUs across many nodes ... Read More
Optimizing Amazon S3 for High Concurrency in Distributed Workloads

Feed: AWS Big Data Blog.
Aaron Friedman is a Healthcare and Life Sciences Solution Architect with Amazon Web Services
The healthcare and life sciences landscape is being transformed rapidly by big data. By intersecting petabytes of genomic data with clinical information, AWS customers and partners are already changing healthcare as we know it.
One of the most important things in any type of data analysis is to represent data in a cost-optimized and performance-efficient manner. Before we can derive insights from the genomes of thousands of individuals, genomic data must first be transformed into a queryable format. This ... Read More
25 Questions and Answers About Hortonworks DataFlow – Hortonworks

Feed: Hortonworks Blog – Hortonworks. Author: Haimo Liu. Last week, we had a jam-packed webinar on Hortonworks DataFlow, with over 700 registrants and so we were unable to get back to everyone to answer their questions. We’ve grouped the questions (and answers) below into the following categories, and if you have more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where an entire community of folks are monitoring and responding to questions. For those who may have missed the session you can check out the on-demand webinar, slideshare and still ... Read More
Recent Comments