Category: Cloudera
Partnerships that Enrich Solutions: a Spotlight Interview with Dell Enterprise Germany’s General Manager, Benjamin Krebs
Feed: Cloudera Blog. Author: Cloudera. Posted in Business | September 22, 2021 4 min read During this Partner Perspective interview, Cloudera’s Alvin Heib seizes the opportunity to speak with Benjamin Krebs, General Manager of Technology Enterprise in Germany. The pair discuss Benjamin’s role at Dell, the importance of partnerships in his region, how the pandemic has altered Dell’s working landscape and finally, some predictions Benjamin has on Dell’s future. Benjamin has eleven years of experience working for Dell and is responsible for taking care of all top two-hundred customers’ revenue. The sector he manages, Technology Enterprise, generates approximately $1 billion ... Read More
Supercharge your Airflow Pipelines with the Cloudera Provider Package

Feed: Cloudera Blog. Author: Philippe Lanoe. Posted in Technical | September 21, 2021 5 min read Many customers looking at modernizing their pipeline orchestration have turned to Apache Airflow, a flexible and scalable workflow manager for data engineers. With 100s of open source operators, Airflow makes it easy to deploy pipelines in the cloud and interact with a multitude of services on premise, in the cloud, and across cloud providers for a true hybrid architecture. Apache Airflow providers are a set of packages allowing services to define operators in their Directed Acyclic Graphs (DAGs) to access external systems. A provider ... Read More
Apache Kafka Deployments and Systems Reliability – Part 1

Feed: Cloudera Blog. Author: Joseph Niemiec. Posted in Technical | September 20, 2021 9 min read There are many ways that Apache Kafka has been deployed in the field. In our Kafka Summit 2021 presentation, we took a brief overview of many different configurations that have been observed to date. In this blog series, we will discuss each of these deployments and the deployment choices made along with how they impact reliability. In Part 1, the discussion is related to: Serial and Parallel Systems Reliability as a concept, Kafka Clusters with and without Co-Located Apache Zookeeper, and Kafka Clusters deployed ... Read More
Living on the Edge: How to Accelerate Your Business with Real-time Analytics
Feed: Cloudera Blog. Author: Cloudera Contributors. Posted in Business | September 15, 2021 3 min read Leveraging the Internet of Things (IoT) allows you to improve processes and take your business in new directions. But it requires you to live on the edge. That’s where you find the ability to empower IoT devices to respond to events in real time by capturing and analyzing the relevant data. Edge computing relies on squeezing the power and functionality of a data center into a micro site as close to data sources as possible to enable real-time tasks. Whether the task involves self-driving ... Read More
Meet Sudhir Menon, Ram Venkatesh and Paul Codding – Champions of the Cloudera Hybrid Data Cloud
Feed: Cloudera Blog. Author: Rob Bearden. Posted in Business | September 14, 2021 3 min read In June, we announced the beginning of a new chapter for Cloudera, with a mission to make data and analytics easy and accessible, for everyone. With transformation comes change, and today I’m thrilled to announce the promotion of Sudhir “Suds” Menon, Ram Venkatesh and Paul Codding, three leaders driving our mission forward. The foundation of our mission is a move to a hybrid data cloud platform, an evolution of our Cloudera Data Platform, a hybrid and multi-cloud solution purpose-built with the power and flexibility ... Read More
Operating Apache Kafka with Cruise Control
Feed: Cloudera Blog. Author: Viktor Somogyi-Vass. Posted in Technical | September 13, 2021 19 min read About Cruise Control There are two big gaps in the Apache Kafka project when we think of operating a cluster. The first is monitoring the cluster efficiently and the second is managing failures and changes in the cluster. There are no solutions for these inside the Kafka project but there are many good 3rd party tools for both problems. Cruise Control is one of the earliest open source tools to provide a solution for the failure management problem but lately for the monitoring problem ... Read More
Cloudera and NVIDIA Help IRS Fight Fraud, Safeguard Taxpayers
Feed: Cloudera Blog. Author: Nasheb Ismaily. Posted in Business | September 10, 2021 3 min read Across the federal government, agencies are struggling to identify, organize, analyze, and act on troves of data. It’s a problem that leaders are working actively to tackle, but they’re in a race against immeasurable volumes of data that is continuously being generated in perpetuity in stores known and unknown. At the Internal Revenue Service, decades’ worth of data exceeds even the most cutting-edge processing capabilities. By more effectively leveraging its petabytes of current and historical data, the IRS is working to stave off costly ... Read More
Enabling Multi-User Fine-Grained Access Control for Cloud Storage in CDP
Feed: Cloudera Blog. Author: Jonathan Hsieh. Posted in Technical | September 10, 2021 4 min read Shared Data Experience (SDX) on Cloudera Data Platform (CDP) enables centralized data access control and audit for workloads in the Enterprise Data Cloud. The public cloud (CDP-PC) editions default to using cloud storage (S3 for AWS, ADLS-gen2 for Azure). This introduces new challenges around managing data access across teams and individual users. To solve these challenges for S3 and ADLS-gen2, Cloudera has introduced a new service — the Ranger Authorization Service (RAZ). CDP-PC provides the same fine-grained access control as on-prem for data warehouse ... Read More
Value Proposition of the Cloudera Operational Database over Legacy Apache HBase Deployments

Feed: Cloudera Blog. Author: Andreas Skouloudis. Posted in Business | September 09, 2021 12 min read The CDP Operational Database (COD) builds on the foundation of existing operational database capabilities that were available with Apache HBase and/or Apache Phoenix in legacy CDH and HDP deployments. Within the context of a broader data and analytics platform implemented in the Cloudera Data Platform (CDP), COD will function as highly scalable relational and non-relational transactional database allowing users to leverage big data in operational applications as well as the backbone of the analytical ecosystem, being leveraged by other CDP experiences (e.g., Cloudera Machine ... Read More
Supporting Transformation with an Integrated Data Platform. Three Common Questions Answered.

Feed: Cloudera Blog. Author: Daniel Hand. Posted in Technical | September 08, 2021 4 min read In recent years there has been increased interest in how to safely and efficiently extend enterprise data platforms and workloads into the cloud. CDOs are under increasing pressure to reduce costs by moving data and workloads to the cloud, similar to what has happened with business applications during the last decade. Our upcoming webinar is centered on how an integrated data platform supports the data strategy and goals of becoming a data-driven company. Before that, companies should think about whether the right foundations for ... Read More
Recent Comments