- Home
- Tag: columnar
Posts tagged columnar
Tag: columnar
Uber’s case for incremental processing on Hadoop

Cubes. (source: Michael Pardo on Flickr). For more on how Uber uses Hadoop and Spark, check out the Strata + Hadoop World NY 2016 session with Praveen Murugesan "Big data processing with Hadoop and Spark, the Uber way." Uber’s mission is to provide “transportation as reliable as running water, everywhere, for everyone.” To fulfill this promise, Uber relies on making data-driven decisions at every level, and most of these decisions can benefit from faster data processing. For example, using data to understand areas for growth or accessing of fresh data by the city operations team to debug each city. Needless ... Read More
Virtual Panel: Current State of NoSQL Databases

Key takeaways
Learn about the current state of NoSQL databases in terms of industry adoption
Read about tools that will help improve the developer productivity when working on NoSQL based applications
Understand multi-model database concept and the pros and cons of multi-model database option versus polyglot persistence approach
How to use NoSQL databases and big data technologies (like Hadoop and Spark) together to solve big data problems
Learn how to integrate NoSQL databases with other emerging trends like microservices and container technologies like Docker.
NoSQL databases have been around for several years now and have become the ... Read More
Accelerating Spark workloads using GPUs

Warp speed. (source: Pixabay). Spark has emerged as the infrastructure of choice for developing in-memory distributed analytics workloads. It provides high-level abstractions in multiple languages (e.g., Java, Scala, and Python) that hide the underlying data and work distribution operations such as data transfer to and from the Hadoop Distributed File System (HDFS) or that maintain resiliency in the presence of system failures. Spark also provides libraries for relational Online Analytical Processing (OLAP) using SQL, machine learning, graph analytics, and streaming workloads. These features enable developers to build complex analytics workflows quickly to support different data sources in various operating environments ... Read More
SQL Polybase to the test
Feed: Henk's tech blog. Author: Henk. Introduction The Microsoft Analytics Platform System (APS) comes with a very powerful feature that’s called Polybase. Polybase has been introduced over 2.5 years ago and extended ever since to integrate the world of structured and unstructured data, either on-premise as well in the Microsoft Azure cloud. The concept is simple: within an APS database you create an ‘External’ table that points to data located in a hadoop hdfs file system or in Windows Azure Blob Storage enabling hybrid data access. It allows you to seamlessly import, export & access data even with a small ... Read More
Is Cassandra a column oriented or columnar database
Vote count:
4
Columnar database should store group of columns together. But Cassandra stores data row-wise. SS Table will hold multiple rows of data mapped to their corresponding partition key. So I feel like Cassandra is a row wise data store like MySQL but has other benefits like "wide rows" and every columns are not necessarily to be present for all the rows and of course it's in memory . Please correct me if I'm wrong.
edited 10 hours ago
tharindu_DG
asked Aug 22 '14 at 7:40
... Read More
Recent Comments