Posts tagged partition
Table Partitioning In MySQL NDB Cluster and What’s New (Part I)

Feed: Planet MySQL; Author: Saroj Tripathy; This blog is about table partitioning in NDB Cluster. We will see how this feature has been enhanced from version to version, and which partitioning scheme a user should choose in which scenario. I will assume that the reader has some knowledge of NDB Cluster. Data distribution and table partitioning are usually coupled together. In NDB, when we talk about table partitioning, we mainly mean ‘data distribution’, as NDB doesn’t fully support RANGE, HASH or LIST partitioning. Most of the enhancements made to partitioning over the years are about ‘data distribution’ rather than ... Read More
Evolving Schemaless into a Distributed SQL Database

Feed: Planet MySQL; Author: Ovais Tariq; In 2016 we published blog posts (I, II) about Schemaless – Uber Engineering’s Scalable Datastore. We went over the design of Schemaless as well as explained the reasoning behind developing it. In this post today we are going to talk about the evolution of Schemaless into a general-purpose transactional database called Docstore. Docstore is a general-purpose multi-model database that provides a strict serializability consistency model on a partition level and can scale horizontally to serve high volume workloads. Features such as Transaction, Materialized View, Associations, and Change Data Capture combined with modeling flexibility and ... Read More
Ryan Lambert: Round Two: Partitioning OpenStreetMap
Feed: Planet PostgreSQL. By Ryan Lambert -- Published February 26, 2021. A few weeks ago I decided to seriously consider Postgres' declarative table partitioning for our OpenStreetMap data. Once the decision was made to investigate this option, I outlined our use case with requirements to keep multiple versions of OpenStreetMap data over time. That process helped draft my initial plan for how to create and manage the partitioned data. When I put the initial code to the test I found a snag and adjusted the plan. This post shows a working example of how to partition OpenStreetMap data loaded using PgOSM-Flex ... Read More
MySQL Query Profiling Using Performance Schema

Feed: Planet MySQL; Author: Vlad Mihalcea; Introduction: In this article, I’m going to explain how to do query profiling using the MySQL Performance Schema. The Performance Schema has been available since MySQL 5.5.3 and allows MySQL to instrument SQL queries and store their trace information in various tables that you can later inspect in order to determine why a given SQL statement is slow. On older versions of MySQL, you might have used the SHOW PROFILE command, but since this feature has been deprecated, you should now use the Performance Schema for SQL query profiling. Enabling the MySQL Performance Schema ... Read More
Hazelcast C++ Client Thread Structure

Feed: Blog – Hazelcast. Author: Ihsan Demir. We have just released the newest version of our Hazelcast C++ Client API. It has many improvements and new features compared to older releases, and I would like to provide some insights into the thread structure of our client, including how it interacts with the user threads. IO Thread: The IO thread is one of the most important threads. It writes and reads client messages to and from the server. This thread is the busiest, sits on the critical path, and must process IO bytes very fast. Therefore, on the reader side, it only ... Read More
January 2020: “Top 40” New CRAN Packages
Feed: R-bloggers. Author: R Views. [This article was first published on R Views, and kindly contributed to R-bloggers]. Two hundred thirty new packages made it to CRAN in January. Here are my “Top 40” selections in ten categories: Data, Finance, Genomics, Machine Learning, Medicine, Science, Statistics, Time Series, Utilities, and Visualization. Data: igoR v0.1.1: Provides tools to extract information from the Intergovernmental Organizations (‘IGO’) Database, version 3, provided by ... Read More
Operating Lambda: Building a solid security foundation – Part 1

Feed: AWS Compute Blog. Author: James Beswick. In the Operating Lambda series, I cover important topics for developers, architects, and systems administrators who are managing AWS Lambda-based applications. This two-part series discusses core security concepts for Lambda-based applications. In the AWS Cloud, the most important foundational security principle is the shared responsibility model. This broadly shares security responsibilities between AWS and our customers. AWS is responsible for “security of the cloud”, such as the underlying physical infrastructure and facilities providing the services. Customers are responsible for “security in the cloud”, which includes applying security best practices, controlling access, and taking ... Read More
Ryan Lambert: First Review of Partitioning OpenStreetMap
Feed: Planet PostgreSQL. By Ryan Lambert -- Published February 21, 2021. My previous two posts set the stage to evaluate declarative Postgres partitioning for OpenStreetMap data. This post outlines what I found when I tested my plan and outlines my next steps. The goal with this series is to determine if partitioning is a path worth going down, or if the additional complexity outweighs any benefits. The first post on partitioning outlined my use case and why I thought partitioning would be a potential benefit. The maintenance aspects of partitioning are my #1 hope for improvement, with easy and fast loading ... Read More
Create a custom data connector to Slack’s Member Analytics API in Amazon QuickSight with Amazon Athena Federated Query

Feed: AWS Big Data Blog. Amazon QuickSight recently added support for Amazon Athena Federated Query, which allows you to query data in place from various data sources. With this capability, QuickSight can extend support to query additional data sources like Amazon CloudWatch Logs, Amazon DynamoDB, and Amazon DocumentDB (with MongoDB compatibility) via their existing Amazon Athena data source. You can also use the Athena Query Federation SDK to write custom connectors and query any source accessible with a Java API, whether it is relational, non-relational, object, or a custom data endpoint. A common analytics use case is to access ... Read More
Amazon MSK backup for Archival, Replay, or Analytics

Feed: AWS Architecture Blog. Amazon MSK is a fully managed service that helps you build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can use native Apache Kafka APIs to populate data lakes. You can also stream changes to and from databases, and power machine learning and analytics applications. Amazon MSK simplifies the setup, scaling, and management of clusters running Apache Kafka. MSK manages the provisioning, configuration, and maintenance of resources for highly available Kafka clusters. It is fully ... Read More