First Level Trade-Off to select the right storage for your needs

Intro

Every time you start a new project, the question arises as to which data platform and data storage to choose. There is a wide variety of storage options to choose from, each with different features, advantages, and disadvantages. Working through the trade-off analysis is really time-consuming.

The purpose of…


Making Sense of Big Data

Set of Best Practices (dis)proved by Benchmarking

Introduction

The market of MPP engines is pretty broad, and the big cloud players have offerings of their own that constantly evolve. So it’s really interesting to get a better understanding of their capabilities and how they perform.

Let’s review Azure’s Synapse Dedicated SQL Pool MPP database platform, which is the evolution of…


Opinion

Invented by F. Puppini and promoted by B. Inmon, it claims to be a revolution in Self-Service BI

Intro

Recently I came across the new book by Bill Inmon and Francesco Puppini called “Unified Star Schema” (referred to as USS below). A new book in 2020 from the father of data warehousing definitely grabbed my attention, so I bought it and read it in the following 3…


Making Sense of Big Data, BigData Modeling Patterns

How to model hierarchies in NoSQL leveraging the best practices from the relational world

Intro

Sometimes we come across cases where we need to model a hierarchy with different levels of complexity and are not really sure how to do that properly in the most efficient, reliable, and flexible way. Let’s review one of the data modeling patterns that gives us some answers.

Problem Statement

Consider we…


IoT Analytics Part 3: Comparison of Time Series Engines

Intro

Time Series use cases in general, and the IoT domain in particular, are growing so fast that it’s vital to select the right storage for each particular use case.

Nowadays, every other database engine or platform is marketed as Time Series oriented, so let’s try to go deeper and find…


Opinion

Why to think twice before implementing Data Mesh

Intro

Recently, a new concept/paradigm called data mesh was introduced in the area of data platform architectures. It claims to drive a new architectural approach to building analytics solutions, is often treated as a cutting-edge, fancy approach, and has already started to be adopted by some organizations.


Best Practices of DW Modelling applied on IoT data for most flexible and efficient analytics

Intro

This is a continuation of the previous part, which described a problem statement related to the analysis of IoT data, using fitness tracking activities as an example. It also explained the reasoning behind the storage type selection and recommended having 2 types of storage:

  1. Big data…


Photo by Louis Hansel @shotsoflouis on Unsplash

Why Consistency Issues or C in CAP theorem

As many of you probably know, Cassandra is an AP big data store. In other words, when a network partition happens, Cassandra remains available and relaxes the Consistency property. …


Intro

There is no need to explain how IoT solutions are growing right now and the reasoning behind that. Let’s take it as a fact and from the data architecture perspective consider the following two challenges:

  1. what kind of storage to select to store the data
  2. how to model the data…

Andriy Zabavskyy

Big Data Architect & Data Warehouse Expert at SoftServe Inc.
