What is Amazon RedShift
🤔What is #Amazon #Redshift? #Why #Redshift ?
..
.
.
.
.
.
.
.
.
.
.
Let's understand In depth :-
=====================
✍️What is Amazon Redshift?
✏️Amazon Redshift is #Cloud based #DataWarehouse
✏️Amazon Redshift is one of the most popular Cloud based datawarehouse.
✏️It is #OLAP System meant to do #analytical Processing.
✏️Datawarehouse stores large amount of petabytes of data on which we do #Analytical Processing.
✏️Datawarehouse is meant for analytical processing not for doing transactional processing which Databases do.
✍️Pricing
======
✏️It's very low which starts from 0.25 cents per hour
✏️Clean up the resources once you are done .
✍️🙃🤔Why Redshift ?
================
To understand why Redshift We should understand the Traditional Datawarehouse challenges :-
========================
On-Prem Challenges:-
=================
✏️Intial Setup and Maintanence cost for traditional datawarehouse is high.
✏️First we have a upfront cost which means we need to buy Multiple machines(nodes let's say 40 nodes)
✏️ We have to set up them
✏️We have to bring coolers for reducing heat
✏️After that we need to have a team to maintain them .
✏️Scaling is difficult , suppose today you have capable of managing 10tb data using 40 nodes but for tommorow if say 100 tb comes we have to bring the extra nodes(Machines) and setup then manually .
✏️Tommorow Let's say we have come up with 1000tb , we can't increase the machines manually .To what extent we can do this process of manual scaling it is not that simple.
✏️Scaling the traditional Datawarehouse is difficult due to continuous upgradation.
✏️Possibility of loss of Information due to network issue.
✏️Data security issues we face
✏️Setting up the datawarehouse also takes time it won't happen in 1-2 days.
Let's say if you think today whole things might end up taking months.
✍️Redshift:-
==========
✏️Redshift Overcomes all the problems which we face with #Onprem traditional #datawarehouse.
✏️Amazon Redshift is a fully managed , #petabyte-scale , #fast scalable datawarehouse in the cloud.
✏️Redshift is said to be #10 times faster than #traditional datawarehouse.
✏️It is meant to for doing analysis on history of data.
✏️Redshift is faster due to Massive Parallelism.
✍️Redshift is 10Times faster due to internal Optimization :-
=====================
a) Uses columunar storage
b)Compute Optimised hardware
c)Massive parallel processing d)Compression techniques
e) Query Optmizer
f) Resultset-caching
g) Machine learning
✍️Internally all these optimization will undergo due to which #Redshift is #10x faster than #onprem #traditional #datawarehouse
.
Comments
Post a Comment