I will skip the topic of compression encoding in this post as I think Redshift Documentation already explains it very well. I am going to focus on other strategies like finding an alternative of Oracle’s Primary Key and Partitions in Amazon Redshift and choosing the right data types for columns.
Primary Keys Alternative:
In our practice, we have seen the most frequently used keys are great options for distribution keys and many a time those keys turn out to be Primary Keys. So while designing Redshift table, if you define your distribution keys columns as primary key columns of Oracle, it will be a good beginning.
Primary Keys Alternative:
Sort Keys are a good replacement for Oracle Partitions.
Column Data Types:
Below is the query which take a Oracle table’s DDL and generates the corresponding column Definition at Redshift. Again its only an estimated data type selection that I have come up with, which should give you a head start.
No comments:
Post a Comment