TITLE: Q&A from the Live session | session name
Answers generated from Webex transcript and chat interactions
Q: What is the advantage of a Redshift Spectrum pipeline over an Athena ORC/Parquet pipeline?
A: The advantage depends on the organization's use case. Say your organization already uses Redshift as its data warehouse and stores a lot of data there. In that case there is no need to copy the data anywhere else: you can scan the data from your existing Redshift and build your data sources on top of it. Athena works in more or less the same way. With Athena you can also query the data in S3 and connect it to QuickSight, and the same is true of Redshift.
If your organization has a data warehousing use case where the data already lives in a Redshift database, or its data has been growing for years, Redshift is the classic fit: you can create a Redshift Spectrum table and query the data in S3 through it.
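As an illustration of the "Spectrum table" mentioned above, here is a minimal sketch that builds the two SQL statements typically used to expose Parquet files in S3 to Redshift. Every name in it (schema, catalog database, IAM role ARN, bucket, table, and columns) is a hypothetical placeholder and does not come from the session. The helper only builds the SQL text; running it is left to whichever Redshift SQL client you already use.

```python
from typing import List, Tuple


def spectrum_ddl(
    schema: str,
    catalog_db: str,
    iam_role_arn: str,
    table: str,
    columns: List[Tuple[str, str]],
    s3_location: str,
) -> List[str]:
    """Return the SQL statements for an external schema and a Parquet external table."""
    # One "name type" pair per column, indented for readability.
    column_sql = ",\n  ".join(f"{name} {sql_type}" for name, sql_type in columns)

    # The external schema points Redshift at a database in the data catalog.
    create_schema = (
        f"CREATE EXTERNAL SCHEMA {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{catalog_db}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"CREATE EXTERNAL DATABASE IF NOT EXISTS;"
    )

    # The external table describes files that stay in S3; no data is copied.
    create_table = (
        f"CREATE EXTERNAL TABLE {schema}.{table} (\n"
        f"  {column_sql}\n"
        f")\n"
        f"STORED AS PARQUET\n"
        f"LOCATION '{s3_location}';"
    )
    return [create_schema, create_table]


# Hypothetical placeholder values, for illustration only.
statements = spectrum_ddl(
    schema="spectrum_demo",
    catalog_db="demo_catalog_db",
    iam_role_arn="arn:aws:iam::123456789012:role/DemoSpectrumRole",
    table="sales",
    columns=[("sale_id", "BIGINT"), ("sale_date", "DATE"), ("amount", "DOUBLE PRECISION")],
    s3_location="s3://demo-bucket/sales/",
)

for statement in statements:
    print(statement)
    print()
```

Once the external table exists, it can be queried like any other Redshift table, and a QuickSight data source can be built on top of that query.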
Q: What about Redshift Serverless? Does Spectrum consume RPUs, or is the pricing based only on data scanned (like Athena)?
A: You can connect your QuickSight dashboard to Redshift Serverless as well. Under the hood, when you work with a Spectrum database, the serverless pricing works in a similar way, as listed on the Redshift Serverless pricing page on the wiki. There is no difference in that respect.
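The two billing models named in the question, pay per data scanned and pay per RPU-hour, can be compared with simple arithmetic. The sketch below is only that: the rates and workload figures are made-up placeholders, not actual AWS prices, and which model applies to Spectrum queries on Redshift Serverless should be confirmed on the pricing page mentioned in the answer.

```python
def scan_based_cost(tb_scanned: float, price_per_tb: float) -> float:
    """Cost when billing is proportional to the data scanned (Athena-style)."""
    return tb_scanned * price_per_tb


def rpu_based_cost(rpus: float, runtime_seconds: float, price_per_rpu_hour: float) -> float:
    """Cost when billing is proportional to compute capacity and time (RPU-hours)."""
    rpu_hours = rpus * (runtime_seconds / 3600)
    return rpu_hours * price_per_rpu_hour


# Placeholder inputs (assumptions, not real prices or measurements).
example_scan_cost = scan_based_cost(tb_scanned=2.0, price_per_tb=5.0)
example_rpu_cost = rpu_based_cost(rpus=8, runtime_seconds=900, price_per_rpu_hour=0.40)

print(f"scan-based estimate: {example_scan_cost:.2f}")
print(f"RPU-based estimate:  {example_rpu_cost:.2f}")
```

The point of the comparison is that a scan-based bill depends on how much data the query reads, while an RPU-based bill depends on how much capacity runs and for how long, so the same query can be cheap under one model and expensive under the other.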