Can Quick Sight fetch the data from S3 on regular cadence?

deeppatel7981 · November 2, 2023, 1:28am

I want to create a Quick Sight dashboard for my event based application to visualize the events. I am receiving events from customers daily and I am using AWS Lambda function to store the events into AWS S3 bucket like this YYYY-MM-DD/EventA and YYYY-MM-DD/EventB. I want to fetch that data into Quick Sight on daily basis (Every 24Hrs) in order to visualize and analyze the data as per the need.

Can someone help me understand how I could fetch the data from S3 to Quick Sight every 24 hours (like automate the process)?
Also I would like to understand the possible ways, I can arrange the data on QS Dashboard. For each day, I want to show some metrics for an entity and that entity could be same.

The design I am considering: Events → Lambda → S3 → Quick Sight.

Do you suggest any improvements on this design? Can you explain?

For example: entityId= ABC123, number of errors = 8, Date = 10/28/2023.

Sanjeeb2022 · November 2, 2023, 3:20am

Hi @deeppatel7981 - Welcome to AWS Quick Sight and thanks for posting the question. Couple of improvements on the design.

Is your source data is in compressed format? If not, I will advise to change the file format to parquet before writing to S3 and on top of that create an athena table with 2 partition keys ( date and the event type).
Integrate athena table with Quick Sight .
For performance improvements, you can think of using SPICE and make the data refresh time as 1 day ( possibly see whether FULL or incremental refresh).

I believe you can update the logic of lambda to compressed the data to parquet and and append the data in the athena table with partition.

Let’s hear from other experts as well, tagging @sagmukhe @David_Wong on this.

Regards - Sanjeeb

deeppatel7981 · November 7, 2023, 12:57am

Thanks @Sanjeeb2022 After considering other things, I proposed a new design:

Redshift → AWS Glue Crawler → AWS Glue Data Catalog → AWS Glue ETL → S3 → Quick Sight.

No, the source data is not compressed. Definitely, I will consider changing as per.
Is there a need to do that with the above design?
Sure

I also want to understand the cost estimations for using Quicksight here.

Sanjeeb2022 · November 10, 2023, 2:37am

Hi @deeppatel7981 - Thanks for sharing the details. If you are data is already in Redshift, you really no need to crawl and put the data in S3. Quick Sight can directly connect with Redshift and you can do the reporting on top of that.

If you want to have a quick discussion, we can connect.

Regards - Sanjeeb

deeppatel7981 · November 15, 2023, 12:26am

Hi @Sanjeeb2022 - Thanks for a quick reply. Let’s connect whenever you have sometime.

Sanjeeb2022 · November 15, 2023, 3:35am

Hi @deeppatel7981 - Thanks.

Topic		Replies	Views
Quick Sight Reporting over S3 Datalake directly Q&A data-source , analysis , performance , feature-request , s3	2	2402	June 7, 2023
S3 to Quick Sight Q&A developer , SDK	6	6382	January 29, 2024
Can you create an automatic data update from aws s3 t? Q&A data-source	5	475	October 5, 2023
S3 to quicksight daily data ingestion Q&A data-source , dataset , Business-Intelligence-Engineer	1	115	April 17, 2024
How to use Quick Sight API to fetch data of dataset in AWS Quick Sight which is imported by AWS S3 Q&A data-source , dataset , Business-Intelligence-Engineer	2	276	May 24, 2024

Can Quick Sight fetch the data from S3 on regular cadence?

Related topics