I want to show S3 data on a QuickSight dashboard. The data in S3 is in Parquet format, and I want to know whether I can connect the S3 bucket to QuickSight directly. Also, can QuickSight pick up new data added to S3 automatically? The bucket receives new data daily, and the expectation is that once data is put into S3, it will be shown on the QuickSight dashboard.
You can use files in Amazon S3 or on your local (on-premises) network as data sources. QuickSight supports files in the following formats:
CSV and TSV - Comma-delimited and tab-delimited text files
ELF and CLF - Extended and common log format files
JSON - Flat or semistructured data files
XLSX - Microsoft Excel files
QuickSight supports UTF-8 file encoding, but not UTF-8 (with BOM).
Files in Amazon S3 that have been compressed with zip or gzip (www.gzip.org) can be imported as-is. If you used another compression program for files in Amazon S3, or if the files are on your local network, remove compression before importing them.
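As a rough illustration of the S3 route described above: QuickSight S3 datasets are defined through a manifest file that lists the file locations and the upload format. The sketch below builds such a manifest with the Python standard library. The bucket name and prefix are placeholders, and `build_manifest` is a hypothetical helper, not part of any AWS SDK.

```python
import json

# Formats accepted in a manifest's globalUploadSettings; Parquet is not among them.
ALLOWED_FORMATS = {"CSV", "TSV", "CLF", "ELF", "JSON"}


def build_manifest(uri_prefixes, file_format="JSON"):
    """Return a QuickSight S3 manifest as a plain dict (hypothetical helper)."""
    if file_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported manifest format: {file_format}")
    return {
        # Every file under these prefixes is included in the dataset.
        "fileLocations": [{"URIPrefixes": list(uri_prefixes)}],
        "globalUploadSettings": {"format": file_format},
    }


# "example-bucket" and "daily/" are placeholders for your own bucket and prefix.
manifest = build_manifest(["s3://example-bucket/daily/"])
manifest_json = json.dumps(manifest, indent=2)
print(manifest_json)
```

The resulting JSON can be saved as a file and supplied when creating the S3 dataset in QuickSight. Using a prefix rather than individual file URIs means files added later under the same prefix are covered by the same manifest.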
Regarding your question about how often the data is updated in QuickSight:
You can set the Glue Crawler schedule for how often it should update the Glue Data Catalog for Athena.
Schedule the crawler to keep the AWS Glue Data Catalog and Amazon S3 in sync (see the Best practices link above).
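To make the scheduling advice above concrete, here is a minimal sketch that builds a daily Glue cron expression and the request parameters for a crawler. All names, the role ARN, and the S3 path are placeholders, and `daily_cron` and `crawler_request` are hypothetical helpers. The code only assembles the request; the actual call is shown as a comment because it needs boto3 and AWS credentials.

```python
def daily_cron(hour_utc, minute=0):
    """Build a Glue schedule expression that fires once a day (UTC)."""
    if not (0 <= hour_utc <= 23 and 0 <= minute <= 59):
        raise ValueError("hour must be 0-23 and minute 0-59")
    # Glue cron fields: minutes, hours, day-of-month, month, day-of-week, year.
    return f"cron({minute} {hour_utc} * * ? *)"


def crawler_request(name, role_arn, database, s3_path, schedule):
    """Assemble keyword arguments for the Glue CreateCrawler API."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        "Schedule": schedule,
    }


# All values below are placeholders for illustration only.
request = crawler_request(
    name="daily-s3-crawler",
    role_arn="arn:aws:iam::123456789012:role/ExampleGlueRole",
    database="example_db",
    s3_path="s3://example-bucket/daily/",
    schedule=daily_cron(6),
)
print(request["Schedule"])

# With boto3 installed and credentials configured, this could be sent as:
#   import boto3
#   boto3.client("glue").create_crawler(**request)
```

Picking a time shortly after the daily upload usually lands keeps the catalog close to what is in the bucket.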
Hi @deeppatel7981 - The preferred option is to create an Athena table for the Parquet files and connect QuickSight to Athena. There is NO direct integration between Parquet files and QuickSight at present.
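If you define the Athena table by hand instead of using a crawler, the DDL could look roughly like what the sketch below generates. The database, table, column names, and S3 location are made up for illustration, and `parquet_table_ddl` is a hypothetical helper that only builds the statement text; you would run the result in the Athena query editor or through the API.

```python
def parquet_table_ddl(database, table, columns, location):
    """Build a CREATE EXTERNAL TABLE statement for Parquet data in S3."""
    cols = ",\n  ".join(f"`{name}` {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n"
        f"  {cols}\n"
        ")\n"
        "STORED AS PARQUET\n"
        f"LOCATION '{location}'"
    )


# Placeholder schema; replace with the columns in your Parquet files.
ddl = parquet_table_ddl(
    database="example_db",
    table="daily_events",
    columns=[("event_id", "string"), ("amount", "double"), ("event_date", "date")],
    location="s3://example-bucket/daily/",
)
print(ddl)
```

Because the table points at an S3 location rather than at specific files, queries see new Parquet files placed under that location without changes to the table definition.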
Thanks a lot for your quick response. Parquet format is not a hard requirement for me. I was able to change the format of the S3 data to "json", and with that in place I am adding gzip compression to the S3 files.
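For anyone taking the same route, here is a minimal sketch of producing a gzip-compressed JSON payload with the Python standard library. The records are made-up sample data and `to_gzipped_json` is a hypothetical helper. The payload is written as a JSON array of flat records, which is one layout that fits the "flat" JSON description above; adjust it if your data is shaped differently.

```python
import gzip
import json


def to_gzipped_json(records):
    """Serialize records to UTF-8 JSON (no BOM) and gzip the result."""
    payload = json.dumps(records).encode("utf-8")  # "utf-8" does not emit a BOM
    return gzip.compress(payload)


# Made-up sample records for illustration.
records = [
    {"event_id": "a1", "amount": 12.5},
    {"event_id": "b2", "amount": 7.0},
]
blob = to_gzipped_json(records)

# Round-trip check: decompressing gives back the same records.
restored = json.loads(gzip.decompress(blob).decode("utf-8"))
print(restored == records)

# With boto3 installed and credentials configured, the bytes could be uploaded with:
#   import boto3
#   boto3.client("s3").put_object(
#       Bucket="example-bucket", Key="daily/events.json.gz", Body=blob)
```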
It's been a while and we have not heard back from you, so we assume the issue has been solved based on the suggestion provided earlier. I will mark the question as solution provided. Let us know otherwise.