Import failing

ineedqshelp · February 11, 2025, 3:28pm

I have an S3 folder with data in it, and I am accessing it in QuickSight using Athena. It is a large amount of data, and I keep getting this SQL timeout message.

"SQL_EXCEPTION Learn More

This is a general SQL error. This can be caused by query timeouts, resource constraints, unexpected DDL alterations before or during a query, and other database errors. Check your database settings and your query, and try again.

Error Details: Query timeout"

I’m generating the data using AWS Glue and writing it to S3. How can I get this data into QuickSight without hitting this error?

Xclipse · February 11, 2025, 3:50pm

Hi @ineedqshelp

Partition Your Data: Ensure your Glue ETL job is partitioning the data effectively (e.g., by date, category, or another relevant key). You can check your partitions using:

SHOW PARTITIONS your_table_name;

Use Parquet/ORC Instead of CSV/JSON: Parquet and ORC are columnar formats that speed up queries. Convert your Glue job output to Parquet.
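One way to try both suggestions without changing the Glue job first is an Athena CTAS statement that rewrites an existing table as partitioned Parquet. The helper below is only a sketch that builds such a statement as a string. The table names, columns, partition column `dt`, and bucket path in the usage lines are hypothetical placeholders, and `write_compression` is assumed to be available in the Athena engine version in use.

```python
def build_ctas(source_table, target_table, s3_location,
               columns, partition_columns, compression="SNAPPY"):
    """Return an Athena CTAS statement that rewrites a table as partitioned Parquet."""
    # Athena expects the partition columns to come last in the SELECT list.
    select_list = ", ".join(list(columns) + list(partition_columns))
    partitions = ", ".join(f"'{col}'" for col in partition_columns)
    return (
        f"CREATE TABLE {target_table}\n"
        f"WITH (\n"
        f"  format = 'PARQUET',\n"
        f"  write_compression = '{compression}',\n"
        f"  external_location = '{s3_location}',\n"
        f"  partitioned_by = ARRAY[{partitions}]\n"
        f")\n"
        f"AS SELECT {select_list}\n"
        f"FROM {source_table}"
    )


# Hypothetical names, for illustration only.
sql = build_ctas(
    source_table="raw_events",
    target_table="events_parquet",
    s3_location="s3://example-bucket/events_parquet/",
    columns=["id", "amount"],
    partition_columns=["dt"],
)
print(sql)
```

The printed statement can be pasted into the Athena query editor. Note that a single CTAS query can create at most 100 partitions, so a large date range may need to be loaded in batches.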

Please refer to the documentation below; it might be helpful for you.

ineedqshelp · February 11, 2025, 3:51pm

Thank you for the quick response, I’ll look at this and give an update.

ineedqshelp · February 11, 2025, 6:30pm

Does the compression type matter? Is Snappy okay?

Xclipse · February 12, 2025, 4:50am

Hi @ineedqshelp

Yes, you can enable compression. Use Snappy or Zstd compression.

Snappy and Zstd (Zstandard) are commonly used compression formats for optimizing query performance in Amazon Athena.

Compression reduces the size of the data stored in S3, which helps Athena read less data, improving query performance and reducing costs.

Snappy: Fast compression/decompression, widely used with Parquet and ORC.
Zstd (Zstandard): Higher compression ratio than Snappy, but slightly slower decompression. Recommended for large datasets.
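Since the data is produced by a Glue job, the codec can be chosen where the job writes its output. The following is a rough sketch that assumes the job ends with a Spark DataFrame; the function name, output path, and partition column are hypothetical, and whether `zstd` is accepted depends on the Spark and Glue versions in use.

```python
def write_partitioned_parquet(df, output_path, partition_cols, compression="snappy"):
    """Write a Spark DataFrame as partitioned, compressed Parquet.

    `df` is expected to be a pyspark.sql.DataFrame; only the standard
    DataFrameWriter calls (mode, partitionBy, option, parquet) are used.
    """
    (
        df.write
        .mode("overwrite")                   # replace any previous output at this path
        .partitionBy(*partition_cols)        # one S3 prefix per partition value
        .option("compression", compression)  # e.g. "snappy" or "zstd"
        .parquet(output_path)
    )
```

Inside the Glue script this could be called as `write_partitioned_parquet(df, "s3://example-bucket/events/", ["dt"], "zstd")`, where the bucket path and the `dt` column are placeholders.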

Please refer to the documentation below; it might be helpful for you.

ineedqshelp · February 12, 2025, 3:35pm

I tried using Parquet with Snappy compression, and I’m still getting the import failed message, so I will try with Zstd.

Brett · February 24, 2025, 8:26pm

Hi @ineedqshelp,
It’s been a while since the last communication on this thread. Were you able to find a workaround for your case, or do you have any additional questions?

If we do not hear back within the next 3 business days, I’ll close out this topic.

Thank you!

ineedqshelp · February 24, 2025, 9:24pm

I have not found a workaround. Do you know if there’s a way to increase the SPICE refresh limit if I’m using Athena as the datasource? It keeps timing out after 30 minutes, but the data refreshes no problem if it’s using a direct query.

Xclipse · February 25, 2025, 4:35am

Hi @ineedqshelp

The 30-minute timeout you’re encountering during SPICE refreshes with Athena is due to Athena’s default query timeout setting. To resolve this, you can request an increase in Athena’s timeout limit by contacting AWS Support. Alternatively, optimizing your queries or configuring incremental refreshes in QuickSight may help reduce query duration.
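As one illustration of optimizing the query, the dataset's custom SQL can select only the columns it needs and filter on the partition column so that Athena scans a recent window instead of the whole table. This is a manual sketch, not QuickSight's incremental refresh feature itself. The table, column names, and the assumption that the partition column `dt` holds ISO dates (yyyy-mm-dd) as strings are all hypothetical.

```python
from datetime import date, timedelta


def build_windowed_query(table, columns, partition_column, days, today=None):
    """Return a SELECT that reads only the last `days` days of partitions."""
    today = today or date.today()
    start = today - timedelta(days=days)
    column_list = ", ".join(columns)
    # Filtering on the partition column lets Athena prune older partitions.
    return (
        f"SELECT {column_list}\n"
        f"FROM {table}\n"
        f"WHERE {partition_column} >= '{start.isoformat()}'"
    )


# Hypothetical names, for illustration only.
print(build_windowed_query("events_parquet", ["id", "amount", "dt"], "dt", days=30))
```

If the windowed query finishes well under 30 minutes, the window can be widened step by step to find how much data a single refresh can handle.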

Please refer to the documentation below; it might be helpful for you.