Xlsx vs csv format - data inconsistency

Hi,

I’ve encountered a scenario that I’m unable to understand. I’m using a table-type visual, and when I download the table in Excel format and CSV format, the sum(column) of values (as a test) are not the same.

The correct data is provided by the CSV format; however, in the Excel format(xlsx), I’ve found (after multiple tests) that some rows are randomly duplicated while others are deleted.

If this helps, the data doesn’t undergo any transformation in the analysis or in the dataset. In other words, I’m simply importing the data to a QuickSight dataset through a custom SQL query and then visualizing it through an analysis. No calculated fields have been created either.

The table columns are being added under the VALUE section, NOT under the GROUP BY section

Regards.

Hi @jgrueso ,

There are different limits for the export of XLSX and CSV. Could this be the reason for the different totals?
The duplicates sound like an error. Have you checked the limits?

Best regards,
Nico

Yes, I’m almost certain that the problem is caused when the limits are exceeded. In fact, I’m trying to understand the behavior to report the error. I could say, although it hasn’t been confirmed to me, that once the Excel format limits are exceeded, the extracted data may contain randomly duplicated lines and some lines are eliminated just as randomly.

I would like honestly to understand if this is the expected behaviour or someone that can confirm it or test it.

Hi @jgrueso ,

Can you reduce the amount of data and check whether the duplicates still exist?
If the limit is reached, the totals will not be correct either way.

Best regards,
Nico

When the amount of data is reduced, there are no duplicates. Actually the data matches with the version in csv. This is actually part of the investigation that I am doing since my customers are affected by this.

If my assumption is correct what I dont understand is why quicksight decides to create duplicates when the limit is reached.

Regards/

Hi @jgrueso ,

to answer that you need a developer from AWS.
I am not sure what you are investigating, because the rows are correct in a download that does not exceed the limits.
And in case you exceed the excel-limit the data is not complete at all.
The CSV-limits are much higher and CSVs can easily be migrated into excel files. Could you (your customer) work with CSV exports?

Best regards,
Nico

Hi Nico,

Customers are now working only with csv format. I want to check if it possible to hide/block the excel format option(Export to Excel). I think it is, but not sure if this can be applied only to a specific dashboard.

Regards,

Hi @jgrueso ,

you can restrict excel export in the custom permissions. This link may help you:

Best regards,
Nico

Hi @jgrueso

It’s been a while since we last heard from you. If you have any further questions, please let us know how we can assist you.

If we don’t hear back within the next 3 business days, we’ll proceed with close/archive this topic.

Thank you!

Hi @jgrueso

Since we have not heard back from you, I’ll go ahead and close/archive this topic. However, if you have any additional questions, feel free to create a new topic in the community and link this discussion for relevant information.

Thank you!