I’ve encountered a scenario that I’m unable to understand. I’m using a table-type visual, and when I download the table in Excel format and CSV format, the sum(column) of values (as a test) are not the same.
The correct data is provided by the CSV format; however, in the Excel format(xlsx), I’ve found (after multiple tests) that some rows are randomly duplicated while others are deleted.
If this helps, the data doesn’t undergo any transformation in the analysis or in the dataset. In other words, I’m simply importing the data to a QuickSight dataset through a custom SQL query and then visualizing it through an analysis. No calculated fields have been created either.
The table columns are being added under the VALUE section, NOT under the GROUP BY section
There are different limits for the export of XLSX and CSV. Could this be the reason for the different totals?
The duplicates sound like an error. Have you checked the limits?
Yes, I’m almost certain that the problem is caused when the limits are exceeded. In fact, I’m trying to understand the behavior to report the error. I could say, although it hasn’t been confirmed to me, that once the Excel format limits are exceeded, the extracted data may contain randomly duplicated lines and some lines are eliminated just as randomly.
When the amount of data is reduced, there are no duplicates. Actually the data matches with the version in csv. This is actually part of the investigation that I am doing since my customers are affected by this.
If my assumption is correct what I dont understand is why quicksight decides to create duplicates when the limit is reached.
to answer that you need a developer from AWS.
I am not sure what you are investigating, because the rows are correct in a download that does not exceed the limits.
And in case you exceed the excel-limit the data is not complete at all.
The CSV-limits are much higher and CSVs can easily be migrated into excel files. Could you (your customer) work with CSV exports?
Customers are now working only with csv format. I want to check if it possible to hide/block the excel format option(Export to Excel). I think it is, but not sure if this can be applied only to a specific dashboard.
Since we have not heard back from you, I’ll go ahead and close/archive this topic. However, if you have any additional questions, feel free to create a new topic in the community and link this discussion for relevant information.