Remove duplicate rows from dataset

EnriqueS · October 27, 2022, 3:10pm

You can also create calculated fields at the dataset level and then add a filter on them at the dataset level as well.

This way you can filter the data directly there and make sure that all the analyses that use the dataset receive the filtered data.

However, remember that the best practice is to work with datasets that are already cleaned and prepared, so you avoid overloading the BI layer with calculations and filters and your visuals render fast.

I would recommend creating a data pipeline using AWS Glue or your favorite ETL tool to ensure that the data that arrives at the consumption layer (QuickSight in this case) doesn’t contain duplicates.
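
As a rough illustration of what that deduplication step could look like inside a pipeline, here is a minimal sketch in plain Python. It is not tied to AWS Glue or any specific ETL tool. The function name `drop_duplicate_rows`, the `event_id` column, and the sample rows are all made up for the example, and it assumes that a row counts as a duplicate when its key columns match an earlier row.

```python
from typing import Dict, Iterable, Iterator, Sequence

Row = Dict[str, str]


def drop_duplicate_rows(rows: Iterable[Row], key_fields: Sequence[str]) -> Iterator[Row]:
    """Yield only the first row seen for each combination of key_fields."""
    seen = set()
    for row in rows:
        # Build a hashable key from the columns that define "the same row".
        key = tuple(row[field] for field in key_fields)
        if key in seen:
            continue  # A row with this key was already emitted, so skip it.
        seen.add(key)
        yield row


# Hypothetical sample data: the third row repeats event_id "1".
sample_rows = [
    {"event_id": "1", "status": "open"},
    {"event_id": "2", "status": "open"},
    {"event_id": "1", "status": "open"},
]

deduplicated = list(drop_duplicate_rows(sample_rows, ["event_id"]))
print(deduplicated)
```

Keeping the first occurrence is only one possible rule. If you need the most recent record instead, you would sort the rows by a timestamp first, or use the equivalent deduplication feature of your ETL tool.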

Hope it helps. If it does, please mark this reply as the solution to help other members of the community; otherwise, let us know.

Happy dashboarding!
