How do I remove duplicates from the dataset when uploaded to quicksight?
–*
ErikG
March 8, 2024, 7:16am
2
Hi @Sharell
could you please share some sample data?
But maybe the following will help.
Hello @egobbo .
You can also create calculated fields at the dataset level and then create a filter at the dataset level as well.
This way you can filter the data directly there and make sure that all the analyses that use it received the filtered data.
However remember that it is the best practice to work with already clean and massaged datasets to avoid overloading the BI layer with calculations and filters so your visuals can render blazing fast.
I would recommend to create a data pipelin…
BR
Thanks @ErikG Unfortunately that response was not useful for my case.
I’m trying to filter out the duplicates within columns. Or at least use conditional formatting to then filter from there within Quicksight. Kindly see below:
ErikG
March 8, 2024, 7:32am
4
Hi @Sharell
and there are no distinctions possible? Why are there so many duplicates in the data?
Maybe you can use runningCount to create a row_count
I guess you would need the most columns within the partition field
BR
ErikG
March 15, 2024, 2:20pm
5
Hi @Sharell
any updates on your side?
BR