How to Remove Duplicate Rows by Event ID in a Dataset

Joao_Nascimento · August 29, 2024, 2:53pm

I have a dataset that contains duplicate rows based on the “Event ID” primary key. I would like to keep only one row for each unique Event ID, and discard any additional rows that have a duplicate Event ID.

The dataset has several other columns beyond just the Event ID, but I want to focus the deduplication process on just the Event ID column.

The challenge is that I am unable to perform any ETL work on the dataset. I need to find a way to remove the duplicate rows directly within the Quick Sight platform, preferably at the dataset level, to make it easier for the business users to create their analyses.

What is the best approach to identify and remove these duplicate rows in Quick Sight, keeping only a single row for each unique Event ID?

Thank you in advance for your help!

Shahid_Muhammad · August 30, 2024, 7:38am

Hi

This might be helpful.

Brett · September 16, 2024, 6:27pm

Hi @Joao_Nascimento,
It’s been awhile since we last heard from you; did you have any additional questions regarding your initial topic?

If we do not hear back within the next 3 business days, I’ll go ahead and close out this topic.

Thank you!

Brett · September 23, 2024, 7:56pm

Hi @Joao_Nascimento,
Since we haven’t heard back, I’ll go ahead and close out this topic. However, if you have any additional questions, feel free to create a new post in the community and link this discussion for any relevant information that may be needed.

Thank you!

Topic		Replies	Views
How do I remove duplicates from the dataset when uploaded to quicksight? Q&A Business-Intelligence-Engineer	5	553	March 20, 2024
Remove duplicate rows from dataset Q&A direct-query	11	8720	October 15, 2024
Delete duplicate with Incrementally refreshing a dataset Q&A data-source , analysis , parameters , formatting , error , calculations	13	1711	March 2, 2023
Remove duplicates on quicksight Q&A developer , dashboard-embed	1	93	October 11, 2024
How can we use the primary key feature to avoid duplication during incremental refresh? Q&A dataset , Business-Intelligence-Engineer	3	91	July 28, 2025

How to Remove Duplicate Rows by Event ID in a Dataset

Related topics