How to know where my dataset came from

Hi community. I have a quick question. When I create a dataset from an S3 file, I use the json manifest and everything works great.
Now that I have my dataset loaded to quicksight (using SPICE), I want to know if it is possible to know the location of the file that acts as a source for that dataset. Because I have a fews datasets that come from S3, but I don’t remember the location of some of the files, I think quicksight should let me know the path of the source S3 files for each dataset

Hi

I believe it’s not possible as the dataset depicts the source name for S3 only and not the link/URL.

@JeroDS7 ,

1/ Run a describe-data-set command to get the data source ID for that S3 data source.
2/ Run a describe-data-source command on the data source ID : The output will display for you the location of manifest file . If you have access to the manifest file, then you can view the S3 bucket details where the actual data is stored

“DataSource”: {
“Arn”: “arn:aws:quicksight:eu-central-1:XXXXX:datasource/Toy-Sales-datasource-id-V1”,
“DataSourceId”: “Toy-Sales-datasource-id-V1”,
“Name”: “Toy Sales”,
“Type”: “S3”,
“Status”: “CREATION_SUCCESSFUL”,
“CreatedTime”: “2025-01-08T17:35:07.205000+01:00”,
“LastUpdatedTime”: “2025-01-08T17:35:07.205000+01:00”,
“DataSourceParameters”: {
“S3Parameters”: {
“ManifestFileLocation”: {
“Bucket”: “abctoysales”,
“Key”: “bcdtoysales/toy_sales.manifest”
}
}
}
}

Kind regards,
Koushik

1 Like

Thanks, but I don’t quite understand your answer.
The describe-data-set command requires the data set ID as an argument, so how can I use it to fetch the ID if it is one of the arguments?

Where can I get the data set ID?

In the URL : https://eu-central-1.quicksight.aws.amazon.com/sn/data-sets/dataset ID

Thanks, I see it know and it worked. But my output doesn’t show the manifest file, it only prints

{
“Status”: 200,
“DataSource”: {
“Arn”: “arn:aws:quicksight:us-east-1:XXX:datasource/XXX”,
“DataSourceId”: “XXX”,
“Name”: “flujo”,
“Type”: “S3”,
“Status”: “CREATION_SUCCESSFUL”,
“CreatedTime”: “2025-01-12T19:34:10.020000-03:00”,
“LastUpdatedTime”: “2025-01-12T19:34:10.020000-03:00”
},
“RequestId”: “XXX”
}

is this becuase I uploaded the manifest file from my local pc and don’t have it stored in S3?

2 Likes