Not able to join of three s3 datastes

I have 3 s3 datasets ( Csv files uploaded to S3 then connect to QS ) below highlighted

. ------1

I have created 4 CC fields in one of the table, below highlighted

------2

I am going to join the above mentioned datasets ( costrepcostandvarince ) to date & delivery_prod… datasets ) . For that joining Purpose I have created a CC field in each table ( Short Date Column )… Short Date, Short Date & Prod Short Date

. -----3
I see the Short Date column in both Date table & delivery_prod table when i try to joining… but
when i try to join costreprcostandvarince and Data table, i am not able to see the short Date column in cost table
( All CC fields that i have created in 2 screenshot not visible after adding the other 2 datasets — scrolled down too

)
( it’s weird ). Moreover, Publish & Visualize option in the right top is disabled.

Pls help me out here.
Why i am not able to join + not able to see CC fields in screenshot when i try to join

  • why the Publish & Visualize option is not visible

Hi @Venkat.Metadata

The publish option will not be enabled unless your dataset definition is all correct. In this case since your joins are not completely defined the Publish button is disabled.

I suggest you do things one step at a time so you are able to debug the issues; resolve them and then go ahead.

Unless you share the details of the individual datasets and their data type it is difficult to say what the problem is. Since you say the Calendar Date and Deivery_prod_vol_and_variance has the required columns to join complete that and then deal with the next ones.

Check for data types; that may be one of the reasons for the issues. It may not be compatible for the columns you want to join

Hi @Giridhar.Prabhu , I have cross checked & verified the datatypes of the joining columns ( Short Date… whatever i have created those CC fields ), are same datatypes ( String - in all 3 datasets ).

—1

—2

—3

Moreover, Let’s say I am about create an analysis using Data Table & I have added the Cost table ( Screenshot —3, for the joining purpose Short Date = Short Date ). After adding the another dataset, i don’t see the CC fields in Date table.

Before adding the Cost Dataset ( I am again saying all the datasets are from S3 )

After Adding the Cost Dataset – Not able to see the Short Data column in Date table dataset for the joining purpose

( But i see that CC column in Cost Table ). Similarly, if i start analysis using cost dataset then adding Date dataset, i don’t see CC fields in Cost dataset & able to see CC fields in Date table

Pls help me out. ( Is this a limitation - not possible to use CC columns in Joining ). I would say the data Modelling is majorly causing my development + ETL also major issues in QS because most Clients give raw data not cleaned & normalized data for the report development, right?.
Besides, the data modeling using joining causing duplications or removing records in the existing datasets thus resulting incorrect data in the visuals.

Hi @Venkat.Metadata - Are you able to take one data set and publish it and see the result and then add another one and do the join and able to see the data or not. I agree with you Data modelling is very much require which is already raised to QS team and some new feature will come down the line. If you can create samples in Arena, then we can verify it .

Regards - Sanjeeb

1 Like

@Sanjeeb2022 , Yeah, as you said , i took one dataset first ( which is cost table ) & published then i have added Date table but facing same issue ( not able to see CC columns in Cost table but see in Date table while try to joining 2 tables using CC columns which is Short Date).

So, i decided to join Cost & Date tables with existing columns & able to did.

Then after joining i am able to see all CC columns but not able to see while joining using CC fields.

Pls answer me

  1. Any limitation in joining of datasets using CC fields
  2. Any timeline that this feature will add in QS ( Data Modeling + ETL )

@Hi All , pls reply of the above question

Hi @Venkat.Metadata

Data modeling in QS is not like Power BI. The setup like Power BI is a request that many have raised and I think has been listed as a feature request with AWS.

You have mentioned CC fields/columns. Can you elaborate what you mean by these fields?

@Giridhar.Prabhu Calculated fields

Hi @Venkat.Metadata

In general Calculated fields in dataset are available for joins. Refer the documentation that has the details

There may be some limitations since you have datasets from two different datasources. There is an article in the learning section. You can go through the same.

1 Like

Hi @Giridhar.Prabhu , All datasets are form s3 only as i mentioned above as well in the screenshots. ( Same source ).
Issue: I am not able to see calculated fields while joining & why?

Hi @Venkat.Metadata

Looking at the screenshot below it appeared datasources were different. One seems like a database Aurora/RDS and the other S3.

Anyway, I did a small test to show that calculated fields could be joined

Sales dataset

Test Currencies dataset

You may want to upload your case to the Arena so someone from AWS can take a look at it.

@Giridhar.Prabhu , I am not sure but they are from s3 location & how the image is changing

image

  1. I am about to create a analysis using " delivery_flex_metrics_downtime_by_line "

  2. Adding another dataset " reference line table "

after clicking reference table, coming like below



Noted.

You may simulate your dataset and make them available in Arena so it is easy for an AWS resource to look at the issue.