Connect Starburst Galaxy with Amazon QuickSight

Starburst is an AWS Data and Analytics Competency Partner with the Amazon Graviton and EC2 Spot Ready product designations. Starburst Galaxy is a fully-managed data platform, powered by Trino, that combines the scalability of a data lake, the functionality of a data warehouse, and the reach of data virtualization in a single platform. Galaxy lets you easily discover, govern, and analyze petabyte-scale datasets in and around your cloud data sources—including Amazon Simple Storage Service (Amazon S3), Amazon Redshift, and Amazon Relational Database Service (Amazon RDS).

Galaxy offers several business intelligence (BI) and visualization tool integrations, including Amazon QuickSight. QuickSight is a cloud-native, serverless BI service that lets you build visualizations, perform ad hoc analysis, and gain insights through machine learning (ML) capabilities such as anomaly detection, forecasting, and natural language querying.

This post shows you how to connect QuickSight to Starburst Galaxy for powerful visualizations across all your cloud data sources.

Prerequisites

To complete this configuration, you need the following:

For additional discussions and help getting answers to your questions, check out the QuickSight Community.

Retrieve Galaxy connection details

We connect from QuickSight to the sample catalog from free-cluster, as shown in the following screenshot.

To get all the required connection details, complete the following steps:

  1. In Galaxy, choose Partner connect in the navigation pane.
  2. Choose the QuickSight tile.
  3. For Select cluster, choose free-cluster.
  4. Copy the information about the user, host, and port to use in a later step.
  5. Choose Connect to partner to be redirected to the QuickSight console.

Alternatively, you can navigate to the QuickSight console in your browser.

Configure QuickSight

Complete the following steps to configure QuickSight:

  1. On the QuickSight console, choose Datasets in the navigation pane.
  2. Choose New dataset.
  3. Choose Starburst.
  4. Select the Starburst product you want to connect to (Starburst Enterprise is self-hosted and Starburst Galaxy is fully managed). For this post, we select Starburst Galaxy.
  5. For Data source name, enter a unique name for your dataset.
  6. Choose your connection type.
  7. For Database server, enter the host URL you copied earlier.
  8. For Port, enter the port you copied earlier.
  9. For Catalog, enter your catalog (for this post, we use sample).
  10. Enter the user name and password that you copied earlier.
  11. Choose Validate connection, then choose Create data source.
  12. For Schema, choose your preferred method to configure the data source in QuickSight:
      1. Use the default option to enter your schema and table. For this post, we specify the burstbank schema from the sample catalog.
      2. Choose Custom SQL for more advanced use cases, or migrations from other BI tools where SQL is already defined.

  13. For Tables, select your table (for this post, we use customer).
  14. Choose Select.
  15. Select Directly query your data to get data from Starburst Galaxy in live mode.

You can alternatively use SPICE if you have large datasets and want to import the data into QuickSight for quicker analytics.

  1. Choose Visualize.
  2. Specify which type of report you want to create, then choose Create.
  3. When a new sheet is created, drag the countryfield into the empty canvas and change the type of visual to a map.

You will quickly see a geographical representation of the customers on the map.

You can keep exploring the amazing visualization capabilities of the QuickSight platform!

Conclusion

In this post, we showed you how to connect Starburst Galaxy with QuickSight to visualize data from all of your cloud data sources—across Regions and even clouds.

If you haven’t already, your next step is to create a data product in Starburst Galaxy. Data products are a collection of curated, high-quality related datasets and relevant metadata that let you easily govern, share, and monitor the data you share with your end-users. You can also attach specific compute resources to individual data products so that you can tailor compute to the needs of your individual workloads.

After you have your data product set up, simply pull it into QuickSight so your end-users can start visualizing and exploring the data.

To get started, check out AWS Marketplace and sign up for a free trial with Starburst Galaxy.


About the authors

Piotr Leniartek is a Data oriented Product Manager with more than 10 years of experience in the software industry, truly believes that data is the biggest asset of any organization and is fundamental to unlock AI adoption. He spends his spare time as an advisor for educational non-profits.

Alex Albu is an engineering manager at Starburst, whose teams are responsible for client tools, business intelligence tool integrations and the Python ETL ecosystem. He has over 20 years of experience building software for companies of all sizes, from start-ups to Fortune 500 companies.

Antony Prasad Thevaraj is a Sr. Partner Solutions Architect in Data and Analytics at AWS. He has over 12 years of experience as a Big Data Engineer, and has worked on building complex ETL and ELT pipelines for various business units.


This is a companion discussion topic for the original entry at https://aws.amazon.com/blogs/business-intelligence/connect-starburst-galaxy-with-amazon-quicksight/