Automating QuickSight Asset Migration: A Python-Powered Guide

Moving QuickSight assets such as dashboards, analyses, and datasets from development to production is often complex and prone to manual error. A custom Python script can automate the entire process, ensuring a seamless transition with minimal human intervention. In this guide, I’ll show you how to set up and use a Python script to export, update, and import your QuickSight assets while switching data sources and adjusting asset names for production use.

Overview

I’ll share how I designed a custom Python script to move QuickSight assets between environments, specifically from development (dev) to production (prod). My use case involved using the same AWS account but updating the data source for datasets while migrating dashboards and analyses without issues. Here’s a breakdown of the key elements:

  • I used folders to manage user access for each dashboard.
  • The goal was to update the dataset’s data source when moving from dev to prod while maintaining asset structure and content.
  • Asset names in dev carried a _dev suffix, which had to be removed during the migration to production.

This blog will walk you through the process and explain how the script automates key steps like name changes, data source updates, and more. You’ll learn how to adapt this process to your own workflows for smooth QuickSight asset management.

The Initial Setup

import json
import os
import time
import zipfile

import boto3

# Input: JSON object containing dashboard names and corresponding ARNs
DASHBOARD_ARNS = {
    "DASHBOARD_NAME": "arn:aws:quicksight:us-east-1:ACCOUNT_ID:dashboard/dashboard_id",
    # Add other dashboards as needed
}

ACCOUNT_ID = "ACCOUNT_ID"
AWS_REGION = "us-east-1"
ASSETS = ["dataset", "dashboard", "analysis"]
ZIP_FOLDER = "zipfile/"

NEW_DATASOURCE_ARN = "arn:aws:quicksight:us-east-1:ACCOUNT_ID:datasource/DATASOURCE_ID"
FOLDER_ARN = "arn:aws:quicksight:us-east-1:ACCOUNT_ID:folder/FOLDER_ID"
SYSTEM = "prod"

NEW_ASSET_ARNS = []
client = boto3.client('quicksight', region_name=AWS_REGION)

This initial setup pulls in the required imports, sets the AWS region and account ID, and defines the inputs for the migration: the ARNs of the dashboards to move, the production data source, and the destination folder.

Exporting the Assets

def start_export(dashboard_arn, export_id):
    response = client.start_asset_bundle_export_job(
        ResourceArns=[dashboard_arn],
        ExportFormat="QUICKSIGHT_JSON",
        IncludeAllDependencies=True,
        AssetBundleExportJobId=export_id,
        AwsAccountId=ACCOUNT_ID
    )
    return response

def check_export_status(export_id):
    try:
        export_status = "IN_PROGRESS"
        while export_status == "IN_PROGRESS":
            response = client.describe_asset_bundle_export_job(
                AssetBundleExportJobId=export_id,
                AwsAccountId=ACCOUNT_ID
            )
            print(f'Export status: {response["JobStatus"]}')
            if response["JobStatus"] == "SUCCESSFUL":
                print("Export job completed")
                return response["DownloadUrl"]
            elif response["JobStatus"] == "FAILED":
                print("Export job failed")
                return ""
            else:
                print("Export job not completed yet")
                time.sleep(10)
    except Exception as e:
        print(f"Failed due to {e}")
        return ""

def unzip_file():
    print("Unzipping export file")
    with zipfile.ZipFile("export.zip", "r") as zip_ref:
        zip_ref.extractall(ZIP_FOLDER)
        print("Unzip complete")

start_export initiates the export of a QuickSight dashboard as an asset bundle job. IncludeAllDependencies=True ensures the export covers not only the dashboard but also its related datasets and analyses, packaging everything required for production. check_export_status polls the job until it finishes and returns the DownloadUrl on success. After downloading the resulting zip file, we unzip it locally so the assets can be updated before import.
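
One step the functions above rely on but don’t show is fetching the bundle from the returned DownloadUrl. Here is a minimal sketch using only the standard library, saving to the export.zip file name that unzip_file expects (the helper name download_export is my own):

import urllib.request

def download_export(download_url):
    # Save the exported asset bundle locally as export.zip
    print("Downloading export bundle")
    urllib.request.urlretrieve(download_url, "export.zip")
    print("Download complete")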

Updating Asset IDs

QuickSight’s import job does not automatically generate new IDs for assets. This function adds a -prod suffix to the asset IDs, ensuring the imported assets don’t overwrite existing ones in the same account.

def update_asset_ids_in_file():
    print("Updating asset IDs in exported files")
    for root, dirs, files in os.walk(ZIP_FOLDER):
        for filename in files:
            if any(asset in root for asset in ASSETS):
                filepath = os.path.join(root, filename)
                try:
                    with open(filepath, 'r') as file:
                        data = json.load(file)
                        if 'dataSetId' in data:
                            new_id = f"{data['dataSetId']}-prod"
                            data['dataSetId'] = new_id
                            NEW_ASSET_ARNS.append(f"arn:aws:quicksight:{AWS_REGION}:{ACCOUNT_ID}:dataset/{new_id}")

                        if 'dashboardId' in data:
                            new_id = f"{data['dashboardId']}-prod"
                            data['dashboardId'] = new_id
                            NEW_ASSET_ARNS.append(f"arn:aws:quicksight:{AWS_REGION}:{ACCOUNT_ID}:dashboard/{new_id}")

                        if 'analysisId' in data:
                            new_id = f"{data['analysisId']}-prod"
                            data['analysisId'] = new_id
                            NEW_ASSET_ARNS.append(f"arn:aws:quicksight:{AWS_REGION}:{ACCOUNT_ID}:analysis/{new_id}")

                        if 'logicalTableMap' in data:
                            # Dataset references inside other assets must also point at the new -prod IDs
                            for table_id, table in data['logicalTableMap'].items():
                                if 'source' in table and 'dataSetArn' in table['source']:
                                    old_arn = table['source']['dataSetArn']
                                    if not old_arn.endswith('-prod'):
                                        table['source']['dataSetArn'] = old_arn + '-prod'
                                        print(f"Updated logicalTableMap dataset ARN: {old_arn} -> {table['source']['dataSetArn']}")
                    with open(filepath, 'w') as file:
                        json.dump(data, file, indent=4)
                    print(f"Updated asset IDs in {filename}")
                except Exception as e:
                    print(f"Error updating {filename}: {e}")

This step is crucial to avoid unintended updates to your dev assets.

Renaming and Updating Assets

In this specific case, the naming convention and same-account constraints required some asset renaming.

def update_asset_names():
    dev_assets = ["dashboard", "analysis", "dataset"]
    print("Updating asset names in exported files")
    for root, dirs, files in os.walk(ZIP_FOLDER):
        for filename in files:
            filepath = os.path.join(root, filename)
            try:
                with open(filepath, 'r') as file:
                    data = json.load(file)
                    if 'name' in data and data.get('resourceType') in dev_assets:
                        if data['name'].endswith('_dev'):
                            # Strip the trailing "_dev" to produce the production name
                            data['name'] = data['name'][:-len('_dev')]
                with open(filepath, 'w') as file:
                    json.dump(data, file, indent=4)
                print(f"Updated Name in {filename}")
            except Exception as e:
                print(f"Error updating {filename}: {e}")

Assets are renamed by removing the _dev suffix to mark them as production-ready. Combined with the -prod IDs applied by update_asset_ids_in_file above, this keeps the imported copies clearly separated from their dev counterparts; the next step updates the data source ARN so the assets use the correct source of data in the new environment.

Updating the Datasource

def replace_datasource_in_file():
    print("Replacing DataSourceArn in exported dataset files")
    for root, dirs, files in os.walk(ZIP_FOLDER):
        for filename in files:
            if "dataset" in root:
                filepath = os.path.join(root, filename)
                try:
                    with open(filepath, 'r') as file:
                        data = json.load(file)
                        if 'physicalTableMap' in data:
                            for table in data['physicalTableMap'].values():
                                # Handle both custom-SQL and relational (table-based) datasets
                                if 'customSql' in table and 'dataSourceArn' in table['customSql']:
                                    table['customSql']['dataSourceArn'] = NEW_DATASOURCE_ARN
                                elif 'relationalTable' in table and 'dataSourceArn' in table['relationalTable']:
                                    table['relationalTable']['dataSourceArn'] = NEW_DATASOURCE_ARN
                    with open(filepath, 'w') as file:
                        json.dump(data, file, indent=4)
                    print(f"Updated DataSourceArn in {filename}")
                except Exception as e:
                    print(f"Error updating {filename}: {e}")

Here, each dataset’s data source ARN is swapped for the production data source ARN, covering both custom SQL and relational table definitions.

Importing the Assets to Production
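
The import job below expects a processed.zip archive containing the updated files. The re-zipping step isn’t part of the snippets above; here is a minimal sketch that repackages everything under ZIP_FOLDER (the function name zip_processed_files is my own):

def zip_processed_files():
    # Re-zip the updated asset files into processed.zip for the import job
    with zipfile.ZipFile("processed.zip", "w") as zip_ref:
        for root, dirs, files in os.walk(ZIP_FOLDER):
            for filename in files:
                filepath = os.path.join(root, filename)
                # Preserve the bundle's internal folder structure inside the archive
                arcname = os.path.relpath(filepath, ZIP_FOLDER)
                zip_ref.write(filepath, arcname)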

def start_import(export_id):
    with open('processed.zip', 'rb') as file:
        response = client.start_asset_bundle_import_job(
            AwsAccountId=ACCOUNT_ID,
            AssetBundleImportJobId=export_id,
            AssetBundleImportSource={'Body': file.read()},
            FailureAction='ROLLBACK',
        )
    return response

def create_folder_if_not_exists(dashboard_name, folder_arn):
    folder_name = dashboard_name.replace("_", "-")
    folder_id = f"{folder_name}-prod"
    try:
        response = client.create_folder(
            AwsAccountId=ACCOUNT_ID,
            FolderId=folder_id,
            Name=dashboard_name,
            ParentFolderArn=folder_arn
        )
        print(f"Folder '{dashboard_name}' created successfully.")
        return folder_id
    except client.exceptions.ResourceExistsException:
        print(f"Folder '{dashboard_name}' already exists. Skipping creation.")
        return folder_id
    except Exception as e:
        print(f"Error creating folder: {e}")
        return None

def add_assets_to_folder(folder_id):
    print(f"Adding new assets to folder {folder_id}")
    for asset_arn in NEW_ASSET_ARNS:
        # Derive the member type (DATASET, DASHBOARD, ANALYSIS) from the ARN
        asset_type = asset_arn.split(":")[-1].split("/")[0].upper()
        response = client.create_folder_membership(
            AwsAccountId=ACCOUNT_ID,
            FolderId=folder_id,
            MemberId=asset_arn.split("/")[-1],
            MemberType=asset_type
        )
        print(f"Added {asset_type} {asset_arn} to folder.")


def check_import_status(folder_id, export_id):
    import_status = "IN_PROGRESS"
    while import_status == "IN_PROGRESS":
        response = client.describe_asset_bundle_import_job(
            AssetBundleImportJobId=export_id,
            AwsAccountId=ACCOUNT_ID
        )
        print(f'Import status: {response["JobStatus"]}')
        if response["JobStatus"] == "SUCCESSFUL":
            print("Import job completed")
            # Add imported assets to the created folder
            add_assets_to_folder(folder_id)
            return response["JobStatus"]
        elif "FAILED" in response["JobStatus"] or "CANCELLED" in response["JobStatus"] or "TIMED_OUT" in response["JobStatus"]:
            print("Import job failed")
            print([x['Message'] for x in response['Errors']])
            return ""
        else:
            print("Import job not completed yet")
            time.sleep(10)

After the necessary updates, the assets are re-zipped and imported into production. Once the import succeeds, the function places the assets into the designated folder using create_folder_membership so that folder-based user permissions carry over.
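
Putting the pieces together, a driver loop along these lines runs the whole pipeline once per dashboard. This is a sketch of the flow rather than the exact orchestration: download_export and zip_processed_files are the helpers sketched earlier, and the export_id naming is illustrative.

for dashboard_name, dashboard_arn in DASHBOARD_ARNS.items():
    export_id = f"{dashboard_name}-export"
    start_export(dashboard_arn, export_id)
    download_url = check_export_status(export_id)
    if not download_url:
        continue  # Skip this dashboard if the export failed

    download_export(download_url)
    unzip_file()

    update_asset_names()
    update_asset_ids_in_file()
    replace_datasource_in_file()
    zip_processed_files()

    start_import(export_id)
    folder_id = create_folder_if_not_exists(dashboard_name, FOLDER_ARN)
    if folder_id:
        check_import_status(folder_id, export_id)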

If you’re moving assets to a different account, you can skip the asset ID updates and focus only on the data source updates to ensure correct data connections in the new environment; a sketch of that option follows.
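
For cross-account moves, it’s also worth knowing that start_asset_bundle_import_job accepts OverrideParameters, which can rewrite data source properties (and even prefix every resource ID) at import time instead of editing the exported JSON by hand. A hedged sketch; the DataSourceId here refers to the data source inside the bundle, and you should check the boto3 documentation for the fields your data source type requires:

response = client.start_asset_bundle_import_job(
    AwsAccountId=ACCOUNT_ID,
    AssetBundleImportJobId="cross-account-import",
    AssetBundleImportSource={'Body': open('processed.zip', 'rb').read()},
    FailureAction='ROLLBACK',
    OverrideParameters={
        # Optionally prefix every imported resource ID to avoid collisions
        'ResourceIdOverrideConfiguration': {'PrefixForAllResources': 'prod-'},
        # Point the imported datasets at a data source in the target account
        'DataSources': [{
            'DataSourceId': 'DATASOURCE_ID',  # ID of the data source in the bundle
            'Name': 'production-datasource',
        }],
    },
)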

Conclusion

By automating the migration of QuickSight assets using this Python script, you can significantly streamline your workflow, reduce errors, and ensure consistency when moving assets from dev to prod. Whether you’re working within the same account or across accounts, this method makes it easier to manage dashboards, datasets, and analyses across environments.

Abishek Balasubramaniam
