<h2 id="weaving-dataops-into-microsoft-fabric---automating-dax-query-view-testing-pattern-with-azure-devops">Weaving DataOps into Microsoft Fabric - Automating DAX Query View Testing Pattern with Azure DevOps</h2>
<p>In my last article, I covered implementing the <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part36/" target="_blank">DAX Query View Testing Pattern</a> to establish a standardized schema and testing approaches for semantic models in Power BI Desktop. This pattern facilitates sharing tests directly within the application. If you’re interested in a demonstration, check out my recent <a href="https://youtu.be/WyMQSyf3NvM?si=-W3TxyyJQXE0m-et" target="_blank">YouTube video</a> on the subject.</p>
<p>Now, as promised, let me discuss automating testing (i.e., Continuous Integration) using <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-overview" target="_blank">PBIP</a> and <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-git" target="_blank">Git Integration</a>.</p>
<p><em>DataOps Principle Orchestrate: The beginning-to-end orchestration of data, tools, code, environments, and the analytic team’s work is a key driver of analytic success.</em></p>
<p>Orchestration through Continuous Integration and Continuous Deployment (CI/CD) is essential for delivering analytics to customers swiftly while mitigating the risks of errors. <strong>Figure 1</strong> illustrates the orchestration of automated testing.</p>
<p><img src="/assets/img/posts/part37/Figure1.png" alt="Figure 1" class="center-image" />
<em class="center-text-figure">Figure 1 – High-level diagram of automated testing with PBIP, Git
Integration, and DAX Query View Testing Pattern</em></p>
<h3 id="high-level-process">High-Level Process</h3>
<p>In the process depicted in <strong>Figure 1</strong>, your team <strong><u>saves</u></strong> their Power BI work in the PBIP extension format and <strong><u>commits</u></strong> those changes to Azure DevOps.</p>
<p>Then, you or your team <strong><u>sync</u></strong> with the workspace and <strong><u>refresh</u></strong> the semantic models. For this article, I am assuming either manual integration or the use of <a href="https://github.com/microsoft/Analysis-Services/tree/master/pbidevmode/fabricps-pbip" target="_blank">Rui Romano’s code</a> to deploy a PBIP file to a workspace, with semantic models refreshed appropriately. With these criteria met, you can execute the tests.</p>
<h4 id="automated-testing">Automated Testing</h4>
<p>With the PBIP format, each tab in your DAX Query View exists as a separate DAX file in the “.Dataset/DAXQueries” folder (as demonstrated in <strong>Figure 2</strong>).</p>
<p><img src="/assets/img/posts/part37/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Example of DAX Query Tests in DAXQueries folder for the PBIP format</em></p>
<p>You can then leverage the Fabric application programming interfaces (APIs) and XMLA to execute each test query against the semantic model in the service. These tests can be executed through a pipeline, or they can run on a schedule to verify several times a day that all tests pass. But how?</p>
<h3 id="template">Template</h3>
<p>Well, I have a template for that on <a href="https://github.com/kerski/fabric-dataops-patterns/blob/main/DAX%20Query%20View%20Testing%20Pattern/automated-testing-example.md" target="_blank">fabric-dataops-patterns</a>. To get started, you need:</p>
<ol>
<li>
<p>An Azure DevOps project with at least Project or Build Administrator rights</p>
</li>
<li>
<p>A premium-backed capacity workspace connected to the repository in your Azure DevOps project. Instructions are provided <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-git" target="_blank">here</a>. I have tested this template in a Premium Per User and Fabric capacity.</p>
</li>
<li>
<p>A Power BI tenant with <a href="https://learn.microsoft.com/en-us/power-bi/enterprise/service-premium-connect-tools#enable-xmla-read-write" target="_blank">XMLA Read/Write Enabled</a>.</p>
</li>
<li>
<p>A service principal or account (i.e., a username and password) with a Premium Per User license. If you are using a service principal, you will need to make sure the Power BI tenant allows <a href="https://learn.microsoft.com/en-us/power-bi/enterprise/service-premium-service-principal#enable-service-principals" target="_blank">service principals to use the Fabric APIs</a>. The service principal or account will need at least the Member role in the workspace.</p>
</li>
</ol>
<p>With these requirements met, you can follow <a href="https://github.com/kerski/fabric-dataops-patterns/blob/main/DAX%20Query%20View%20Testing%20Pattern/automated-testing-example.md#instructions" target="_blank">these instructions</a> to create the variable group, set up the pipeline, and copy the sample YAML file to get started.</p>
<p>If you follow the steps correctly, any semantic models in the workspace that also exist in the repository and have test files will be queried to determine pass or fail statuses (<strong>Figure 3</strong>).</p>
<p><img src="/assets/img/posts/part37/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Example of DAX tests conducted through the build agent in Azure DevOps</em></p>
<p>Any failed tests will be logged as errors, and the pipeline will fail (see <strong>Figure 4</strong>).</p>
<p><img src="/assets/img/posts/part37/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - Example of failed test identified with automated testing</em></p>
<p>To run tests for select semantic models, pass the semantic model IDs as a comma-delimited string into the pipeline. The pipeline will only conduct tests for those semantic models (see <strong>Figure 5</strong>).</p>
<p><img src="/assets/img/posts/part37/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - Example of automating tests for a select number of semantic models</em></p>
<p>This is especially helpful if you are looking to take this pattern and apply this testing pipeline <a href="https://learn.microsoft.com/en-us/azure/devops/pipelines/process/templates?view=azure-devops&pivots=templates-includes" target="_blank">as a template</a>.</p>
<h3 id="monitoring">Monitoring</h3>
<p>It’s essential to monitor the Azure DevOps pipeline for any failures. I’ve also written about some best practices for setting that up <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part31/" target="_blank">in this article</a>.</p>
<h3 id="next-steps">Next Steps</h3>
<p>I hope you find this helpful in establishing a <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part36/" target="_blank">consistent pattern for testing</a> your semantic models and instituting <a href="https://github.com/kerski/fabric-dataops-patterns/blob/main/DAX%20Query%20View%20Testing%20Pattern/automated-testing-example.md" target="_blank">a repeatable process for automating testing</a>. I’d like to thank <a href="https://pt.linkedin.com/in/ruiromano?trk=author_mini-profile_title" target="_blank">Rui Romano</a> for <a href="https://github.com/microsoft/Analysis-Services/tree/master/pbidevmode" target="_blank">the code</a> provided on the Analysis Services Git repository, as it helped accelerate my team’s work to automate testing with the Fabric APIs.</p>
<p>In future articles, I will cover various aspects of automated testing
and how to proactively react to failures.</p>
<p>As always, let me know what you think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter/X</a>.</p>
<p><em>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</em></p>
<p><em>Git Logo provided by <a href="https://git-scm.com/downloads/logos">Git - Logo Downloads
(git-scm.com)</a></em></p><h2 id="weaving-dataops-into-microsoft-fabric---dax-query-view-testing-pattern">Weaving DataOps into Microsoft Fabric - DAX Query View Testing Pattern</h2>
<p>The last three months of Power BI Desktop releases have made this DataOps fanatic giddy. The introduction of <a href="https://learn.microsoft.com/en-us/power-bi/transform-model/dax-query-view" target="_blank">DAX Query View</a> has laid the foundation to easily couple your tests with your semantic model.</p>
<p>Since <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part4/" target="_blank">Part 4</a> of my series, I have demonstrated ways to use DAX to test your semantic model, but this required meticulously organizing your DAX files in a folder structure alongside your Power BI files. In addition, running these tests required 1) opening your tool of choice (e.g., DAX Studio, SSMS), 2) connecting to your local model, 3) opening the DAX file, and 4) running the test (example in Figure 1).</p>
<p><img src="/assets/img/posts/part36/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - An example of test cases and their output.</em></p>
<p>DataOps stresses reducing cycle times and manual tasks… DAX Query View cuts those steps in half. You simply open DAX Query View within Power BI Desktop and run the test (Figure 2).</p>
<p><img src="/assets/img/posts/part36/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Example of running tests in DAX Query View within Power BI Desktop</em></p>
<h3 id="introducing-a-pattern">Introducing a Pattern</h3>
<p>In the world of actual, tangible fabrics, a pattern is the template from which the parts of a garment are traced onto woven or knitted fabrics before being cut out and assembled. I would like to take that concept
and introduce a pattern for Microsoft Fabric, the <strong><em>DAX Query View Testing Pattern</em></strong>.</p>
<p>My hope is that with this pattern you have a template to weave DataOps into Microsoft Fabric and have a quality solution for your customers.</p>
<h3 id="why-test">Why Test?</h3>
<p>The hope-and-pray approach to publishing Power BI artifacts is counter to the DataOps mindset. Testing serves as the safety net that prevents your team from introducing errors in production. Testing also serves to identify issues in production proactively.</p>
<p>A customer is more likely to trust you if you come to them with a statement like: “We found an issue in production and we are working on a fix. It impacts this group of people, and I will give you an update in 30 minutes,” as opposed to getting a phone call from a customer stating: “There is an issue in production, are you aware of it?” Testing makes the former scenario more likely than the latter.</p>
<p>Now I say this knowing that testing only shows the presence of flaws, not the absence. However, if you can empirically show that what your team builds is founded on good testing practices, you have more legitimacy when defending your work. Testing defends against errors and against the scrutiny you will eventually receive.</p>
<h3 id="how-to-test">How to Test?</h3>
<p>To follow the DAX Query View Testing Pattern you must follow these steps:</p>
<p>1) Setup Workspace Governance</p>
<p>2) Standardize Schema and Naming Conventions</p>
<p>3) Build Tests</p>
<h4 id="setup-workspace-governance">Setup Workspace Governance</h4>
<p>To get started, we need to distinguish tests by their intended Power BI or Fabric workspace. This requires instituting workspace governance. You should have, at a minimum, two workspaces: one for development (DEV) and one for production (PROD). For larger projects, you should have a workspace for clients/customers to test (TEST) before moving to production. If you are unfamiliar with the concept, please <a href="https://en.wikipedia.org/wiki/Deployment_environment" target="_blank">read this wiki article</a>.</p>
<p>Your DEV workspace should have a static set of data (preferably using parameters) to provide a stable state with which you can build your tests. To test effectively, you need a known underlying set of data to validate your semantic model. For example, if your upstream data is Fiscal Year-based, you could parameterize your tests to look at a prior Fiscal Year where the data should be stable. The goal is to have a static set of data to work with, so the only variable that changes during a test is the code you or your team has changed in Power BI.</p>
<p>Your TEST/PROD workspaces are not static and are considered live. Tests in these workspaces conduct health checks (is there data in the table?) and identify <a href="https://kerski.tech/bringing-dataops-to-power-bi-part15/" target="_blank">data drift</a>.</p>
<h4 id="standardize-schema-and-naming-conventions">Standardize Schema and Naming Conventions</h4>
<p>With workspace governance in place, you then need to institute two standards when building tests:</p>
<p>1) Standard Output Schema - In this pattern all tests should be based on a standard tabular schema as shown in Table 1.</p>
<p><em class="center-text-figure">Table 1 – Schema for test outputs</em></p>
<table>
<thead>
<tr>
<th style="text-align: left">Column Name</th>
<th style="text-align: left">Type</th>
<th style="text-align: left">Description</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: left">TestName</td>
<td style="text-align: left">String</td>
<td style="text-align: left">Description of the test being conducted.</td>
</tr>
<tr>
<td style="text-align: left">ExpectedValue</td>
<td style="text-align: left">Any</td>
<td style="text-align: left">What the test should result in. This should be a hardcoded value or function evaluated to a Boolean.</td>
</tr>
<tr>
<td style="text-align: left">ActualValue</td>
<td style="text-align: left">Any</td>
<td style="text-align: left">The result of the test under the current dataset.</td>
</tr>
<tr>
<td style="text-align: left">Passed</td>
<td style="text-align: left">Boolean</td>
<td style="text-align: left">True if the expected value matches the actual value. Otherwise, the result is false.</td>
</tr>
</tbody>
</table>
<p>2) Tab Naming Conventions - Not only do we have a standard schema for the output of our tests, but we also make sure the names of the tabs in the DAX Query View have some organization. Here is the naming format I have started to use:</p>
<p><em class="center-text-figure">[name].[environment].test(s)</em></p>
<ul>
<li>
<p><em>[name]</em> is no more than 15-20 characters long. DAX Query View currently expands the tab name to fit the text, but we want to be able to tab between tests quickly.</p>
</li>
<li>
<p><em>[environment]</em> is DEV, TEST, or PROD and represents the different workspaces to run the test against. ALL is used where the same test should be conducted in all workspaces.</p>
</li>
<li>
<p>Finally, the suffix of “.tests” or “.test” helps us distinguish test files from working files.</p>
</li>
</ul>
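<p>Putting the standard schema and the naming convention together, a tab named something like <em>sales.DEV.tests</em> could contain a query along the following lines. This is a hedged sketch; the table <em>FactSales</em> and the measure <em>[Total Sales]</em> are hypothetical placeholders:</p>
<pre><code>// Hypothetical contents of a tab named sales.DEV.tests
EVALUATE
VAR _Tests =
    UNION (
        ROW (
            "TestName", "FactSales should not be empty",
            "ExpectedValue", TRUE (),
            "ActualValue", COUNTROWS ( 'FactSales' ) > 0
        ),
        ROW (
            "TestName", "Total Sales should not be blank",
            "ExpectedValue", FALSE (),
            "ActualValue", ISBLANK ( [Total Sales] )
        )
    )
RETURN
    // Passed is computed so every row conforms to the standard output schema
    ADDCOLUMNS ( _Tests, "Passed", [ExpectedValue] = [ActualValue] )
</code></pre>
<p>Computing the Passed column once, rather than hardcoding it per test, keeps each test definition short and makes the pass/fail logic consistent across tabs.</p>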
<h4 id="build-tests">Build Tests</h4>
<p>With this standard schema and naming convention in place, you can build tests covering three fundamental areas:</p>
<h5 id="testing-calculations">Testing Calculations</h5>
<p>Calculated Columns and Measures should be tested to make sure they behave as intended and handle edge cases. For example, let us say you have a DAX measure:</p>
<p><em>IF(SUM('TableX'[ColumnY]) < 0, "Condition 1", "Condition 2")</em></p>
<p>To test properly, you should create conditions to
test when:</p>
<p>a. The summation is > 0</p>
<p>b. The summation is = 0</p>
<p>c. The summation is < 0</p>
<p>d. The summation is blank</p>
<p><img src="/assets/img/posts/part36/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Example of tests for calculations like DAX measures and calculated columns.</em></p>
<h5 id="testing-content">Testing Content</h5>
<p>Knowing that your tables and columns have the appropriate content is imperative. If you have ever <a href="https://youtube.com/shorts/uTqHvxE6208?feature=share" target="_blank">accidentally kept a filter in Power Query</a> that was intended only for debugging/developing, you know testing content is important. Here are some tests you could run with this pattern:</p>
<ul>
<li>The number of rows in a fact table is greater than or equal to a number.</li>
<li>The number of rows in a dimension is not zero.</li>
<li>The presence of a value in a column that shouldn’t be there.</li>
<li>The existence of blank columns.</li>
<li>The values in a custom column are correct.</li>
</ul>
<p><img src="/assets/img/posts/part36/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - Example of testing content of your tables and columns.</em></p>
<p>Note: Regular expressions still cannot be run against column content within DAX syntax. I have an alternative approach to that <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part23/" target="_blank">in this article</a>.</p>
<h5 id="testing-schema">Testing Schema</h5>
<p>With the introduction of <a href="https://powerbi.microsoft.com/en-us/blog/dax-query-view-introduces-new-info-dax-functions/" target="_blank">INFO functions in DAX</a>, testing the schemas of your semantic model is finally that much easier. Schema testing is important because it helps you avoid two common problems: (1) broken visuals and (2) misaligned relationships.</p>
<p>Renaming columns and DAX measures can break visuals that expect those names to be spelled a certain way. This is especially troublesome if you have one dataset and multiple reports or report authors.</p>
<p>In addition, with a click of a button you can change a column from numeric to text. That may seem benign, but what if that column had a relationship with another table’s numeric column? You will have issues, and they are not easy to figure out (trust me, I wasted hours trying to resolve an issue only to realize this was the root problem).</p>
<p>So, to test schemas, you need to establish a baseline schema for each table. Luckily, I have a <a href="https://github.com/kerski/fabric-dataops-patterns/blob/main/Semantic%20Model/SampleModel.Dataset/DAXQueries/Schema%20Query%20Example.dax" target="_blank">template for that</a>. This DAX code will generate the schema for you once you enter the table name. Then you build the test (see Figure 5).</p>
<p>Subsequently, if this test fails, you know either you intended to change the schema and need to update the test, OR you did not intend the change to happen and need to fix your model.</p>
<p><img src="/assets/img/posts/part36/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - Example of running tests against your semantic model’s schema.</em></p>
<p>Now you may be asking which tests are intended for DEV and which are intended for TEST/PROD. The easy answer is that it depends on your data, but Table 2 is my rule of thumb.</p>
<p><em class="center-text-figure">Table 2 - Rule of Thumb of Types of Tests for each Workspace.</em></p>
<table>
<thead>
<tr>
<th style="text-align: left">Workspace</th>
<th style="text-align: left">DEV</th>
<th style="text-align: left">TEST</th>
<th style="text-align: left">PROD</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: left">Testing Calculations</td>
<td style="text-align: left">X</td>
<td style="text-align: left"> </td>
<td style="text-align: left"> </td>
</tr>
<tr>
<td style="text-align: left">Testing Content</td>
<td style="text-align: left">X</td>
<td style="text-align: left">X</td>
<td style="text-align: left">X</td>
</tr>
<tr>
<td style="text-align: left">Testing Schema</td>
<td style="text-align: left">X</td>
<td style="text-align: left">X</td>
<td style="text-align: left">X</td>
</tr>
</tbody>
</table>
<h3 id="examples">Examples</h3>
<p>Looking for a template of tests? Check out my <a href="https://github.com/kerski/fabric-dataops-patterns/blob/main/documentation/dax-query-view-testing-pattern.md" target="_blank">Fabric DataOps pattern repository on GitHub</a> for a sample model and sets of tests you can leverage in building your own. Also, don’t forget to leverage the <a href="https://youtu.be/YCs2_NLYlOc?si=fwvWQkui8veGzs5L&t=116" target="_blank">Power BI Performance Analyzer to copy DAX queries from visuals</a>. It helps you build test cases more quickly, avoid syntax errors, and understand DAX a little better (a win-win all around).</p>
<h3 id="whats-next">What’s Next?</h3>
<p>With this pattern in place, you can build tests right in your semantic model, have your teams run them within the comfort of Power BI Desktop, and avoid introducing errors for your customers. But John, you may ask, “doesn’t that mean I have to make sure my team and I run each test manually?” The answer is yes and no. Yes, when you want to validate changes before publishing, it is good practice to run the tests. But also no, because with the DAX Query View Testing Pattern we can now leverage another feature, PBIP, to embrace orchestration.</p>
<p>I’ll cover that in my next article.</p>
<p>As always, let me know what you think
on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter/X</a>.</p><h2 id="part-35--commenting-power-query-with-azure-openai">Part 35 – Commenting Power Query with Azure OpenAI</h2>
<p><strong><em>Leveraging Artificial Intelligence (AI) to reduce cycle times.</em></strong></p>
<p>Over the past several years, I've seen a lot of Power Query code. Whether it comes from a dataset a teammate built or one I inherited and now support, understanding Power Query code built by others can be challenging.</p>
<p>When reviewing unfamiliar Power Query code, you may encounter a list of applied steps that looks similar to <strong>Figure 1</strong>:</p>
<p><img src="/assets/img/posts/part35/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Example of Power Query code with default names that provide
little context.</em></p>
<p>What do you do next? Spend the time to click on each step to understand the code, right? However, this approach is time-consuming. For example, if you needed to update a custom column, you would need to slog through a series of manual steps: hunting and clicking through the applied steps to find the appropriate column, deciphering the code in the Advanced Editor, copying the code to a text editor, and, finally, searching it.</p>
<p>These process inefficiencies add up, leading to slower delivery to customers and poor maintenance. They also represent the antithesis of DataOps, which states, “<em>We should strive to minimize the time and effort to turn a customer need into an analytic idea, create it in development, release it as a repeatable production process, and finally
refactor and reuse that product.”</em></p>
<p>Therefore, to save time and incorporate DataOps principles, I typically ask my teams to emphasize two integral practices:</p>
<p>1) <strong>Add Comments</strong> - For steps involving merges, custom functions, custom columns, or significant complexity, add an explanatory comment before each step via the Advanced Editor. These comments provide context and details to reduce the need for interpretation. <strong>Figure 2</strong> provides an example of a comment in the Advanced Editor, and <strong>Figure 3</strong> demonstrates how that comment appears in the Applied Steps window.</p>
<p><img src="/assets/img/posts/part35/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Example of a comment added in the Advanced Editor.</em></p>
<p><img src="/assets/img/posts/part35/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Example of the comment in the Applied Steps panel.</em></p>
<p>2) <strong>Use Descriptive Step Names</strong> - Instead of labeling steps with ambiguous names such as “Added Custom Column” or “Renamed Columns8,” give the step a more descriptive name of roughly 25 to 50 characters. For example, if you created a custom column named “Fiscal Year,” you could rename the step to “Added Fiscal Year.”</p>
<h3 id="if-you-cant-enforce-it-have-artificial-intelligence-do-it">If You Can’t Enforce It, Have Artificial Intelligence Do It.</h3>
<p>As you may already know, enforcing these practices is difficult. Low-code practitioners and even pro-code developers have been encouraged to comment their work appropriately since the dawn of programming languages, but few routinely do. This is where AI comes in. Large Language Models can help apply these practices. With <a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/chatgpt-quickstart?tabs=command-line%2Cpython&pivots=programming-language-studio" target="_blank">ChatGPT available through Azure OpenAI</a>, I built a Power BI template that could point to a dataset in the Power BI service, parse the Power Query code, and offer a transformed version of each table. <strong>Figure 4</strong> provides a high-level overview of the data pipeline.</p>
<p><img src="/assets/img/posts/part35/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - High-level overview of the data pipeline using Azure OpenAI.</em></p>
<p><strong>Figure 5</strong> provides an example. I also have a <a href="https://app.powerbi.com/view?r=eyJrIjoiYTEwOGZiODQtNTAwNC00YTRjLTg2YTMtNDRmNWNhOWY3YzNiIiwidCI6ImU3MDRkMjE0LWI1YjUtNDc5OS1hZjk2LTYxZmEyNzMwYzI4OSIsImMiOjF9" target="_blank">public version of this report</a> that analyzed a version of a sample dataset (errr…semantic model) I’ve
used for demonstrations.</p>
<p><img src="/assets/img/posts/part35/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - Screenshot of Power Query Code transformed by ChatGPT 4.0 in Azure OpenAI’s Service.</em></p>
<p>With this transformed code available, you then could copy and replace the existing code AND <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part4/" target="_blank">test</a> accordingly.</p>
<h3 id="design">Design</h3>
<p>If you’re interested in how this works, the template depends on the following two components:</p>
<p>1) <strong>Dynamic Management View Queries</strong> - To pull semantic model information, I needed to run <a href="https://learn.microsoft.com/en-us/analysis-services/instances/use-dynamic-management-views-dmvs-to-monitor-analysis-services?view=asallproducts-allversions" target="_blank">dynamic management view</a> queries via <a href="https://learn.microsoft.com/en-us/power-bi/enterprise/service-premium-connect-tools" target="_blank">XMLA</a> to extract the Power Query code for the dataset. This required a Premium Per User workspace to house the semantic model.</p>
<p>2) <a href="https://learn.microsoft.com/en-us/legal/cognitive-services/openai/data-privacy" target="_blank"><strong>Azure OpenAI</strong></a> - This template depends on you having a subscription to Azure OpenAI available. I chose Azure OpenAI for the <a href="https://learn.microsoft.com/en-us/legal/cognitive-services/openai/data-privacy" target="_blank">terms of service</a> and the easier path to an authority to operate within my working environments (see <strong>Figure 6</strong>). To set up an Azure OpenAI endpoint, please <a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/create-resource?pivots=web-portal" target="_blank">see this article</a>.</p>
<p><img src="/assets/img/posts/part35/Figure6.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 6 - Microsoft’s Azure Open AI statement of <a href="https://learn.microsoft.com/en-us/legal/cognitive-services/openai/data-privacy" target="_blank">data, privacy, and
security</a> as of January 6^th^, 2024.</em></p>
<h3 id="execution">Execution</h3>
<p>With those two components in place, the template performs the following:</p>
<p>1) Asks you to identify the Azure OpenAI endpoint and key.</p>
<p>2) Asks you to identify the XMLA Endpoint and Dataset Name.</p>
<p>3) Uses the information provided to extract Power Query code from the
dataset for analysis.</p>
<p>4) Prompts ChatGPT 4.0 to analyze each step in the Power Query code and apply the two best practices.</p>
<p>5) Transforms the Power Query code in each table in the dataset based on ChatGPT responses.</p>
<h3 id="try-it-yourself">Try It Yourself</h3>
<p>If you can use Azure OpenAI, <a href="https://github.com/kerski/pbi-pq-commenter-with-azure-openai" target="_blank">give the template a try</a> and let me know what you think. As always, I look forward to hearing from you on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter</a>.</p>
<p><em>This article was edited by my colleague and senior technical
writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams
Garrett</a>.</em></p><h2 id="revisiting-fabric-dataops-principles-in-the-newest-microsoft-analytics-solution">Revisiting Fabric: DataOps Principles in the Newest Microsoft Analytics Solution</h2>
<p>When Microsoft unveiled the Public Preview of Fabric in May, I explored <a href="https://www.kerski.tech/how-dataops-is-woven-into-the-microsoft-fabric/" target="_blank">how DataOps was woven into the all-encompassing analytics solution</a>. In that article, I emphasized the <strong><em>enduring applicability of DataOps principles in the evolving landscape of Microsoft tools, including Fabric and Power BI.</em></strong> These principles are essential for anticipating and adapting to inevitable tooling changes.</p>
<p>Since then, Fabric has undergone significant changes and shifted to
<a href="https://blog.fabric.microsoft.com/en-us/blog/announcing-general-availability-explore-the-capabilities-of-real-time-analytics-in-microsoft-fabric?ft=All" target="_blank">General Availability</a> in the commercial sector. To assess the progress Microsoft has made in aligning Fabric with DataOps principles, I thought I would revisit the product through the lens of several DataOps principles: <strong><em>Make it Reproducible, Quality is Paramount, Monitor for Quality and Performance, Orchestrate,</em></strong> and <strong><em>Reduce Heroism.</em></strong> Please note that my thoughts on Fabric are only intended as constructive criticism. Additionally,
some features are still in Preview and are subject to change.</p>
<h3 id="make-it-reproducible">Make It Reproducible</h3>
<p><em>Make it reproducible: Reproducible results are required and
therefore <strong>we version</strong> everything: data, low-level hardware and
software configurations, and the code and configuration specific to each tool in the toolchain.</em></p>
<p>This principle emphasizes version control. As I explained in my <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part30/" target="_blank">DataOps 101 sessions</a>, version control is the first step to making analytic projects more successful. Fabric offers two features that promote reproducibility:</p>
<p><strong>1) <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-overview" target="_blank">The Power BI Desktop Project</a> (PBIP) File</strong></p>
<p>This <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-overview" target="_blank">newly introduced format</a> moves from the traditional binary file (PBIX) to a text-based format. This transition reduces the barrier to Git adoption, eliminating concerns about binaries containing confidential data in repositories. It also reduces reliance on the cumbersome <a href="https://git-lfs.com/" target="_blank">Git Large File Storage</a> feature to prevent repository bloat.</p>
<p>The PBIP file format boosts transparency, particularly in addressing the question, “What did you change in the Power BI report or dataset (now semantic model)?” However, there is still room for improvement. PBIP cannot currently save models in the Tabular Model Definition Language (TMDL) format, a feature that would streamline the process of comparing changes. Additionally, as illustrated in <strong>Figure 1</strong>, understanding the JSON files for the report side of PBIP is like deciphering code from the Matrix (I may be dating myself here). Despite these considerations, PBIP is a fantastic feature within the Fabric and Power BI ecosystem, promoting an accessible and collaborative analytics workflow.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Comparing changes in reports is still a challenging task.</em></p>
<p><strong>2) Git Integration</strong></p>
<p>Introducing <a href="https://learn.microsoft.com/en-us/fabric/cicd/git-integration/intro-to-git-integration#considerations-and-limitations" target="_blank">Git Integration</a> to Fabric workspaces lessens the learning curve for those unfamiliar with Git intricacies. <strong>Figure 2</strong> illustrates its user-friendly interface. Git Integration has expanded to support <a href="https://learn.microsoft.com/en-us/fabric/cicd/git-integration/intro-to-git-integration#supported-items" target="_blank">several artifacts</a>, including datasets, reports, and Notebooks. Hopefully, Microsoft will work toward addressing Dataflows as well. Until then, <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part33/" target="_blank">Part
33</a> of my series outlines the options for managing Gen1 dataflows.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Git Integration provides an easier interface for saving
changes to version control.</em></p>
<p><strong>3) OneDrive Integration with Power BI Desktop</strong></p>
<p>I did say there were <em>two</em> profound features for managing version
control, but it’s also worth acknowledging that Fabric integrates <a href="https://learn.microsoft.com/en-us/power-bi/create-reports/desktop-sharepoint-save-share" target="_blank">Power BI Desktop files with OneDrive</a>. This integration is especially beneficial for smaller teams or those without access to Azure DevOps. OneDrive’s version control feature enables rollbacks to previous timestamps, providing a “better than nothing” option and layer of quality assurance.</p>
<p>The recent changes to the Power BI Desktop, illustrated in <strong>Figure 3</strong>, and the syncing options for Power BI reports and <a href="https://learn.microsoft.com/en-us/power-bi/connect-data/service-datasets-rename" target="_blank">semantic models</a> indicate a step in the right direction. However, I will say that the syncing option still occasionally poses challenges for my team. Issues typically arise when the dataset sync doesn’t occur automatically, or when <a href="https://community.fabric.microsoft.com/t5/Service/OneDrive-sync-not-working-with-quot-thin-quot-report/m-p/3383465" target="_blank">a report suddenly stops
syncing</a> and there is no option to resync without creating a new file. This error
typically results in broken URLs. I hope Microsoft eliminates both
inconveniences soon.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - The latest version of Power BI Desktop offers an upgraded
OneDrive integration.</em></p>
<h3 id="quality-is-paramount">Quality is Paramount</h3>
<p><em>Analytic pipelines should be built with a foundation capable
of <strong>automated detection</strong> of abnormalities and security issues in code,
configuration, and data, and should provide continuous feedback to
operators for error avoidance.</em></p>
<p>Test, test, and test. This principle emphasizes the importance of
implementing testing regimes for semantic models, reports, dataflows, and other artifacts in Fabric. Questions regarding the correctness of columns’ formats (e.g., string, integer), their alignment with Regex expressions (e.g., email addresses, phone numbers), and the expected number of rows (e.g., a date dimension containing today’s date) should all undergo automated checks. This robust testing serves as a safety net, mitigating the risk of introducing errors into the production environment.</p>
<p>At the time of writing, there isn’t a <a href="https://www.kerski.tech/how-dataops-is-woven-into-the-microsoft-fabric/" target="_blank">native</a> feature in Power BI or Fabric designed for instilling testing. As a result, many users resort to developing bespoke solutions, including <a href="https://www.linkedin.com/feed/update/urn:li:activity:7064849591459835904/" target="_blank">Flávio Meneses’</a> and <a href="https://github.com/kerski/pbi-dataops-template/blob/part25/documentation/run-tests.md" target="_blank">my own</a> attempts. That said, Microsoft does provide the foundational elements for building repeatable testing frameworks, including:</p>
<p><strong>1) The Notebook Testing Framework</strong></p>
<p>Introducing <a href="https://learn.microsoft.com/en-us/fabric/data-engineering/how-to-use-notebook" target="_blank">Notebooks</a> to the Fabric toolbox brings Python and its extensive library support into the testing realm for data pipelines. For example, Great Expectations (one of my favorites) has incorporated new functions tailored to <a href="https://blog.fabric.microsoft.com/en-us/blog/semantic-link-data-validation-using-great-expectations?ft=All" target="_blank">support Fabric integration</a>. The new <a href="https://learn.microsoft.com/en-us/fabric/data-science/semantic-link-overview#power-bi-connectivity" target="_blank">Sempy Python library</a> also adds flexibility to test datasets and semantic models. Leveraging these capabilities, I’ve been exploring a concept, illustrated in <strong>Figure 4</strong>, to support a native testing framework within Fabric.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - Using Notebooks can establish a testing framework for data pipelines.</em></p>
<p>Notebooks function as the conduit for testing the outputs of each stage of a data pipeline, and they offer the capability to save the test results in storage. <strong>Figure 5</strong> provides an example of what I currently use to test <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part27/" target="_blank">my custom Power Query functions designed for retrieving
SharePoint data</a>.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - With Notebooks, we can test the results of data pipeline execution.</em></p>
<p><strong>2) Data Quality Framework</strong></p>
<p>One significant risk to any analytics project is compromised data
quality. While we don’t always have control over the quality of upstream data in source systems, we do bear the brunt of blame when it doesn’t look right in the reports. Consequently, it’s imperative to not only test the quality of the code we develop but to scrutinize the quality of the data itself and promptly alert stakeholders of any issues.</p>
<p><strong>Figure 6</strong> illustrates a concept I’ve been implementing in some of my projects. The outputs of each transformation (in this example, a dataflow) generate data and Data Quality Checks. For example, I may want to output rows from an upstream source with invalid column values crucial to downstream reports. These quality checks are then combined to create the foundation for a Data Quality dataset, which can produce a Data Quality dashboard. This approach maintains data quality while facilitating transparency and awareness among stakeholders.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure6.png" alt="Figure 6" class="center-image" /></p>
<p><em class="center-text-figure">Figure 6 - Data Quality Framework with Data Activator concept.</em></p>
<p><strong>3) DAX Query View Testing</strong></p>
<p>The November release of Power BI Desktop introduced the <a href="https://powerbi.microsoft.com/en-us/blog/power-bi-november-2023-feature-summary/#post-25061-_Toc150157949" target="_blank">DAX Query View</a>, which enables the pairing of DAX queries with datasets and semantic models. Saving DAX queries with the PBIP format stores them under a subfolder labeled DAXQueries (illustrated in <strong>Figure 7</strong>).</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure7.png" alt="Figure 7" class="center-image" /></p>
<p><em class="center-text-figure">Figure 7 - Save DAX-based tests with semantic models using DAX Query View.</em></p>
<p>This capability allows us to build DAX-based tests (illustrated in
<strong>Figure 8</strong>), a practice I have advocated for since <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part4/" target="_blank">Part 4</a> of my series. It also supports version control of our DAX-based tests.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure8.png" alt="Figure 8" class="center-image" /></p>
<p><em class="center-text-figure">Figure 8 - An example of a DAX-based test and the output.</em></p>
<p>With so many options for testing, the question is no longer whether it’s possible to test data pipelines but <em>how</em> to test them. The consultant’s answer will be, “<strong>It depends</strong>,” but deciding not to test is no longer a viable choice (well done, Microsoft). I’ll expand on testing in Fabric in future blog articles.</p>
<h3 id="monitor-for-quality-and-performance">Monitor for Quality and Performance</h3>
<p><em>Our goal is to have performance, security, and quality measures that are <strong>monitored continuously</strong> to detect unexpected variations and generate operational statistics.</em></p>
<p>This principle underscores the importance of treating data pipelines as a manufacturing line, emphasizing the need to check the status at every step in the process <strong><em>to catch errors before they reach the customer.</em></strong> While we’ve historically relied on closely monitoring emails for failures or using an
external <a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">solution</a>,
Microsoft is moving closer to providing out-of-the-box monitoring with the following features.</p>
<p><strong>1) Monitoring Hub</strong></p>
<p>A centralized station, the Monitoring Hub enables viewing and tracking activities across various products. For new Fabric artifacts such as Notebooks, it provides granular information on the health and operation specifics. However, the information is still quite high-level for datasets and dataflows. I’m hopeful for future enhancements, including the ability to create custom issues based on conditions.</p>
<p><img src="/assets/img/posts/fabric-nov-2023/Figure9.png" alt="Figure 9" class="center-image" /></p>
<p><em class="center-text-figure">Figure 9 - The Monitoring Hub is a good start for reviewing the health of data pipelines.</em></p>
<p>Additionally, I think the Monitoring Hub should allow workspace
administrators to see common issues that arise, such as:</p>
<ul>
<li>
<p>Datasets and Dataflows with disabled schedules</p>
</li>
<li>
<p>Datasets and Dataflows with conflicting schedules</p>
</li>
<li>
<p>Apps with pending access requests</p>
</li>
<li>
<p>Datasets and Dataflows with invalidated credentials</p>
</li>
<li>
<p>Inactive accounts for users who have not visited an app or workspace
in 30 days</p>
</li>
</ul>
<p><strong>2) <a href="https://learn.microsoft.com/en-us/fabric/data-activator/data-activator-introduction" target="_blank">Data Activator</a></strong></p>
<p>Currently in Public Preview, this feature enables the creation of
triggers based on data from a Power BI visual or Event Stream. To
effectively monitor quality and performance from a Power BI perspective, it’s essential to build a Data Quality Dashboard (similar to the one I offered in the previous section) and incorporate visuals that depict that quality. This approach adopts a low-code methodology, and I’m optimistic that it will empower individuals to conduct data quality and performance testing. However, it’s crucial to plan for Data Activator and architect a custom dashboard to support comprehensive monitoring.</p>
<h3 id="orchestrate">Orchestrate</h3>
<p><em>The beginning-to-end orchestration of data, tools, code, environments, and the analytic team’s work is a key driver of analytic success.</em></p>
<p>Automating orchestration through Continuous Integration and Continuous Deployment (CI/CD) is essential for delivering analytics to customers swiftly while mitigating the risks of errors. Microsoft lays the foundation for orchestration with the following features:</p>
<p><strong>1) Git Integration to Azure DevOps</strong></p>
<p>Git integration with Azure DevOps enables <a href="https://learn.microsoft.com/en-us/power-bi/developer/projects/projects-build-pipelines?WT.mc_id=DP-MVP-5004032" target="_blank">seamless pipeline kickoffs</a>, inspiring innovative ideas and the development of third-party tools to further enhance this experience.</p>
<p><strong>2) Deployment Pipelines</strong></p>
<p>Deployment Pipelines enable teams to reduce the overhead of the promotion process, decreasing cycle times. <a href="https://learn.microsoft.com/en-us/fabric/cicd/deployment-pipelines/understand-the-deployment-process#auto-binding" target="_blank">Auto-binding</a> and <a href="https://learn.microsoft.com/en-us/fabric/cicd/deployment-pipelines/create-rules" target="_blank">deployment rules</a> mitigate the risk of missing promotion steps that cause errors and needless support calls (e.g., data is pointing to test, and your customer notices).</p>
<p>As most Fabric artifacts (with a few <a href="https://learn.microsoft.com/en-us/fabric/cicd/deployment-pipelines/understand-the-deployment-process#supported-items" target="_blank">exceptions</a>) can now be promoted, there is a growing need to address the <a href="https://ideas.fabric.microsoft.com/ideas/idea/?ideaid=c348e92e-1eda-ed11-9139-281878ded556" target="_blank">lack of visibility into deployment rules for non-artifact owners</a>. I hope to see Microsoft offer an API endpoint to view all the deployment rules and help manage the <a href="https://docs.datakitchen.io/articles/#!dataops-concepts/parameterize-your-processing" target="_blank">parameterization of our orchestration efforts</a>, a critical concept in DataOps.</p>
<p>I would also like to see <a href="https://ideas.fabric.microsoft.com/ideas/idea/?ideaid=311d9061-bf8a-ee11-a81c-6045bdba0bac" target="_blank">Microsoft transform deployment rules into a standalone artifact</a> integrated with Git to support the concept of Infrastructure as Code (IaC), which <a href="https://learn.microsoft.com/en-us/devops/deliver/what-is-infrastructure-as-code" target="_blank">avoids manual configuration to enforce consistency</a>. For medium-to-large Fabric implementations, the number of rules to manage and review quickly becomes unwieldy. Furthermore, a lack of awareness of differences between workspaces, such as a rule that is not defined in production, poses a significant risk. Introducing deployment rules as a standalone artifact integrated with Git would go a long way toward mitigating these risks and providing a more streamlined process for rule management.</p>
<p><strong>3) Fabric API</strong></p>
<p>Similar to what the Power BI REST APIs offered for orchestration, the pending <a href="https://learn.microsoft.com/en-us/rest/api/fabric/" target="_blank">Fabric API endpoints</a> should provide extended orchestration capabilities in Azure pipelines. As Microsoft has been continuously releasing new endpoints, I’m waiting (impatiently) to see what Microsoft will offer in the coming months. One area of improvement is the lack of API capabilities for Apps. These capabilities, including <a href="https://ideas.fabric.microsoft.com/ideas/idea/?ideaid=f0062c0a-de8f-40d6-b12e-560ea7cce009" target="_blank">the ability to publish an App via the API</a>, are crucial to fully orchestrate the promotion process. Microsoft should consider providing API endpoints for any feature in General Availability, embracing the DataOps concept of Orchestration.</p>
<h3 id="reduce-heroism">Reduce Heroism</h3>
<p><em>As the pace and breadth of the need for analytic insights ever
increase, we believe analytic teams should strive to reduce heroism and create sustainable and scalable data analytic teams and processes.</em></p>
<p>In my own words, this means avoiding burnout for you and your team. The litany of changes coming with Microsoft Fabric and Power BI can be overwhelming, but <strong>you don’t need to know everything all at once—don’t be the hero</strong>. Fabric just went Generally Available for commercial customers; for customers in the Government Community Cloud (GCC), it is unavailable at the time of writing. If you haven’t taken advantage of it in the commercial sector, Fabric <a href="https://learn.microsoft.com/en-us/fabric/get-started/fabric-trial#start-the-fabric-preview-trial" target="_blank">has a free trial feature</a> for prototyping and experimentation.</p>
<p>If you lead a data analytics team, my advice is to invest time in
learning the fundamentals of Microsoft Fabric through <a href="https://learn.microsoft.com/en-us/training/paths/get-started-fabric/" target="_blank">free Microsoft
training</a>. Also, make sure to bookmark <a href="https://fabric.guru/" target="_blank">Sandeep Pawar’s blog</a> and <a href="https://www.kevinrchant.com/" target="_blank">Kevin Chant’s blog</a>, as both describe the Fabric ecosystem in-depth. Be sure to empower team members to experiment with certain features, share findings, and discuss ways to enhance productivity. For guidance on determining which features to investigate, review <a href="https://www.linkedin.com/in/kurtbuhler/" target="_blank">Kurt
Buhler</a>’s <a href="https://data-goblins.com/power-bi/fabric-announcements" target="_blank">beautiful
infographic</a>.</p>
<p>As Fabric evolves, these are exciting times for embracing DataOps and implementing its proven principles. I’d like to hear your thoughts, so please let me know what you think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter</a>.</p>
<p><em>This article was edited by my colleague and senior technical
writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams
Garrett</a>.</em></p><h2 id="bringing-quality-is-paramount-for-gen1-dataflows">Bringing “Quality is Paramount” for Gen1 Dataflows</h2>
<p>Continuing from <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part33/" target="_blank">Part 33</a>, where I introduced the polling method for implementing version control in Power BI dataflows, let’s talk about testing.</p>
<p>Now, if you are following a Medallion Architecture approach, you SHOULD be able to test both the inputs and outputs of your dataflows. This testing is critical to validating each step of the data journey. Testing serves as a safety net that keeps your teams from making regression errors, and it helps you identify issues before your customers do, both risks that teams should vehemently avoid. However, as of October 2023, Microsoft still doesn’t provide native data testing tools. As I described in <a href="https://www.kerski.tech/how-dataops-is-woven-into-the-microsoft-fabric/" target="_blank">How DataOps is woven into Microsoft Fabric</a>, these capabilities are maturing but not ready for production use while in preview.</p>
<p>Therefore, when <a href="https://www.linkedin.com/in/calvin-barker-61b079270/" target="_blank">Luke Barker</a> and I had only Gen 1 Dataflows available to our projects, we had to get creative. As seasoned data practitioners, we knew we needed to be able to:</p>
<p>1) <strong>Build tests that evaluate the schema of the dataflow outputs</strong> - It is very easy to remove a step in the Power Query editor that casts a column to a specific type (e.g., whole number, date), which can inadvertently introduce problems downstream. In addition, upon saving a dataflow, if Power BI encounters an ambiguous column (e.g., ABC123 columns), it casts that column to a string, removing any errors. This fun “feature” may magically remove rows of data. We needed to be able to inspect the schema of a dataflow table and make sure it matched our expectations.</p>
<p>2) <strong>Build tests that evaluate the content of the dataflow outputs</strong> - If you’re not using Bring Your Own Storage (BYOS) or can’t (looking at you, GCC), obtaining the contents of a dataflow table and inspecting for anomalies is not easy. The data is stored in CSV files behind some gated Microsoft service, so the only way to access that data is via a Power BI dataset.</p>
<p>3) <strong>Build and run tests locally</strong> - Good testing systems allow staff to run tests independently from one another and store the tests in version control. That way, it’s easy to share tests with teams, and tests can be developed and tested (i.e., testing the tests) locally. Ultimately, this capability allows teams to create a suite of tests and set up the safety net needed.</p>
<p>4) <strong>Keep it Simple</strong> – Microsoft doesn’t always make this easy, but we didn’t want to have to maintain lots of components or scripts just to run some tests. The overhead of testing should not impede the delivery of new features on our projects.</p>
<h3 id="the-unpivoted-testing-approach">The UnPivoted Testing Approach</h3>
<p>After some research, we settled on a Dataflow Testing Dataset concept that would sit in the same workspace as the dataflow. This dataset would exist for each team member who needed to write and execute tests. Most importantly, this dataset could connect to any dataflow and pull the contents for testing.</p>
<p>How do you do that? Well, I’m glad you asked. It works this way:</p>
<p><strong>The Parameters</strong> - The dataset has a parameter for the Workspace (Workspace_ID) and Dataflow (Dataflow_ID) so it knows which dataflow to import. We also needed a randomly generated GUID to represent each instance of test execution (Run_ID). This allows us to track when a suite of tests is executed and record the results.</p>
<p><strong>The Tables</strong> - The dataset has two tables. The first and largest is called <em>DFTest</em>. It functions as an unpivoted representation of each table in the dataflow. <strong>Figure 1</strong> illustrates the approach. This technique avoids running into schema refresh errors in a dataset when switching between dataflows and standardizes the format. Yes, it does drastically increase the number of rows, but if I’m testing more than 10 million rows of data in development, I should be parameterizing my dataflows to keep the size more manageable.</p>
<p><img src="/assets/img/posts/part34/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Switching between dataflow sources dynamically allows us to have one testing dataset.</em></p>
<p>The second table is <em>RowCount</em>, which contains the total row count for each table in the dataflow. If you have ever saved a dataflow and forgotten to refresh it, then you know you could easily have zero rows of data and not know why. This table makes it easy to test for that oversight.</p>
<h3 id="local-testing">Local Testing</h3>
<p>With the dataset ready to serve our testing needs, we bring in our <a href="https://powershellexplained.com/2017-03-17-Powershell-Gherkin-specification-validation/" target="_blank">good ol’ friend Pester 4 and build Gherkin tests</a>. These tests are housed in the same repository that tracks dataflow changes via the <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part33/" target="_blank">polling method</a>. Gherkin is an implementation of the <a href="https://www.agilealliance.org/glossary/bdd/" target="_blank">Behavior Driven Development</a> approach that emphasizes building tests in plain language and executing those tests with scripts. That way, you have a shared syntax and semantics for describing both the test setup and the tests themselves. Each test suite is described in a feature file with a Background section that validates the environment and a Scenario section that executes the tests.</p>
<p><img src="/assets/img/posts/part34/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Example Background Section of a Gherkin test.</em></p>
<p>In more detail, the Background section executes a shared script (Test-Support.ps1), which interfaces with the Dataflow Testing Dataset as illustrated in <strong>Figure 3</strong>. The script works through the test line by line, one step for each sentence:</p>
<h4 id="given-that-we-have-access-to-the-dftest-file-in-the-workspace-workspace"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#given-that-we-have-access-to-the-dftest-file-in-the-workspace-workspace">Given “that we have access to the DFTest file in the Workspace: ‘{Workspace}’”</a></h4>
<p>This test verifies that the workspace is accessible, and the appropriate testing dataset exists in the workspace.</p>
<h4 id="and-we-have-access-to-the-dataflow-dataflowname"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-we-have-access-to-the-dataflow-dataflowname">And “we have access to the Dataflow: ‘DataflowName’”</a></h4>
<p>This test verifies that the dataflow exists and extracts the contents of the dataflow (as JSON).</p>
<h4 id="and-we-have-the-table-called-table"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-we-have-the-table-called-table">And “we have the table called ‘Table’”</a></h4>
<p>This test verifies that the table exists in the dataflow (and wasn’t renamed for some reason).</p>
<h4 id="and-we-can-setup-the-table-for-testing"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-we-can-setup-the-table-for-testing">And “we can setup the table for testing”</a></h4>
<p>This is the most critical step in testing because this test:</p>
<ul>
<li>Updates the parameters of the testing dataset to point to the appropriate workspace and dataflow.</li>
<li>Issues a synchronous dataset refresh and verifies that the refresh succeeds. This refresh request forces the dataset to pick up the new parameters (a sketch of this step appears after Figure 3).</li>
</ul>
<p><img src="/assets/img/posts/part34/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Testing process illustrated.</em></p>
<p>With the background validated for testing, we can then execute simple schema and content tests. The following are the ones I use most often:</p>
<h4 id="schema-tests"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#schema-tests">Schema Tests</a></h4>
<h5 id="then-it-should-contain-or-match-the-schema-defined-as-follows"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#then-it-should-contain-or-match-the-schema-defined-as-follows">Then “it should {Contain or Match} the schema defined as follows:”</a></h5>
<p>This test accepts a table of information with the columns Name, Type, and Format, such as:</p>
<table>
<thead>
<tr>
<th>Name</th>
<th>Type</th>
<th>Format</th>
</tr>
</thead>
<tbody>
<tr>
<td>Alignment ID</td>
<td>int64</td>
<td>0</td>
</tr>
</tbody>
</table>
<ul>
<li>
<p>Name: This is the column name.</p>
</li>
<li>
<p>Type: This is the column type.</p>
</li>
<li>
<p>Format: This is the column format. You can leave this blank if the format does not need to be tested.</p>
</li>
</ul>
<p>This test accepts a parameter {Contain or Match}. If the parameter is “Contain,” the test makes sure each listed column exists and matches the type and format. If the parameter is “Match,” the test ensures the table has exactly the columns defined in the test, each matching the type and format. The “Match” value is strict: no columns may exist in the dataset beyond those defined in the feature file.</p>
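<p>In Pester terms, the heart of that assertion might look like the sketch below. The $ExpectedSchema collection comes from the Gherkin table, while $ActualSchema is assumed to be derived from the testing dataset; the property and variable names are illustrative rather than the repo’s actual step code.</p>

<pre><code># Hedged sketch of the {Contain or Match} schema check (Pester 4 syntax).
foreach ($column in $ExpectedSchema) {
    $actual = $ActualSchema | Where-Object { $_.Name -eq $column.Name }
    $actual | Should -Not -BeNullOrEmpty       # the column exists
    $actual.Type | Should -Be $column.Type     # the type matches
    if ($column.Format) {                      # Format is optional in the table
        $actual.Format | Should -Be $column.Format
    }
}
if ($ComparisonType -eq "Match") {
    # Strict mode: fail if the dataflow has columns the feature file doesn't.
    $extra = $ActualSchema | Where-Object { $_.Name -notin $ExpectedSchema.Name }
    $extra | Should -BeNullOrEmpty
}
</code></pre>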
<h4 id="content-tests"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#content-tests">Content Tests</a></h4>
<h5 id="and-the-values-of-columnname-matches-this-regex-regex"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-the-values-of-columnname-matches-this-regex-regex">And ‘the values of “{ColumnName}” matches this regex: “{Regex}”’</a></h5>
<p>This test accepts the {ColumnName} parameter and {Regex} parameter. It verifies that the values in the column match the regular expression, which follows the <a href="https://learn.microsoft.com/en-us/dotnet/standard/base-types/regular-expressions">.NET Regular Expressions format</a>.</p>
<h5 id="and-the-values-in-columnname-are-unique"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-the-values-in-columnname-are-unique">And “the values in ‘{ColumnName}’ are unique’”</a></h5>
<p>This test accepts the {ColumnName} parameter and validates that are values in that column are unique.</p>
<h5 id="and-there-should-be-comparison-than-count-records-returned"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-there-should-be-comparison-than-count-records-returned">And “there should be {Comparison} than {Count} records returned”</a></h5>
<p>This test accepts the {Comparison} parameter and {Count} parameter. The {Comparison} parameter can be the following values:</p>
<ul>
<li>exactly</li>
<li>less than</li>
<li>less than or equal to</li>
<li>greater than</li>
<li>greater than or equal to</li>
</ul>
<p>The {Count} parameter should be a number.</p>
<p>This test makes sure the number of records in the table meets expectations. This is a good test to monitor for empty tables and residual test filters.</p>
<h5 id="and-all-tests-should-pass-for-the-dax-query-test-file"><a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/Documentation/run-tests.md#and-all-tests-should-pass-for-the-dax-query-test-file">And “all tests should pass for the DAX query: {Test File}”</a></h5>
<p>This test accepts the {Test File} parameter. The {Test File} parameter is the name of the Data Analysis Expressions (DAX) file in the dataflows
folder.</p>
<p>This test executes the DAX query against the test dataset and inspects the test results returned from the DAX Query.</p>
<p>The DAX query needs to output the following schema:</p>
<table>
<thead>
<tr>
<th><strong>Test</strong></th>
<th><strong>Expected Value</strong></th>
<th><strong>Actual Value</strong></th>
<th><strong>Passed</strong></th>
</tr>
</thead>
<tbody>
<tr>
<td>Text describing the test.</td>
<td>The expected value in the appropriate format (e.g., number, boolean)</td>
<td>The actual value of the DAX calculation.</td>
<td>A boolean that is true if the test passed. Otherwise, the value is false.</td>
</tr>
</tbody>
</table>
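<p>For reference, a DAX query that satisfies this schema could look like the example below. The <em>DFTest</em> table and its Value column are assumptions based on the testing dataset described earlier; substitute your own tables and assertions.</p>

<pre><code>// Illustrative DAX test: two assertions, each emitting the
// Test / Expected Value / Actual Value / Passed columns.
EVALUATE
VAR _rowCount = COUNTROWS ( 'DFTest' )
VAR _blankValues = COUNTROWS ( FILTER ( 'DFTest', ISBLANK ( 'DFTest'[Value] ) ) ) + 0
RETURN
    UNION (
        ROW (
            "Test", "DFTest contains rows",
            "Expected Value", TRUE (),
            "Actual Value", _rowCount > 0,
            "Passed", ( _rowCount > 0 ) = TRUE ()
        ),
        ROW (
            "Test", "No blank values in the Value column",
            "Expected Value", TRUE (),
            "Actual Value", _blankValues = 0,
            "Passed", ( _blankValues = 0 ) = TRUE ()
        )
    )
</code></pre>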
<h3 id="generating-tests">Generating Tests</h3>
<p>Now, to keep it simple, two PowerShell scripts aid in testing. The first script, Generate-DFTests.ps1, builds a feature file based on a template for testing a dataflow (see <strong>Figure 4</strong>).</p>
<p><img src="/assets/img/posts/part34/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - Example of generating a test.</em></p>
<p>The second script, Run-DFtests.ps1, runs the tests based on which dataflow you wish to test and displays the results (see <strong>Figure 5</strong>).</p>
<p><img src="/assets/img/posts/part34/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - Example of running the tests for dataflows.</em></p>
<p>In the end, the folder structure for our dataflow versions and tests follows the pattern in <strong>Figure 6</strong>. Each dataflow gets its own folder, and tests are kept in a sub-folder called CI/{Workspace ID}. {Workspace ID} is the Globally Unique Identifier (GUID) representing the Power BI workspace that stores the dataflow.</p>
<p><img src="/assets/img/posts/part34/Figure6.png" alt="Figure 6" class="center-image" /></p>
<p><em class="center-text-figure">Figure 6 - Pattern for storing versions of Power BI dataflows and tests.</em></p>
<p>All right, that seems like a lot to go over, but once you get past the initial setup, maintaining and executing the tests is fairly quick. In fact, thanks to ChatGPT, generating the regular expressions that test the contents of a column is much easier than it was two years ago.</p>
<p>This method has extended my teams’ safety net way past <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part23/" target="_blank">testing datasets</a>. Now, we can test dataflows when Bring Your Own Storage and/or Fabric is not an option.</p>
<h3 id="try-it-for-yourself">Try It For Yourself</h3>
<p>To try this approach, please visit the <a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method" target="_blank">template GitHub repository</a> for an installation script that creates a Power BI workspace and an Azure DevOps project for you. Then, all you need to do is follow the instructions for uploading the Dataflow Testing Dataset to the workspace and get to testing!</p>
<p>If you’ll be at <a href="https://365educon.com/Chicago/">EduCon 365 Chicago</a>, please stop by my session on November 2nd, where I go in-depth on dataflow version control and testing.</p>
<p>Can you guess what my next article will be on? As always, let me know what you think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter (having a hard time calling it X)</a></p>
<p><i>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</i></p>

<p><i><a href="https://www.kerski.tech//bringing-dataops-to-power-bi-part33" target="_blank">Part 33: Bringing DataOps to Power BI</a>, published 2023-10-10, by John Kerski</i></p>
<h2 id="make-it-reproducible-for-gen1-dataflows">Make It Reproducible… for Gen1 Dataflows</h2>
<p>While I look forward to all the new capabilities <a href="https://microsoftlearning.github.io/mslearn-fabric/Instructions/Labs/05-dataflows-gen2.html" target="_blank">Gen2 dataflows</a> offer, it may be a while before they are available to the wider audiences I work with. It took nearly two years for Gen1 dataflows to come to the Government Community Cloud (GCC) if my aging memory serves me right. Furthermore, even if the pricing becomes cost-competitive, not every client will be prepared to factor the cost of purchasing Fabric into their current solutions.</p>
<p>Fortunately, Gen 1 dataflows still represent a viable production option, particularly for those interested in implementing a <a href="https://www.databricks.com/glossary/medallion-architecture" target="_blank">Medallion architecture</a> within the Power BI Service. Roughly a year ago, I offered a Power BI-based solution for the <a href="https://www.kerski.tech/bringing-dataops-to-power-bi-part22/" target="_blank">version control, testing, and orchestration of Gen1 dataflows</a> using the <a href="https://learn.microsoft.com/en-us/power-bi/transform-model/dataflows/dataflows-azure-data-lake-storage-integration" target="_blank">Bring Your Own Storage</a> (BYOS) feature. However, BYOS is not always an option (GCC users definitely don’t have this option as of Fall 2023).</p>
<p>To improve cycle times and reduce errors in environments where the BYOS solution was not feasible, I explored some options. Thankfully, I had a version control tool in Azure DevOps, so I got to work developing a strategy for backing up the dataflows my teams were working on.</p>
<h2 id="the-polling-method">The Polling Method</h2>
<p>Without BYOS, I knew that capturing every single change to the dataflow was impossible. However, I could poll for changes at regular intervals and commit those changes to an <a href="https://learn.microsoft.com/en-us/azure/devops/repos/get-started/what-is-repos?view=azure-devops" target="_blank">Azure Repo</a> (see Figure 1). That way, if I needed to recover a past version or a deleted dataflow, my teams would not experience significant setbacks.</p>
<p><img src="/assets/img/posts/part33/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Illustrates the high-level polling process and components used to backup dataflows to Git.</em>
<br />
This polling process is driven by two YAML files:</p>
<p>1) <strong>Dataflow-Polling-and-Backup-Schedule-Dev</strong> - This file runs on a scheduled interval and kicks off the second YAML file. The setup will prevent an infinite loop from occurring in future iterations of this solution when we want a branch update to trigger a continuous integration (CI) process (hint: it involves my favorite topic).</p>
<p>2) <strong>Dataflow-Polling-and-Backup</strong> - This file runs the Start-DataflowBackup.ps1 process, which includes:</p>
<ul>
<li>
<p>Loading and installing the appropriate PowerShell modules.</p>
</li>
<li>
<p>Logging into the Power BI Service using an account (see security notes).</p>
</li>
<li>
<p>Retrieving the list of Power BI dataflows.</p>
</li>
<li>
<p>Checking if the Azure Repo (I use the terms repo and repository synonymously with Azure Repo) has the dataflow. If not, it adds the dataflow to the repository. If the dataflow does exist, the script compares the “modifiedTime” properties. If they do not match, we commit a new version to the repository (a sketch of this loop follows Figure 2).</p>
</li>
</ul>
<p><img src="/assets/img/posts/part33/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Illustrates how two YAML files implement the Polling Method.</em></p>
<h3 id="security-notes">Security Notes</h3>
<p>The polling method is dependent on two major security components:</p>
<p>1) A Premium Per User (PPU) account that can log in to the Power BI service and access the workspace housing the dataflows.</p>
<p>2) A Personal Access Token (PAT) that can access the repository storing the dataflows.</p>
<h3 id="restoring-a-dataflow">Restoring a dataflow</h3>
<p>To revert to a prior dataflow, download the JSON file from the repository and import it into your Power BI workspace. If the dataflow already exists, Power BI appends a number to the name of the imported copy. You’ll have to delete the old one and relink any dependencies…a pain, but at least you have a version to restore. :)</p>
<h2 id="try-it-for-yourself">Try It for Yourself</h2>
<p>To illustrate the polling method in action, I’ve provided <a href="https://github.com/kerski/pbi-dataops-dataflows-polling-method/blob/main/README.md" target="_blank">an installation script on GitHub</a>. This script simplifies the setup by creating a Power BI workspace and an Azure DevOps project while configuring the appropriate settings to facilitate the process.</p>
<p>If you are going to <a href="https://www.summitna.com/" target="_blank">Community Summit North America</a>, stop by my session on October 18th, where I will go in-depth on dataflow version control and testing. For even more dataflow content, stay tuned for my next article, where I will cover testing dataflows using a technique <a href="https://www.linkedin.com/in/calvin-barker-61b079270/" target="_blank">Luke Barker</a> and I came up with.</p>
<p><i>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</i></p>

<p><i><a href="https://www.kerski.tech//bringing-dataops-to-power-bi-part32" target="_blank">Part 32: Bringing DataOps to Power BI</a>, published 2023-09-12, by John Kerski</i></p>
<h2 id="the-data-journey-dont-assume-what-worked-last-week-will-work-today">The Data Journey: Don’t Assume What Worked Last Week Will Work Today</h2>
<p>Microsoft technologies and application programming interfaces (APIs) are rapidly evolving, introducing new beneficial features and improvements. However, changes to Power BI or Graph APIs can also affect backend data operations, including the operation of custom connectors like the ones I’ve authored for the <a href="https://github.com/kerski/powerquery-connector-pbi-rest-api-commercial" target="_blank">Power BI REST API</a> and <a href="https://github.com/kerski/powerquery-connector-ms-planner-rest-api" target="_blank">Planner API</a>. Over the past year, I’ve seen production systems rely on custom connectors, so we can’t assume what worked before a Microsoft update will work after.</p>
<p>The principles of DataOps emphasize the importance of evaluating the
data’s journey through the assembly line to identify potential points of
failure. A proactive approach ensures that you can respond quickly to
changes in the broader IT ecosystem without disrupting operations for
users and customers. With many of my production environments using
custom connectors, I wanted to set up continuous testing to identify
potential failures. But there were a few challenges:</p>
<ul>
<li>
<p><strong>Decoupling test environment from testing</strong> - Many connectors require specific details about my tenant and workspaces that I would not want to publish in a public GitHub repository. To ensure the custom connector code and testing code were accessible to external users while safeguarding sensitive data, I needed a way to easily separate the test configurations and remove the globally unique identifiers (GUIDs) only available through my tenant.</p>
</li>
<li>
<p><strong>Automating Authentication</strong> - Several of the custom connectors I wrote, both independently and in collaboration with <a href="https://www.linkedin.com/in/calvin-barker-61b079270/" target="_blank">Luke Barker</a>, use Azure Active Directory (AD) authorization. To facilitate testing, I needed to use a service account. However, the <a href="https://marketplace.visualstudio.com/items?itemName=PowerQuery.vscode-powerquery-sdk" target="_blank">Power Query Software Development Kit (SDK) for Visual Studio (VS) Code</a> semi-obfuscated this process. The SDK stored credentials in a way that didn’t expose each step in the terminal (probably for good security reasons).</p>
</li>
<li>
<p><strong>Deploying</strong> – In cases where the custom connector passed
automated tests, I needed to automatically remove any testing
variables associated with my tenant. This measure would safeguard
variables (e.g., passwords, secrets, tenant IDs) from being posted
to GitHub.</p>
</li>
<li>
<p><strong>Standardizing Testing</strong> - I also wanted to ensure I could run the same tests locally with a simple command that incorporated my test configuration, and have the build pipeline run the automated tests with that same command. That way, whether I ran a test locally, the build pipeline ran it, or another contributor ran it in their tenant, the execution steps would be the same.</p>
</li>
</ul>
<h3 id="solution">Solution</h3>
<p>After reading the Microsoft documentation and <a href="https://bengribaudo.com/blog/2022/10/24/7012/highlights-from-the-new-power-query-sdk" target="_blank">Ben Gribaudo’s article on the Power Query SDK</a>, I started experimenting with each one of these challenges. As shown in <strong>Figure 1</strong>, I ended up having two repositories:</p>
<p>1) An “internal” repo on Azure DevOps (it could have been a private
repo on GitHub, but it’s hard to break habits) to store the code along
with my test configuration file.</p>
<p>2) A “public” repo to keep the code open-sourced and releases
publicly available. This allowed me to maintain my test configuration in
source control while excluding this component during the deployment
pipeline in Azure DevOps. With the help of <a href="https://learn.microsoft.com/en-us/azure/devops/repos/security/github-advanced-security-secret-scanning?view=azure-devops" target="_blank">Secret
Scanning</a> in Azure DevOps, it also provided an extra layer of protection against
accidentally committing sensitive secrets or passwords to the public
source control.</p>
<p><img src="/assets/img/posts/part32/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Design for testing, deploying, and sharing custom power query
connectors.</em></p>
<p>Within this high-level design, here is how I tackled each challenge:</p>
<ul>
<li><strong>Decoupling the test environment from testing</strong> – Leveraging the
<i>PQTest.exe test command</i>, you can provide an environment
configuration file as a JSON file with properties and values. I just
needed to tweak the testing file (thanks to <a href="https://learn.microsoft.com/en-us/power-query/handling-unit-testing" target="_blank">the template provided
by
Microsoft</a>)
to reference these properties instead of a hard-coded value
(<strong>Figure 2</strong>; an illustrative configuration file follows the figure). This setup enables others to build their test
configuration and run the tests in their own environment.</li>
</ul>
<p><img src="/assets/img/posts/part32/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - Separate JSON file stores test variables and the feed the
testing scripts.</em></p>
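<p>As an entirely illustrative example, a test configuration like variables.test.json might hold values such as the following; the property names are placeholders rather than the repository’s actual schema:</p>

<pre><code>{
  "TenantId": "00000000-0000-0000-0000-000000000000",
  "WorkspaceId": "00000000-0000-0000-0000-000000000000",
  "TestWorkspaceName": "CI - Connector Tests",
  "ServiceAccountUPN": "svc-pq-tests@contoso.onmicrosoft.com"
}
</code></pre>

<p>Each test file then reads these properties at run time, so contributors only need to supply their own file to run the suite against their own tenant.</p>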
<ul>
<li><strong>Automating Authentication</strong> - This was, by far, the most
challenging aspect and the least documented. It turns out that by
using the <i>generate-credential command</i> and providing your compiled
connector and test file, you can obtain the appropriate JSON file.
Then you need to update the JSON file and add it as an argument when
executing the test command. However, when you attempt this
process with a test file that uses the environment configuration, the
generate-credential command does not like it and reports an error. As a
workaround, I provided it with a basic test file that simply
ensures the custom connector loads successfully. This approach
yielded a template file. After some trial and error, I figured out
how to replace the template values with the credentials in the format
required to make the incantation work (<strong>Figure 3</strong>).</li>
</ul>
<p><img src="/assets/img/posts/part32/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Example of how to generate, change, and set the credentials
with PQTest.exe.</em></p>
<ul>
<li><strong>Deployments</strong> – Thanks to Azure DevOps release pipelines and a
few Stack Overflow searches (ChatGPT was not too helpful this time),
I created a two-step release process (sketched after Figure 4). As shown in <strong>Figure 4</strong>,
it first removes the variables.test.json file so my personal test
settings for my tenant do not go public. Second, it pushes the code
to the main branch of the public repository using git command lines.
Note that the git push contains a <em>%gt%</em> reference. That is
the <a href="https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens" target="_blank">Personal Access
Token</a> I had to set up in GitHub to authenticate with the public
repository.</li>
</ul>
<p><img src="/assets/img/posts/part32/Figure4.png" alt="Figure 4" class="center-image" /></p>
<p><em class="center-text-figure">Figure 4 - Steps to release from the internal repo to the public repo.</em></p>
<ul>
<li><strong>Standardize Testing</strong> - To make sure the tests I run locally would
operate similarly, I built a PowerShell script
<a href="https://github.com/kerski/powerquery-connector-pbi-rest-api-commercial/blob/main/CI/Scripts/Run-PQTests.ps1" target="_blank">Run-PQTests.ps1</a>
that performs the following:</li>
</ul>
<ol style="margin-left:20px">
<li>Identify (through an environment variable) whether the script is running locally or running in a build agent</li>
<li>Connect to the Power BI Service</li>
<li>Compile the custom connector with MakePQX.exe</li>
<li>Generate the Credential template with PQTest.exe</li>
<li>Set the Credential with PQTest.exe</li>
<li>Run the Tests with PQTest.exe</li>
<li>Check the results and fail the build if the tests fail</li>
</ol>
<p><strong>Figure 5</strong> demonstrates running the script locally.</p>
<p><img src="/assets/img/posts/part32/Figure5.png" alt="Figure 5" class="center-image" /></p>
<p><em class="center-text-figure">Figure 5 - Example of running tests locally.</em></p>
<p>If you’re interested in diving further into the code, please check out
my repository for the <a href="https://github.com/kerski/powerquery-connector-pbi-rest-api-commercial/tree/main" target="_blank">Power Query Custom Data Connector for Power BI
REST APIs
(Commercial)</a>
and explore the “CI” subfolder to see the scripts, folder structure, and
YAML files. Over the next year, I’ll incorporate more continuous
integration and continuous deployment (CI/CD) into custom connectors I
build, and I hope this helps you on that journey too. <strong>Don’t assume the
custom connectors will continue to work, especially if they depend on
APIs—automate testing.</strong></p>
<p>I’d like to hear your thoughts, so please let me know what you think
on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter</a>.</p>
<p>Finally, if you’re going to be at the <a href="https://www.eventbrite.com/e/sqlsaturday-denver-2023-tickets-566987283227" target="_blank">SQL Saturday
Denver</a>,
September 23rd, 2023, I’ll be presenting on custom connectors. I hope
to see you there!</p>
<p><i>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</i></p>

<p><i><a href="https://www.kerski.tech//bringing-dataops-to-power-bi-part31" target="_blank">Part 31: Bringing DataOps to Power BI</a>, published 2023-08-01, by John Kerski</i></p>
<h2 id="dataops-principle-14---analytics-is-manufacturing">DataOps Principle #14 - Analytics is manufacturing</h2>
<p>Exciting developments are underway for Microsoft users, with the recent
announcements of <a href="https://powerbi.microsoft.com/en-us/blog/deep-dive-into-power-bi-desktop-developer-mode-preview/" target="_blank">Power BI Desktop Developer
Mode</a> in public preview and Fabric’s
<a href="https://www.youtube.com/watch?t=2453&v=wdDx0-jvl7w&feature=youtu.be" target="_blank">roadmap</a>
for Git repository integration within Azure DevOps. These advancements
empower developers with increased flexibility in managing their
dataflows, datasets, notebooks, and other workspace artifacts directly
within the Fabric workspace.</p>
<p>As I’ve discussed in past articles (<a href="https://blog.kerski.tech/bringing-dataops-to-power-bi-part22/" target="_blank">Part
22</a> and
<a href="https://blog.kerski.tech/bringing-dataops-to-power-bi-part25/" target="_blank">Part
25</a>),
integrating source code with Azure DevOps yields significant benefits,
including the ability to trigger pipelines for evaluating, testing, and
deploying code to Power BI workspaces. This integration streamlines
development and deployment processes, enhancing collaboration and
version control for Power BI projects.</p>
<p>However, as Git integration moves into general availability and more
developers begin using <a href="https://learn.microsoft.com/en-us/azure/devops/pipelines/get-started/what-is-azure-pipelines?view=azure-devops" target="_blank">Azure DevOps
pipelines</a>,
monitoring becomes a greater priority.</p>
<p>In DataOps, Analytics <strong><em>is</em></strong> Manufacturing:</p>
<p><em>Analytic pipelines are analogous to lean manufacturing lines. We
believe a fundamental concept of DataOps is a focus on process-thinking
aimed at achieving continuous efficiencies in the manufacture of
analytic insight.</em></p>
<p>When applied to Power BI projects, this means that whether you are
building dataflows to transform data and load it into a data lake or
automating the deployment of a new dataset to a Power BI workspace with
Azure DevOps, each pipeline is analogous to an assembly line that is
creating a product for a customer. Therefore, if we are not actively
monitoring our Azure DevOps pipelines, we introduce a weakness in our
approach for delivering analytic products.</p>
<p>Many things can cause an Azure DevOps pipeline to fail. Here are just a
few issues my teams watch for:</p>
<p>1) <strong>Build Agent Failures</strong>. A build agent is an operating system
specifically configured to run your pipelines. If the configuration
is broken, the build agent can’t run or execute the necessary
pipeline steps.</p>
<p>2) <strong>Step Failures.</strong> Pipelines allow you to run custom PowerShell
code, use secret variables (e.g., passwords), add
<a href="https://learn.microsoft.com/en-us/azure/devops/extend/overview?view=azure-devops" target="_blank">extensions</a>,
and more. A bug in code, an expired password, or a misconfigured
extension could lead to the failure of the pipeline build.</p>
<p>3) <strong>Network Failures.</strong> Pipelines need to communicate with services
like Azure Active Directory (is that called Entra now?) and Power
BI. If the network connection fails or becomes degraded, it may
cause your pipelines to fail.</p>
<p>4) <strong>Schedule Failures</strong>. You can schedule builds to run regularly, as
<a href="https://learn.microsoft.com/en-us/azure/devops/pipelines/yaml-schema/schedules?view=azure-pipelines" target="_blank">defined in
YAML</a>.
However, it has been my experience that Azure DevOps conveniently forgets
that schedule (it’s not a human, but I will anthropomorphize as I
wish). When pipelines don’t run as scheduled, you don’t get a
notification about the failure. A missed schedule failure can create
confusion and frustration for your teams and customers.</p>
<p>These issues can disrupt the analytics manufacturing line and impede the
testing and deployment of Power BI artifacts. So, how do we monitor
them?</p>
<h2 id="the-problem-with-monitoring-azure-devops-pipelines">The Problem with Monitoring Azure DevOps Pipelines</h2>
<p><strong><em>Alright, well, Azure DevOps is a Microsoft product. Surely, they have
a native connector to get pipeline data.</em></strong></p>
<p>If you search for Azure DevOps in the <strong>Get</strong> <strong>Data</strong> Power BI
navigator pane, you’ll see you can only import <a href="https://learn.microsoft.com/en-us/azure/devops/boards/get-started/what-is-azure-boards?view=azure-devops" target="_blank">Azure Board
data</a>.
Only… board… data… oh man.</p>
<p><img src="/assets/img/posts/part31/Figure1.png" alt="Figure 1" class="center-image" />
<i class="center-text-figure">Figure 1 - The native options for importing Azure DevOps data… Boards
only</i></p>
<h2 id="a-potential-solution">A Potential Solution</h2>
<p>Faced with the lack of a connector, I broke out the trusty Web.Contents function and began building Power Query code to query the <a href="https://learn.microsoft.com/en-us/rest/api/azure/devops/pipelines/" target="_blank">pipeline application programming interfaces (APIs)</a> in a Gen1 dataflow and integrate it with the existing <a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">pbi-dataops-monitoring template</a>. Why a Gen1
dataflow? Well, I want to be prepared to monitor these issues when
Fabric becomes generally available. When it does, our projects with
Fabric can easily upgrade to Gen2 Dataflows and export the results to a
data lake. Leveraging <a href="https://fabric.guru/power-bi-direct-lake-mode-frequently-asked-questions" target="_blank">Direct Lake
datasets</a>
will enable us to quickly identify and address issues. At the same time,
using a Gen1 Dataflow ensures I can continue monitoring in Pro or
Premium Per User (PPU) environments, providing the best of both worlds.</p>
<h2 id="implementation">Implementation</h2>
<p>Figure 2 provides a screenshot of the queries found within the Gen1
Dataflow, demonstrating the components of its structure:</p>
<div class="container part31">
<div class="column part31 first-column">
<h3>Parameter</h3>
<ul>
<li><strong>PipelineIssueID & FailedScheduledPipelineIssueID</strong>: The unique
identifier used in the "Issues Table" in the
<a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">pbi-dataops-monitoring
template.</a>
</li>
<li><strong>AzureDevOpsBaseURL</strong>: The URL to the project hosting your Azure
pipelines.
</li>
</ul>
<h3>Functions</h3>
<ul>
<li><strong>fnGetPipelinesInProject</strong>: This custom function serves as a
wrapper for the <a href="https://learn.microsoft.com/en-us/rest/api/azure/devops/pipelines/pipelines/list" target="_blank">Pipelines --
List</a>
endpoint (a sketch of this wrapper follows the table descriptions below).
</li>
<li><strong>fnGetPipelineRunsInProject</strong>: This custom function serves as a
wrapper for the <a href="https://learn.microsoft.com/en-us/rest/api/azure/devops/pipelines/runs/get" target="_blank">Pipelines --
Runs</a>
endpoint.
</li>
</ul>
</div>
<div class="column part31 second-column">
<img src="/assets/img/posts/part31/Figure2.png" alt="Figure 2" class="center-image" />
<i class="center-text-figure">Figure 2 - Queries found in the Gen1 Dataflow</i>
</div>
</div>
<h3>Bronze</h3>
<ul><li><strong>Pipelines In Projects -- Intermediate</strong>: This table calls the
"fnGetPipelinesInProject" function used in other bronze group
tables.
</li>
<li><strong>Pipelines In Project</strong>: This table contains all pipelines associated with the project.
</li>
<li><strong>Pipelines Runs</strong>: This table calls the "fnGetPipelineRunsInProject" function and returns the history of available pipeline runs.
</li>
<li><strong>Latest Pipeline Runs</strong>: This table identifies the latest run for each pipeline in the project.
</li>
</ul>
<h3>Silver</h3>
<ul><li><strong>Schedule Pipeline Expectations</strong>: This table enables us to set
expectations for the frequency of pipeline runs. You can simply edit
the source table, add the Pipeline ID (found in
"PipelinesInProject"), and set the threshold for the number of hours
that can pass before an issue is raised.
</li>
<li><strong>Pipeline Runs in Project</strong>: This table combines the results of the
bronze group and represents a curated overview of pipeline runs.
</li>
<li><strong>Issues - Latest Pipeline Run Failures</strong>: This table combines the
results from the bronze group and flags any pipelines where the
latest iteration failed.
</li>
<li><strong>Issues -- Schedule Pipelines That Failed to Run as Scheduled</strong>:
This table combines the results from the bronze group and the
"Schedule Pipeline Expectations" table, flagging any scheduled
pipelines that failed to meet expectations.
</li>
</ul>
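<p>To give a feel for the bronze plumbing, a wrapper like fnGetPipelinesInProject might look something like the sketch below. This is an approximation, not the dataflow’s exact code: the api-version and response handling may differ, and the PAT is supplied through the connection’s credentials rather than in the M itself.</p>

<pre><code>// Hedged sketch of fnGetPipelinesInProject wrapping the Pipelines - List endpoint.
() as table =>
let
    Response = Web.Contents(
        AzureDevOpsBaseURL,
        [
            RelativePath = "_apis/pipelines",
            Query = [#"api-version" = "6.0-preview.1"]
        ]
    ),
    Json = Json.Document(Response),
    Pipelines = Table.FromRecords(Json[value])
in
    Pipelines
</code></pre>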
<p>These tables are already formatted for integration with the
<a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">pbi-dataops-monitoring
template.</a> Just add
the tables as <a href="https://github.com/kerski/pbi-dataops-monitoring/blob/main/documentation/Azure-DevOps-Addon.md">instructed here</a>, and you’re all
set.</p>
<p>Do you want to try the dataflow for your Azure DevOps project? I’ve got
a setup script that will install a template flow and create the Personal
Access Token so you can authenticate the connection to Azure DevOps in
your dataflow.</p>
<h2 id="side-note-on-personal-access-tokens">Side Note on Personal Access Tokens</h2>
<p>If you just read “personal access token” and thought, “What happens if
that expires? This could break, and I won’t be able to monitor my
dataflows,” you’re already thinking with DataOps principles in mind! As
the <a href="https://devblogs.microsoft.com/devops/introducing-service-principal-and-managed-identity-support-on-azure-devops/" target="_blank">service principal
authentication</a>
becomes generally available, we may see new and improved authentication
options. In the meantime, proactively schedule reminders for updating
tokens to avoid monitoring disruptions.</p>
<h2 id="conclusion">Conclusion</h2>
<p>The increasing adoption of Azure Pipelines with <a href="https://powerbi.microsoft.com/en-us/blog/deep-dive-into-power-bi-desktop-developer-mode-preview/" target="_blank">Git Integration in Power BI</a> is exciting, but make sure you prioritize monitoring these pipelines. Try the Gen1 dataflow template <a href="https://github.com/kerski/pbi-dataops-monitoring/blob/main/documentation/Azure-DevOps-Addon.md" target="_blank">I have shared on GitHub</a> to get started.</p>
<p>I’d like to hear your thoughts, so please let me know what you think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or
<a href="https://twitter.com/jkerski" target="_blank">Twitter</a> or <a href="https://www.youtube.com/channel/UC4xZ_vpQaVbrWzYpfvsjoGg/" target="_blank">YouTube</a>.</p>
<p><i>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</i></p>

<p><i><a href="https://www.kerski.tech//bringing-dataops-to-power-bi-part30" target="_blank">Part 30: Bringing DataOps to Power BI</a>, published 2023-07-05, by John Kerski</i></p>
<h2 id="dataops-101-on-youtube">DataOps 101 on YouTube</h2>
<p><br />
Over the past year during meetups and conferences, I have had the opportunity to introduce the concepts of DataOps to project managers, data engineers, and data analysts. During each session I share the trials and tribulations I have experienced with managing analytics products involving Power BI and how to avoid them by adhering to DataOps principles. Often after these sessions I have attendees commiserate with my stories and provide a few stories of their own. For me (and I hope for the attendees), it is cathartic to know that I’m not alone in the challenges of delivering analytic solutions.</p>
<p>Earlier this year I recorded a version of my DataOps 101 session, and I have posted that <a href="https://youtu.be/K4g7LdEJBSI" target="_blank">session on YouTube</a>. My hope is that you might learn more about DataOps, see how it can help you on your projects, and even laugh a little at a data engineer trying to do his best with video editing (I’m no Guy In A Cube).</p>
<p>After watching it, let me know what you think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or
<a href="https://twitter.com/jkerski" target="_blank">Twitter</a> or <a href="https://www.youtube.com/channel/UC4xZ_vpQaVbrWzYpfvsjoGg/" target="_blank">YouTube</a>.</p>
<p>I’ll continue to use my blog as my main medium of communication, and based on everyone’s feedback I’ll plan future videos.</p>
<p><i>Note: This was recorded prior to <a href="https://powerbi.microsoft.com/en-us/blog/deep-dive-into-power-bi-desktop-developer-mode-preview/" target="_blank">Power BI Desktop Developer Mode</a> making it to preview, but I still stand by my advice on learning Git and looking at <a href="https://github.com/pbi-tools" target="_blank">pbi-tools</a></i> 😊</p>

<p><i><a href="https://www.kerski.tech//bringing-dataops-to-power-bi-part29" target="_blank">Part 29: Bringing DataOps to Power BI</a>, published 2023-06-13, by John Kerski</i></p>
<h2 id="quality-is-paramount-and-the-cell-level-error">Quality is Paramount and the Cell-Level Error</h2>
<p>Cell-level errors (like the one in <strong>Figure 1</strong>) are the stuff of nightmares to me. You feel confident about the dataset when you move it to production, and, for a while, everything goes according to plan—until it doesn’t. You hear something is wrong, and upon inspecting the data, you realize some of your custom columns have generated cell-level errors.</p>
<p><img src="/assets/img/posts/part29/Figure1.png" alt="Figure 1" class="center-image" /></p>
<p><em class="center-text-figure">Figure 1 - Example of a cell-level error.</em></p>
<p>Microsoft states in <a href="https://learn.microsoft.com/en-us/power-query/dealing-with-errors" target="_blank">their
documentation</a>:
<em>“A cell-level error won't prevent the query from loading, but displays
error values as <strong>Error</strong> in the cell. Selecting the white space in the
cell displays the error pane underneath the data preview.”</em></p>
<p>To extend this definition, “A cell-level error won’t prevent the query from loading,” but a blank value will appear in the dataset. As an example, <strong>Figure 2</strong> illustrates a custom column that applies the <em>Number.FromText</em> function to the value of the second column. When the value of the second column is the word “two,” the <em>Number.FromText</em> function fails because it expects text it can parse as a number (e.g., “2”).</p>
<p><img src="/assets/img/posts/part29/Figure2.png" alt="Figure 2" class="center-image" /></p>
<p><em class="center-text-figure">Figure 2 - A cell with a cell-level error appears blank in the model.</em></p>
<p>A common culprit for this issue is data drift, the unexpected and
undocumented changes to data structure and semantics. While you may not
be responsible for or have authority over the upstream data that feeds
into your dataset, who do your customers blame? <strong>You</strong>.</p>
<p>So how do we avoid the dreaded cell-level error? It starts with
embracing a DataOps principle:
<a href="https://kerski.azureedge.net/bringing-dataops-to-power-bi-part4/" target="_blank"><strong>Quality is Paramount.</strong></a> This principle
states:</p>
<p><em>Analytic pipelines should be built with a foundation capable
of <strong>automated detection</strong> of abnormalities and security issues in code,
configuration, and data, and should provide continuous feedback to
operators for error avoidance.</em></p>
<p>As shown in <strong>Figure 3</strong>, working with Power BI in an analytic pipeline
typically consists of 3 steps:</p>
<p>1) Extracting data from the source using Power Query</p>
<p>2) Transforming that data in Power Query</p>
<p>3) Loading that data into a Power BI dataset</p>
<p><img src="/assets/img/posts/part29/Figure3.png" alt="Figure 3" class="center-image" /></p>
<p><em class="center-text-figure">Figure 3 - Health checks should occur in a typical Power BI analytic
pipeline.</em></p>
<p>According to the Quality is Paramount principle, you should check for
issues throughout all three steps to reduce errors. However,
implementing what I call health checks into pipelines is easier said
than done. Power BI doesn’t offer an out-of-box health check tool yet
(I’m hopeful Microsoft Fabric/Data Activator will close that loop soon).
Until that happens, I typically recommend teams perform the following
health checks for each step:</p>
<ol style="line-height:200%">
<li><b>Extract.</b> Don't trust your upstream sources. You can check the
source data for schema or content issues through several methods:
<ol style="line-height:200%" type="a">
<li>Dataflows 1.0 -- Following the <a href="https://learn.microsoft.com/en-us/azure/databricks/lakehouse/medallion" target="_blank">Medallion
approach</a>, you can initially ingest the source data in the bronze layer of
the dataflow. Then, via a silver dataflow, you can import the
data and conduct health checks. Finally, a gold dataflow would feed
a health check dataset that a Power BI report monitors for issues.
</li>
<li>Dataflows 1.0 with Bring Your Own Data Storage -- If you have access to <a href="https://learn.microsoft.com/en-us/power-bi/transform-model/dataflows/dataflows-azure-data-lake-storage-integration" target="_blank">Bring Your Own Data
Storage</a> with Dataflows 1.0, you can inspect the schema and content using
automated checks in Azure DevOps pipelines. I describe this
approach in <a href="https://kerski.azureedge.net/bringing-dataops-to-power-bi-part21/" target="_blank">Part
21</a>.
</li>
<li> Dataflows 2.0 -- Now this advice is for a product in preview at
the time of this writing, and there may be changes in the final
product. But as it stands, you can apply the Medallion approach
described above and follow up with Notebooks to inspect the data
and conduct health checks. I'll describe that approach in a
future article (yep, a teaser).
</li>
</ol>
</li>
<li><b>Transform.</b> Automate error detection in the code used to
transform data. To achieve this, teach your teams to perform the
following:
<ol style="line-height:200%" type="a">
<li>If you're allowed to quarantine data that doesn't meet
expectations and would produce cell-level errors, I recommend
reading <a href="https://radacad.com/exception-reporting-in-power-bi-catch-the-error-rows-in-power-query" target="_blank">Radacad's excellent
article</a> to create exception tables.
</li>
<li>If you cannot quarantine data, try implementing the
<em>try/otherwise</em> M code within all your transform columns or
custom column steps (see <strong>Figure 4</strong> and the text version after this list). This step
automatically handles exceptions and transforms the data into a
more stable state.
<img src="/assets/img/posts/part29/Figure4.png" alt="Figure 4" class="center-image" />
<i class="center-text-figure">Figure 4 - Example of try/otherwise in M code.</i>
</li>
</ol>
</li>
<li><b>Load.</b> With any analytics pipeline, you should take a
defense-in-depth approach. Even if you implement checks during the Extract
and Transform steps, mistakes can still happen. Here are ways to
double-check for cell-level errors:
<ol style="line-height:200%" type="a">
<li>ExecuteQuery -- With the custom connector I offered in <a href="https://kerski.azureedge.net/bringing-dataops-to-power-bi-part22/" target="_blank">Part
22</a>, you can build a health check into the <a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">monitoring
template</a> and
check for the existence of null values produced by a
cell-level error. Using the ExecuteQuery function, you can
identify null values with a DAX query (shown in <strong>Figure 5</strong>
below, with an illustrative query after this list) and raise those issues through a Power BI report.
<img src="/assets/img/posts/part29/Figure5.png" alt="Figure 5" class="center-image" />
<span class="center-text-figure"><i>Figure 5. Example DAX query.</i></span>
</li>
<li>Check for "Errors in" tables -- If your team encounters cell-level
errors during development and chooses the "View Errors" option, a
table prefixed with "Errors in" is created and hidden in the model.
<img src="/assets/img/posts/part29/Figure6.png" alt="Figure 6" class="center-image" />
<i class="center-text-figure">Figure 6 - Example of an "Errors in" table.*</i>
If this table is present, it might mean that the cell-level error has been resolved and the table can be deleted. Or, it might mean that the cell-level errors, and the issues causing them, still exist. With the latest version of the <a href="https://github.com/kerski/pbi-dataops-monitoring" target="_blank">monitoring template</a>, you can identify occurrences of the "Errors in" tables.
</li>
</ol>
</li>
</ol>
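<p>For the <em>try/otherwise</em> technique from the Transform step, a text version of the Figure 4 idea looks like this; the column names mirror the earlier <em>Number.FromText</em> example:</p>

<pre><code>// Wrap the fragile conversion so a bad cell yields null instead of a
// cell-level error (quarantine or log the nulls separately if you can).
Table.AddColumn(
    Source,
    "Custom",
    each try Number.FromText([Column2]) otherwise null,
    type nullable number
)
</code></pre>

<p>And for the Load step, a null check in the spirit of Figure 5 can be as small as the following DAX; the table and column names are placeholders for your own model:</p>

<pre><code>// Count the blanks that cell-level errors leave behind so the monitoring
// report can raise an issue when the result is greater than zero.
EVALUATE
ROW (
    "BlankCustomValues",
    COUNTROWS ( FILTER ( 'Sales', ISBLANK ( 'Sales'[Custom] ) ) ) + 0
)
</code></pre>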
<p>I hope this helps stop cell-level errors from reaching your production datasets. I’d like to hear your thoughts, so please let me know what you
think on <a href="https://www.linkedin.com/in/john-kerski-41a697100" target="_blank">LinkedIn</a> or <a href="https://twitter.com/jkerski" target="_blank">Twitter</a>.</p>
<p>Finally, if you’re going to be at the <a href="https://365educon.com/DC/index.php" target="_blank">365 EduCon in Washington D.C.</a>, June 12th-16th 2023, I’ll be presenting on Wednesday and Friday. I hope to see you there!</p>
<p><i>This article was edited by my colleague and senior technical writer, <a href="https://www.linkedin.com/in/kiley-williams-garrett-9a16a618a/" target="_blank">Kiley Williams Garrett</a>.</i></p>