PA - System Models Configuration

PA - System Models Configuration

Introduction

This documentation describes how to go about deploying and configuring PA System Models. Unlike PA Custom Models that require manual configuration and maintenance, PA System Models are recommended and automatically maintained built-in models. An admin only needs to provide the LLM’s API token and API URL address.

All PA System Models are Azure OpenAI LLM configurations that are GDPR compliant if applying suggested default values.

These are the LLMs currently supported by the PA System Models:

  • gpt-4o-mini

  • gpt-4o

  • dall-e-3

In addition, the text embedding model text-embedding-ada-002 is also supported.

Prerequisites

The following prerequisites must be fullfilled before starting on configuring the PA models:

Configuration

First-time Deployment

Obtain the Azure ARM Template

Acquire the template from the application by clicking the ARM Template link shown red circled below.

For on-premises Saga 7 and later deployments, the ARM Template can also be accessed using the URL provided below. No special role or even user access to the search application is necessary. The <SEARCH PAGE FQDN> is in Saga documentation known as the Gateway Hostname.

https://<SEARCH PAGE FQDN>/api/chat/models/system-models-template

image-20250121-135439.png

Deploy models

Use the Deploy from Custom Template feature to deploy the Azure OpenAI models. It provides the preferred configuration parameters, such as the OpenAI model quota and the model deployment type. It is recommended to use Sweden Central as the deployment region due to the model's capacity and availability.

Deploy ARM template

  1. Select Build your own template in the editor

    image-20250214-102636.png
  2. Replace the selected text with ARM template definition and press Save

    image-20250214-102838.png
  3. Fill in the form

    image-20250206-102941.png
  4. If the deployment fails with error details listing the InsufficientQuota code, you have less available quota for the model(s) specified in the error than expected by the ARM template. In this case, please follow the instructions found in PA - How to Request Azure OpenAI Quotas

    obraz-20250408-113849.png

Obtain the API Token and the API URL

Once the deployment is completed, navigate to the selected resource group and the newly created Azure OpenAI resource.
Under Resource Management / Keys and Endpoint one will find the API token and the API URL

image-20250120-152529.png

Insert System Models

Provide the API token and the API URL and press Save. The application is now configured with recommended models and can be tested.

image-20250121-142746.png

 

Update Existing Deployment

Detection of New System Model Configuration

Once the system detects there is a new version of the recommended system model configuration, the Update system models option becomes available.

image-20250121-143345.png

Redeploying Models

The system will detect if the new recommended configuration requires a redeployment of the Azure OpenAI models. In that case, the red circled message seen below will be shown:

Untitled-20250409-145210.png

 

  1. Remove existing models

    1. Navigate to Azure AI services | Azure Open AI and identify the Azure Open AI resource.

    2. Delete the resource - that will only result in a soft delete.

    3. Select Managed deleted resources and purge the deleted resource.

      image-20250121-144455.png
  2. Follow the instructions in “Deploy Models” to deploy new models

  3. Follow the instructions in “Obtain API token, API URL and update the configuration” to finalize the model configuration.

Models Restoration

The application also provides the option to restore the default system model configuration. To do so, click the Restore system model option and provide the API token and the API URL in the pop up window before clicking Save.

image-20250121-145321.png