Deploy model packages to online endpoints (preview)

2024-08-14

A model package is a capability in Azure Machine Learning that collects all the dependencies required to deploy a machine learning model to a serving platform. Creating packages before deploying models makes deployments more robust and reliable and the MLOps workflow more efficient. Packages can be moved across workspaces and even outside Azure Machine Learning. To learn more, see Model packages (preview).

Important

This feature is currently in public preview. This preview version is provided without a service-level agreement, and we don't recommend it for production workloads. Certain features might not be supported or might have constrained capabilities.

For more information, see Supplemental Terms of Use for Azure Previews.

In this article, you learn how to package a model and deploy it to an online endpoint in Azure Machine Learning.

Prerequisites

Before following the steps in this article, make sure you have the following prerequisites:

An Azure subscription. If you don't have an Azure subscription, create a trial account before you begin. Try Azure Machine Learning.
An Azure Machine Learning workspace. If you don't have one, use the steps in the How to manage workspaces article to create one.
Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the owner or contributor role for the Azure Machine Learning workspace, or a custom role. For more information, see Manage access to an Azure Machine Learning workspace.

About this example

In this example, you package a model of type custom and deploy it to an online endpoint for online inference.

The example in this article is based on code samples contained in the azureml-examples repository. To run the commands locally without having to copy/paste YAML and other files, first clone the repo and then change directories to the folder:

Azure CLI
Python

git clone https://github.com/Azure/azureml-examples --depth 1
cd azureml-examples/cli

!git clone https://github.com/Azure/azureml-examples --depth 1
%cd azureml-examples/sdk/python

This section uses the example in the folder endpoints/online/deploy-packages/custom-model.

Connect to your workspace

Connect to the Azure Machine Learning workspace where you'll do your work.

Azure CLI
Python

az account set --subscription <subscription>
az configure --defaults workspace=<workspace> group=<resource-group> location=<location>

The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. In this section, you connect to the workspace in which you perform deployment tasks.

Import the required libraries:

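from azure.ai.ml import MLClient, Input
from azure.ai.ml.entities import AzureMLOnlineInferencingServer, BaseEnvironment
from azure.ai.ml.entities import CodeConfiguration, Environment, ModelPackage
from azure.ai.ml.entities import ManagedOnlineEndpoint, ManagedOnlineDeployment, Model
from azure.ai.ml.constants import AssetTypes
from azure.identity import DefaultAzureCredential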

If you're running in a compute instance in Azure Machine Learning, create an MLClient as follows:

ml_client = MLClient.from_config(DefaultAzureCredential())

Otherwise, configure your workspace details and get a handle to the workspace:

subscription_id = "<subscription>"
resource_group = "<resource-group>"
workspace = "<workspace>"

ml_client = MLClient(DefaultAzureCredential(), subscription_id, resource_group, workspace)
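
If you prefer not to hard-code workspace details, one option is to read them from environment variables. The following is a minimal sketch; the variable names `AZURE_SUBSCRIPTION_ID`, `AZURE_RESOURCE_GROUP`, and `AZURE_ML_WORKSPACE` and both helper functions are chosen for illustration and aren't defined by Azure Machine Learning.

```python
import os

# Hypothetical environment variable names, chosen for this sketch.
REQUIRED_VARS = ("AZURE_SUBSCRIPTION_ID", "AZURE_RESOURCE_GROUP", "AZURE_ML_WORKSPACE")


def workspace_settings(env=None):
    """Return (subscription_id, resource_group, workspace) read from env.

    Raises ValueError naming every variable that is missing or empty.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("Missing workspace settings: " + ", ".join(missing))
    return tuple(env[name] for name in REQUIRED_VARS)


def connect(env=None):
    """Create an MLClient from the settings (needs azure-ai-ml and azure-identity)."""
    # Imported lazily so workspace_settings() works without the SDK installed.
    from azure.ai.ml import MLClient
    from azure.identity import DefaultAzureCredential

    subscription_id, resource_group, workspace = workspace_settings(env)
    return MLClient(DefaultAzureCredential(), subscription_id, resource_group, workspace)
```

With the three variables set, `ml_client = connect()` would take the place of the explicit assignment above.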

Package the model

You can create model packages explicitly, which lets you control how the packaging operation is done. To create a model package, specify the following:

Model to package: Each model package can contain only a single model. Azure Machine Learning doesn't support packaging of multiple models under the same model package.
Base environment: Environments indicate the base image and the Python package dependencies your model needs. For MLflow models, Azure Machine Learning automatically generates the base environment. For custom models, you need to specify it.
Serving technology: The inferencing stack used to run the model.

Tip

If your model is an MLflow model, you don't need to create the model package manually. Azure Machine Learning can package it automatically before deployment. See Deploy MLflow models to online endpoints.

Model packages require the model to be registered in either your workspace or in an Azure Machine Learning registry. In this example, you already have a local copy of the model in the repository, so you only need to publish the model to the registry in the workspace. You can skip this section if the model you're trying to deploy is already registered.
- Azure CLI
- Python
```
 MODEL_NAME='sklearn-regression'
 MODEL_PATH='model'
 az ml model create --name $MODEL_NAME --path $MODEL_PATH --type custom_model
```
```
 model_name = "sklearn-regression"
 model_path = "model"

 model = ml_client.models.create_or_update(Model(name=model_name, path=model_path))
```
The model requires the following packages to run, and they're specified in a conda file:

conda.yaml
```
 name: model-env
 channels:
  - conda-forge
 dependencies:
  - python=3.9
  - numpy=1.23.5
  - pip=23.0.1
  - scikit-learn=1.2.2
  - scipy=1.10.1
  - xgboost==1.3.3
```
Note

Notice how only the model's requirements are indicated in the conda YAML. Any package required for the inferencing server is included by the package operation.
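
Because the package operation builds an environment from this file, it can help to confirm that every dependency carries a version constraint before you package. The following is a small sketch of such a check; the helper name is hypothetical, and the dependency list is copied by hand from the conda file above rather than parsed from it.

```python
def unpinned_dependencies(dependencies):
    """Return the conda dependency strings that carry no version constraint."""
    # A spec such as "numpy=1.23.5" or "xgboost==1.3.3" contains "=", and
    # range constraints use "<" or ">"; a bare name such as "numpy" has none.
    return [dep for dep in dependencies if not any(ch in dep for ch in "=<>")]


# Dependencies copied from the conda.yaml shown above.
model_dependencies = [
    "python=3.9",
    "numpy=1.23.5",
    "pip=23.0.1",
    "scikit-learn=1.2.2",
    "scipy=1.10.1",
    "xgboost==1.3.3",
]

print(unpinned_dependencies(model_dependencies))  # prints []
```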

Tip

If your model requires packages hosted in private feeds, you can configure your package to include them. Read Package a model that has dependencies in private Python feeds.

Create a base environment that contains the model requirements and a base image. Only dependencies required by your model are indicated in the base environment. For MLflow models, the base environment is optional; if you omit it, Azure Machine Learning autogenerates it for you.

Azure CLI
Python

Create a base environment definition:

sklearn-regression-env.yml

 $schema: https://azuremlschemas.azureedge.net/latest/environment.schema.json
 name: sklearn-regression-env
 image: mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu22.04
 conda_file: conda.yaml
 description: An environment for models built with XGBoost and Scikit-learn.

Then create the environment as follows:

 az ml environment create -f environment/sklearn-regression-env.yml

 base_environment = ml_client.environments.create_or_update(
     Environment(
         name=f"{model_name}-env",
         image="mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu22.04",
         conda_file="environment/conda.yaml",
     )
 )

Create a package specification:

Azure CLI
Python

package-moe.yml

 $schema: http://azureml/sdk-2-0/ModelVersionPackage.json
 base_environment_source:
   type: environment_asset
   resource_id: azureml:sklearn-regression-env@latest
 target_environment_name: sklearn-regression-online-pkg
 inferencing_server:
   type: azureml_online
   code_configuration:
     code: src
     scoring_script: score.py

To create a model package, create a package specification as follows:

 package_config = ModelPackage(
     target_environment_name="sklearn-regression-online-pkg",
     base_environment_source=BaseEnvironment(
         type="asset",
         resource_id=f"azureml:{base_environment.name}:{base_environment.version}",
     ),
     inferencing_server=AzureMLOnlineInferencingServer(
         code_configuration=CodeConfiguration(code="src", scoring_script="score.py")
     ),
 )

Start the model package operation:

Azure CLI
Python

 az ml model package -n $MODEL_NAME -l latest --file package-moe.yml

 model_package = ml_client.models.package(model_name, model.version, package_config)

The result of the package operation is an environment.
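
The package specification points to a scoring script, score.py, in the src folder, which this article doesn't list. Scoring scripts for Azure Machine Learning online endpoints conventionally expose an `init()` function that loads the model and a `run()` function that handles each request. The following is a minimal sketch of that shape, not the script from the azureml-examples repository. The model file name `sklearn_regression_model.pkl`, its location under the `AZUREML_MODEL_DIR` folder, the use of joblib, and the `predict_from_json` helper are assumptions; `SumModel` is a stand-in used only to try the logic locally.

```python
import json
import os

model = None  # Populated by init() when the inferencing server starts.


def init():
    """Load the model once, when the deployment starts."""
    global model
    # joblib is imported lazily; it is assumed to be present in the
    # environment because scikit-learn depends on it.
    import joblib

    # Assumed layout: <AZUREML_MODEL_DIR>/model/sklearn_regression_model.pkl
    model_path = os.path.join(
        os.environ["AZUREML_MODEL_DIR"], "model", "sklearn_regression_model.pkl"
    )
    model = joblib.load(model_path)


def predict_from_json(loaded_model, raw_data):
    """Score a request body shaped like {"data": [[...], [...]]}."""
    rows = json.loads(raw_data)["data"]
    predictions = loaded_model.predict(rows)
    # Convert NumPy values (or plain numbers) to JSON-serializable floats.
    return [float(value) for value in predictions]


def run(raw_data):
    """Entry point called by the inferencing server for each request."""
    return predict_from_json(model, raw_data)


class SumModel:
    """Stand-in model for local experiments: predicts the sum of each row."""

    def predict(self, rows):
        return [sum(row) for row in rows]


print(predict_from_json(SumModel(), '{"data": [[1, 2], [3, 4]]}'))  # prints [3.0, 7.0]
```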

Deploy the model package

Model packages can be deployed directly to online endpoints in Azure Machine Learning. Follow these steps to deploy a package to an online endpoint:

Pick a name for an endpoint to host the deployment of the package and create it:

Azure CLI
Python

 ENDPOINT_NAME="sklearn-regression-online"

 az ml online-endpoint create -n $ENDPOINT_NAME

 endpoint_name = "sklearn-regression-online"

 endpoint = ManagedOnlineEndpoint(name=endpoint_name)
 endpoint = ml_client.online_endpoints.begin_create_or_update(endpoint).result()
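
Endpoint names need to be unique within an Azure region, so a fixed name can collide with an endpoint that already exists. One common workaround is to append a short random suffix. The following is a minimal sketch; the helper name and the suffix length are arbitrary choices for illustration.

```python
import uuid


def unique_endpoint_name(prefix="sklearn-regression", suffix_length=6):
    """Return the prefix plus a random hexadecimal suffix, joined by a hyphen."""
    suffix = uuid.uuid4().hex[:suffix_length]
    return f"{prefix}-{suffix}"


endpoint_name = unique_endpoint_name()
print(endpoint_name)  # for example: sklearn-regression-3f9a1c
```

If you use a generated name, pass the same value to every later step that refers to the endpoint.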

Create the deployment, using the package. Notice how environment is configured with the package you've created.

Azure CLI
Python

deployment.yml

 $schema: https://azuremlschemas.azureedge.net/latest/managedOnlineDeployment.schema.json
 name: with-package
 endpoint_name: sklearn-regression-online
 environment: azureml:sklearn-regression-online-pkg@latest
 instance_type: Standard_DS3_v2
 instance_count: 1

 deployment_package = ManagedOnlineDeployment(
     name="with-package",
     endpoint_name=endpoint_name,
     environment=model_package,
     instance_count=1,
 )

Tip

Notice you don't specify the model or scoring script in this example; they're all part of the package.

Start the deployment:

Azure CLI
Python

 az ml online-deployment create -f deployment.yml

 ml_client.online_deployments.begin_create_or_update(deployment_package).result()

At this point, the deployment is ready to be consumed. You can test how it's working by creating a sample request file:

sample-request.json
```
 {
     "data": [
         [1,2,3,4,5,6,7,8,9,10],
         [10,9,8,7,6,5,4,3,2,1]
     ]
 }
```
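
The same request body can also be generated programmatically, which makes it easier to check its shape before sending it. The following is a sketch under the assumption that the model expects 10 features per row, as the sample rows suggest; `build_request` and `write_request` are hypothetical helper names.

```python
import json


def build_request(rows, n_features=10):
    """Return the request body as a dict after checking every row's length."""
    checked = []
    for index, row in enumerate(rows):
        values = list(row)
        if len(values) != n_features:
            raise ValueError(
                f"Row {index} has {len(values)} values, expected {n_features}"
            )
        checked.append(values)
    return {"data": checked}


def write_request(rows, path="sample-request.json"):
    """Write the request body to a JSON file that the invoke step can read."""
    with open(path, "w") as handle:
        json.dump(build_request(rows), handle)


if __name__ == "__main__":
    # Reproduces the two sample rows shown above.
    write_request([range(1, 11), range(10, 0, -1)])
```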

Send the request to the endpoint:

Azure CLI
Python

 az ml online-endpoint invoke -n $ENDPOINT_NAME -d with-package -f sample-request.json

 ml_client.online_endpoints.invoke(
     endpoint_name=endpoint_name,
     deployment_name="with-package",
     request_file="sample-request.json",
 )
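
In the Python SDK, `invoke` returns the response body as a string. Assuming the scoring script returns a JSON list with one prediction per input row (an assumption about score.py, which this article doesn't show), the response can be decoded as in the following sketch. Some scoring scripts return a JSON-encoded string that itself contains JSON, so the hypothetical `parse_predictions` helper decodes a second time when needed.

```python
import json


def parse_predictions(response_text, expected_rows=None):
    """Decode the endpoint response into a list of floats."""
    payload = json.loads(response_text)
    # Handle a doubly encoded body, where the outer JSON value is a string.
    if isinstance(payload, str):
        payload = json.loads(payload)
    if not isinstance(payload, list):
        raise ValueError(f"Expected a JSON list, got {type(payload).__name__}")
    if expected_rows is not None and len(payload) != expected_rows:
        raise ValueError(f"Expected {expected_rows} predictions, got {len(payload)}")
    return [float(value) for value in payload]


# Example with a made-up response body (not real model output).
print(parse_predictions("[1.5, 2]", expected_rows=2))  # prints [1.5, 2.0]
```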

Next step

Package and deploy a model to App Service