Model deployment

Goal

In the following tutorials you will:
  • Prepare a pre-trained model and data for the deployment process

  • Set up Vitis AI deployment container environment

  • Deploy a pre-trained PyTorch land cover segmentation model to a Vitis AI-compatible format by:
    • quantizing the model to work with efficient numeric representation

    • compiling the model to the format compatible with the given FPGA-based inference accelerator

A bit of background

A model trained using PyTorch or TensorFlow isn’t suitable for running on a DPU unit out-of-the-box. It needs to undergo conversion process that will enable efficient inference on the target DPU platform. The deployment process contains two steps: quantization and compilation. Quantization converts model to work in efficient data representation. Compilation translates the model architecture into FPGA-based accelerator compatible format.

The deployment process differs a bit depending on the framework used for model training (PyTorch/Tensorflow). This tutorials covers deployment of a PyTorch-based land cover segmentation model for demonstrative purposes.

Model deployment steps

Proceed to the Model quantization and Model compilation tutorials to learn how to deploy a deep learning model to a DPU.