Model deployment¶
Goal¶
- In the following tutorials you will:
Prepare a pre-trained model and data for the deployment process
Set up Vitis AI deployment container environment
- Deploy a pre-trained PyTorch land cover segmentation model to a Vitis AI-compatible format by:
quantizing the model to work with efficient numeric representation
compiling the model to the format compatible with the given FPGA-based inference accelerator
A bit of background¶
A model trained using PyTorch or TensorFlow isn’t suitable for running on a DPU unit out-of-the-box. It needs to undergo conversion process that will enable efficient inference on the target DPU platform. The deployment process contains two steps: quantization and compilation. Quantization converts model to work in efficient data representation. Compilation translates the model architecture into FPGA-based accelerator compatible format.
The deployment process differs a bit depending on the framework used for model training (PyTorch/Tensorflow). This tutorials covers deployment of a PyTorch-based land cover segmentation model for demonstrative purposes.
Model deployment steps¶
Proceed to the Model quantization and Model compilation tutorials to learn how to deploy a deep learning model to a DPU.