This post is the second in a series of introductory tutorials on the Open Neural Network Exchange (ONNX). Read part one here.
In the previous part of this series, I introduced the Open Neural Network Exchange (ONNX) and the ONNX Runtime as the interoperable toolkit and platform for machine learning and deep learning models.
In this tutorial, we will explore how to use an existing ONNX model for inferencing. In just 30 lines of code, including preprocessing of the input image, we will run inference on the MNIST model to predict the digit in an image.
The objective of this tutorial is to familiarize you with the ONNX file format and runtime.
Setting up the Environment
To complete this tutorial, you need Python 3.x running on your machine. We will start by creating a Python 3 virtual environment to isolate the tutorial from the machine's main Python environment.
python3 -m venv onnx_mnist
source onnx_mnist/bin/activate
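Next, install the Python packages used in this tutorial and download the pre-trained MNIST model from the ONNX Model Zoo. The exact download URL below is an assumption and may need to be adjusted to the Model Zoo's current repository layout:
pip install numpy opencv-python onnx onnxruntime
# Download and extract the pre-trained MNIST model from the ONNX Model Zoo.
# The URL is an assumption; check https://github.com/onnx/models for the current path.
wget https://github.com/onnx/models/raw/main/validated/vision/classification/mnist/model/mnist-8.tar.gz
tar xvzf mnist-8.tar.gz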
Extracting the archive results in a new directory called mnist that contains the model and the test data serialized into ProtoBuf files. We are not going to use the test data in this tutorial.
We can now examine the model with the Netron tool by opening the model.onnx file.
As the graph from Netron shows, the MNIST model from the ONNX Model Zoo applies max pooling after each of its convolutions.
The model has two convolutional layers, two maxpool layers, one dense layer, and an output layer that can classify one of the 10 values representing the labels used in the MNIST dataset.
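You can also verify this structure programmatically with the ONNX module. Here is a quick sketch, assuming the model file sits at ./mnist/model.onnx as in the code that follows:
import onnx

model = onnx.load("./mnist/model.onnx")
onnx.checker.check_model(model)  # validate the file against the ONNX spec
# Print the operator types (Conv, MaxPool, and so on) in graph order
print([node.op_type for node in model.graph.node])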
Writing Inference Code for Prediction
We will now write code for performing inference on the pre-trained MNIST model.
Let’s start by importing the right Python modules.
import json
import sys
import os
import time
import numpy as np
import cv2
import onnx
import onnxruntime
from onnx import numpy_helper
Notice that we are using ONNX, ONNX Runtime, and the NumPy helper module that ships with ONNX.
The ONNX module helps in parsing the model file, while the ONNX Runtime module is responsible for creating a session and performing inference.
Next, we will initialize some variables to hold the path of the model files and command-line arguments.
model_dir ="./mnist"
model=model_dir+"/model.onnx"
path=sys.argv[1]
# Preprocess the image
img = cv2.imread(path)  # read the image as a uint8 array
img = np.dot(img[..., :3], [0.299, 0.587, 0.114])  # collapse the color channels to grayscale
img = cv2.resize(img, dsize=(28, 28), interpolation=cv2.INTER_AREA)  # scale down to 28x28 pixels
img.resize((1, 1, 28, 28))  # reshape in place to (batch, channel, height, width)
The above code snippet converts the image to grayscale, resizes it to a 28×28 array, and reshapes it to the (1, 1, 28, 28) shape the model expects as input.
We will now convert the image into a NumPy array of type float32.
data = json.dumps({'data': img.tolist()})
data = np.array(json.loads(data)['data']).astype('float32')
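Incidentally, the JSON round-trip above simply produces a float32 NumPy array from the preprocessed image; a shorter, functionally equivalent alternative is a direct cast:
# Equivalent to the JSON round-trip above
data = img.astype('float32')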
session = onnxruntime.InferenceSession(model, None)  # load the model into an ONNX Runtime session
input_name = session.get_inputs()[0].name    # name of the input node
output_name = session.get_outputs()[0].name  # name of the output node
print(input_name)
print(output_name)
We need to use the same names as the input and output layers of the neural network. You can easily retrieve them from the session.get_inputs() and session.get_outputs() methods. The output from the above snippet matches the input and output node names shown by Netron.
Let’s pass the input to the session and print the prediction.
result = session.run([output_name], {input_name: data})  # run inference
prediction = int(np.argmax(np.array(result).squeeze(), axis=0))  # class with the highest score
print(prediction)
We apply NumPy's argmax function to retrieve the class with the highest score.
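Strictly speaking, the model emits raw, unnormalized scores rather than probabilities. If you want a confidence value to go with the prediction, here is a minimal sketch that applies softmax to the output, assuming the result and prediction variables from the snippets above:
scores = np.array(result).squeeze()      # shape (10,): one score per digit class
probs = np.exp(scores - np.max(scores))  # subtract the max for numerical stability
probs /= probs.sum()                     # normalize into a probability distribution
print(probs[prediction])                 # confidence of the predicted digit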
Try running the code by passing an image of a handwritten digit; the model should predict it correctly with good confidence.
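For example, assuming you saved the script as mnist_predict.py and have a test image named digit.png (both names are hypothetical):
python mnist_predict.py digit.png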
Here is the complete code for your reference:
import json
import sys
import os
import time
import numpy as np
import cv2
import onnx
import onnxruntime
from onnx import numpy_helper

model_dir = "./mnist"
model = model_dir + "/model.onnx"

path = sys.argv[1]

# Preprocess the image
img = cv2.imread(path)
img = np.dot(img[..., :3], [0.299, 0.587, 0.114])
img = cv2.resize(img, dsize=(28, 28), interpolation=cv2.INTER_AREA)
img.resize((1, 1, 28, 28))

data = json.dumps({'data': img.tolist()})
data = np.array(json.loads(data)['data']).astype('float32')

session = onnxruntime.InferenceSession(model, None)
input_name = session.get_inputs()[0].name
output_name = session.get_outputs()[0].name
#print(input_name)
#print(output_name)

result = session.run([output_name], {input_name: data})
prediction = int(np.argmax(np.array(result).squeeze(), axis=0))
print(prediction)
In the next part of this tutorial, we will learn how to export a PyTorch model and convert it into a TensorFlow SavedModel file. Stay tuned.
Janakiram MSV’s webinar series, “Machine Intelligence and Modern Infrastructure (MI2),” offers informative and insightful sessions covering cutting-edge technologies. Sign up for the upcoming MI2 webinar at http://mi2.live.