Install ONNX Runtime (ORT)

Install ONNX Runtime CPU

pip install onnxruntime

Install ONNX Runtime GPU (CUDA 11.x)

The default CUDA version for ORT is 11.8.

pip install onnxruntime-gpu
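
After installing, it can be useful to confirm which package was picked up and whether the CUDA execution provider is visible. The following is a minimal sketch, not an official check: the helper name `describe_onnxruntime_install` is hypothetical, and it assumes the standard `onnxruntime` Python module attributes `__version__`, `get_device()`, and `get_available_providers()`.

```python
import importlib.util


def describe_onnxruntime_install():
    """Return basic facts about the installed onnxruntime, or None if it is absent.

    Hypothetical helper for illustration only.
    """
    # Avoid an ImportError when the package has not been installed yet.
    if importlib.util.find_spec("onnxruntime") is None:
        return None

    import onnxruntime as ort

    return {
        "version": ort.__version__,
        "device": ort.get_device(),
        "providers": ort.get_available_providers(),
    }


info = describe_onnxruntime_install()
if info is None:
    print("onnxruntime is not installed in this environment")
else:
    print(info)
    # The GPU package is expected to list CUDAExecutionProvider when CUDA is usable.
    print("CUDA available:", "CUDAExecutionProvider" in info["providers"])
```

If `CUDAExecutionProvider` is missing after installing the GPU package, a mismatch between the installed CUDA version and the package is one likely cause.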

Install ONNX Runtime GPU (CUDA 12.x)

For CUDA 12.x, use the following command to install from the ORT Azure DevOps feed:

pip install onnxruntime-gpu --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
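
The three Python install variants above differ only in the package name and the extra index URL. As a rough sketch of that choice (the function `pip_install_args` and its `cuda_major` parameter are hypothetical names introduced here, not part of any ORT tooling), a setup script could pick the command like this:

```python
import shlex

# Feed URL copied from the CUDA 12.x instructions above.
CUDA12_INDEX_URL = (
    "https://aiinfra.pkgs.visualstudio.com/PublicPackages/"
    "_packaging/onnxruntime-cuda-12/pypi/simple/"
)


def pip_install_args(cuda_major=None):
    """Build the pip command for a CPU (None), CUDA 11.x, or CUDA 12.x install.

    Hypothetical helper for illustration only.
    """
    if cuda_major is None:
        return ["pip", "install", "onnxruntime"]
    if cuda_major == 11:
        return ["pip", "install", "onnxruntime-gpu"]
    if cuda_major == 12:
        return [
            "pip", "install", "onnxruntime-gpu",
            "--extra-index-url", CUDA12_INDEX_URL,
        ]
    raise ValueError(f"no install instructions listed for CUDA {cuda_major}.x")


for target in (None, 11, 12):
    print(shlex.join(pip_install_args(target)))
```

Returning an argument list rather than a single string keeps the command safe to pass to `subprocess.run` without shell quoting concerns.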

         

Install ONNX to export the model

## ONNX is built into PyTorch
pip install torch

## TensorFlow
pip install tf2onnx

## scikit-learn
pip install skl2onnx

C#/C/C++/WinML Installs

Install ONNX Runtime (ORT)

Install ONNX Runtime CPU

# CPU
dotnet add package Microsoft.ML.OnnxRuntime

Install ONNX Runtime GPU (CUDA 11.x)

The default CUDA version for ORT is 11.8.

# GPU
dotnet add package Microsoft.ML.OnnxRuntime.Gpu

Install ONNX Runtime GPU (CUDA 12.x)

Project Setup

Ensure you have installed the latest version of the Azure Artifacts keyring from its GitHub repo.
Add a nuget.config file to your project in the same directory as your .csproj file.

<?xml version="1.0" encoding="utf-8"?>
<configuration>
    <packageSources>
        <clear/>
        <add key="onnxruntime-cuda-12"
             value="https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/nuget/v3/index.json"/>
    </packageSources>
</configuration>

         

Restore packages

Restore packages using the --interactive flag, which allows dotnet to prompt you for credentials:

dotnet add package Microsoft.ML.OnnxRuntime.Gpu --interactive

Note: You don’t need --interactive every time. dotnet will prompt you to add --interactive if it needs updated credentials.

DirectML

dotnet add package Microsoft.ML.OnnxRuntime.DirectML

WinML

dotnet add package Microsoft.AI.MachineLearning

Install on web and mobile

Unless stated otherwise, the installation instructions in this section refer to pre-built packages that include support for selected operators and ONNX opset versions based on the requirements of popular models. These packages may be referred to as “mobile packages”. If you use mobile packages, your model must only use the supported opsets and operators.

Another type of pre-built package has full support for all ONNX opsets and operators, at the cost of larger binary size. These packages are referred to as “full packages”.

If the pre-built mobile package supports your models but is too large, you can create a custom build. A custom build can include just the opsets and operators used by your models, which reduces the binary size.

If the pre-built mobile package does not include the opsets or operators in your models, you can either use the full package, if available, or create a custom build.
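
The decision between the mobile package, the full package, and a custom build can be summarized as a small piece of logic. The sketch below is illustrative only: `choose_package` and its parameters are hypothetical names, the operator sets are placeholders rather than the real supported-operator list, and it compares operator names only, ignoring opset versions.

```python
def choose_package(model_ops, mobile_ops, mobile_size_ok=True, full_available=True):
    """Pick a package type following the guidance in this section.

    model_ops:      operator names used by the model (placeholder data)
    mobile_ops:     operator names supported by the mobile package (placeholder data)
    mobile_size_ok: whether the mobile package's binary size is acceptable
    full_available: whether a full package exists for the target platform
    """
    missing = set(model_ops) - set(mobile_ops)
    if not missing:
        # The mobile package supports the model; size decides the rest.
        return "mobile package" if mobile_size_ok else "custom build"
    # The mobile package lacks some operators the model needs.
    return "full package" if full_available else "custom build"


# Placeholder operator sets, not the actual list shipped with the mobile package.
supported = {"Conv", "Relu", "Add", "MatMul"}
print(choose_package({"Conv", "Relu"}, supported))
print(choose_package({"Conv", "Relu"}, supported, mobile_size_ok=False))
print(choose_package({"Conv", "Einsum"}, supported))
print(choose_package({"Conv", "Einsum"}, supported, full_available=False))
```

A real check would also need to compare the opset versions the model declares against those the package supports.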

JavaScript Installs

Install ONNX Runtime Web (browsers)

# install latest release version
npm install onnxruntime-web
# install nightly build dev version
npm install onnxruntime-web@dev

         

Install ONNX Runtime Node.js binding (Node.js)

# install latest release version
npm install onnxruntime-node

Install ONNX Runtime for React Native

# install latest release version
npm install onnxruntime-react-native

Install on iOS

In your CocoaPods Podfile, add the onnxruntime-c, onnxruntime-mobile-c, onnxruntime-objc, or onnxruntime-mobile-objc pod, depending on whether you want to use a full or mobile package and which API you want to use.

C/C++

  use_frameworks!
  # choose one of the two below:
  pod 'onnxruntime-c'  # full package
  #pod 'onnxruntime-mobile-c'  # mobile package

         

Objective-C

  use_frameworks!
  # choose one of the two below:
  pod 'onnxruntime-objc'  # full package
  #pod 'onnxruntime-mobile-objc'  # mobile package

         

Run pod install.

Custom build

Refer to the instructions for creating a custom iOS package.

Install on Android

Java/Kotlin

In your Android Studio Project, make the following changes to:

build.gradle (Project):
```
repositories {
    mavenCentral()
}
```

build.gradle (Module):

dependencies {
    // choose one of the two below:
    implementation 'com.microsoft.onnxruntime:onnxruntime-android:latest.release'  // full package
    //implementation 'com.microsoft.onnxruntime:onnxruntime-mobile:latest.release'  // mobile package
}

           

C/C++

Download the onnxruntime-android (full package) or onnxruntime-mobile (mobile package) AAR hosted at MavenCentral, change the file extension from .aar to .zip, and unzip it. Include the header files from the headers folder and the relevant libonnxruntime.so dynamic library from the jni folder in your NDK project.
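
Because an AAR is a zip archive, the unpacking step can also be scripted. The sketch below is one possible approach, not part of the official instructions: the function `extract_native_files` is a hypothetical name, and it assumes the library for each ABI sits under `jni/<abi>/`, with `arm64-v8a` used only as an example default.

```python
import zipfile
from pathlib import Path


def extract_native_files(aar_path, dest_dir, abi="arm64-v8a"):
    """Extract the headers and one ABI's libonnxruntime.so from an AAR.

    Hypothetical helper; assumes headers live under 'headers/' and the
    shared library under 'jni/<abi>/' inside the archive.
    Returns the sorted list of archive entries that were extracted.
    """
    dest = Path(dest_dir)
    wanted_lib = f"jni/{abi}/libonnxruntime.so"
    extracted = []
    # zipfile reads the .aar directly, so no rename to .zip is needed here.
    with zipfile.ZipFile(aar_path) as aar:
        for name in aar.namelist():
            is_header = name.startswith("headers/") and not name.endswith("/")
            if is_header or name == wanted_lib:
                aar.extract(name, dest)
                extracted.append(name)
    return sorted(extracted)


# Example call (the file name is a placeholder for the AAR you downloaded):
# extract_native_files("onnxruntime-android.aar", "third_party/onnxruntime")
```

The extracted `headers` directory and the `.so` file can then be referenced from your NDK build configuration.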

Custom build

Refer to the instructions for creating a custom Android package.

Install for On-Device Training

Unless stated otherwise, the installation instructions in this section refer to pre-built packages designed to perform on-device training.

If the pre-built training package supports your model but is too large, you can create a custom training build.

Offline Phase - Prepare for Training

# Quote the version specifiers so the shell does not treat ">" as output redirection
python -m pip install cerberus flatbuffers h5py "numpy>=1.16.6" onnx packaging protobuf sympy "setuptools>=41.4.0"
pip install -i https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT/pypi/simple/ onnxruntime-training-cpu

Training Phase - On-Device Training

Device	Language	Package Name	Installation Instructions
Windows	C, C++, C#	Microsoft.ML.OnnxRuntime.Training	dotnet add package Microsoft.ML.OnnxRuntime.Training
Linux	C, C++	onnxruntime-training-linux*.tgz	Download the `*.tgz` file from here. Extract it. Move and include the header files in the `include` directory. Move the `libonnxruntime.so` dynamic library to a desired path and include it.
Linux	Python	onnxruntime-training	pip install onnxruntime-training
Android	C, C++	onnxruntime-training-android	Download the onnxruntime-training-android (full package) AAR hosted at Maven Central. Change the file extension from `.aar` to `.zip`, and unzip it. Include the header files from the `headers` folder. Include the relevant `libonnxruntime.so` dynamic library from the `jni` folder in your NDK project.
Android	Java/Kotlin	onnxruntime-training-android	In your Android Studio Project, make the following changes to: build.gradle (Project): repositories { mavenCentral() } build.gradle (Module): dependencies { implementation 'com.microsoft.onnxruntime:onnxruntime-training-android:latest.release' }
iOS	C, C++	CocoaPods: onnxruntime-training-c	In your CocoaPods `Podfile`, add the `onnxruntime-training-c` pod: use_frameworks! pod 'onnxruntime-training-c'
