Accelerate PyTorch model inferencing
ONNX Runtime can be used to accelerate inference for PyTorch models.
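As a quick illustration of the end state (a PyTorch model already exported to ONNX and served with ONNX Runtime), here is a minimal Python sketch; the file name `model.onnx` and the input shape are placeholders, not part of any tutorial below.

```python
# Minimal sketch: run an already-exported ONNX model with ONNX Runtime.
# "model.onnx" and the (1, 3, 224, 224) input shape are illustrative placeholders.
import numpy as np
import onnxruntime as ort

# Create an inference session on the default CPU execution provider.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

# Feed a dummy input under the graph's declared input name.
input_name = session.get_inputs()[0].name
dummy_input = np.random.randn(1, 3, 224, 224).astype(np.float32)

# run(None, ...) returns all model outputs as NumPy arrays.
outputs = session.run(None, {input_name: dummy_input})
print(outputs[0].shape)
```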
Convert model to ONNX
- Basic PyTorch export through torch.onnx (a minimal export sketch follows this list)
- Super-resolution with ONNX Runtime
- Export PyTorch model with custom ops
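As a rough sketch of the basic `torch.onnx` export path listed above, the following exports a model to ONNX; `torchvision`'s `resnet18`, the opset version, and the file name are illustrative choices rather than anything the tutorials prescribe.

```python
# Minimal sketch: export a PyTorch model to ONNX via torch.onnx.export.
# resnet18, opset 17, and "resnet18.onnx" are illustrative choices.
import torch
import torchvision.models as models

model = models.resnet18(weights=None).eval()
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "resnet18.onnx",
    input_names=["input"],
    output_names=["output"],
    # Let the batch dimension vary at inference time.
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
    opset_version=17,
)
```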
Accelerate PyTorch model inferencing
BERT
- Accelerate BERT model on CPU
- Accelerate BERT model on GPU (an execution provider sketch follows this list)
- Accelerate reduced size BERT model through quantization (a quantization sketch follows this list)
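For the CPU and GPU tutorials above, the device is chosen through ONNX Runtime execution providers rather than through model changes. A minimal sketch, assuming a BERT model has already been exported to `bert.onnx` (the file name and sequence length are placeholders):

```python
# Minimal sketch: run an exported BERT model on GPU when available, CPU otherwise.
# "bert.onnx" and the (1, 128) dummy shapes are illustrative placeholders.
import numpy as np
import onnxruntime as ort

# Providers are tried in order: CUDA first, then the CPU fallback.
providers = ["CUDAExecutionProvider", "CPUExecutionProvider"]
session = ort.InferenceSession("bert.onnx", providers=providers)

# Feed dummy int64 tensors for whatever inputs the exported graph declares
# (typically input_ids / attention_mask / token_type_ids for BERT exports).
feed = {inp.name: np.ones((1, 128), dtype=np.int64) for inp in session.get_inputs()}

outputs = session.run(None, feed)
print(outputs[0].shape)
```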
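The quantization tutorial relies on ONNX Runtime's post-training quantization tooling; a minimal sketch using dynamic quantization (the file names are placeholders):

```python
# Minimal sketch: shrink an exported BERT model with post-training dynamic
# quantization. Weights are stored as 8-bit integers, reducing model size and
# typically speeding up CPU inference. File names are placeholders.
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    model_input="bert.onnx",
    model_output="bert.quant.onnx",
    weight_type=QuantType.QInt8,
)
```

The quantized model loads with the same `InferenceSession` API as the full-precision one.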