Skip to main content
Bioptimus models run on three platforms. Choose the path that fits your use case — each tab below has the steps and links you need.

AWS & SageMaker

Managed cloud endpoints. Best for production pipelines and scale.

On-premise

Self-hosted container. Best for data residency and air-gapped sites.

Hugging Face

Direct model weights for non-commercial academic research (H-Optimus only).
Not sure which model? See Choosing a model. H-Optimus-1 extracts histology features; M-Optimus-1 adds spatial gene-expression prediction (and also produces embeddings).

Runnable notebooks

Prefer to start from a notebook? h1-jumpstart (H-Optimus) and m-jumpstart (M-Optimus) are end-to-end, runnable examples.

Pick your platform

The fastest path to a managed endpoint. Full steps in AWS & SageMaker.
1

Subscribe on AWS Marketplace

Subscribe to H-Optimus-1 or M-Optimus-1 to obtain the Model Package ARN.
2

Deploy a real-time endpoint

We recommend deploying a real-time endpoint on ml.g5.xlarge — the models are compiled for that GPU architecture (NVIDIA A10G, CUDA Compute Capability 8.6). See AWS & SageMaker for the full deploy script.
3

Run inference with the SDK

The SDK routes through the SageMaker /invocations endpoint automatically.
from bioptimus.models.backbones import Backbone
from bioptimus.models.types import Models

model = Backbone(Models.H1, backend="aws",
                 endpoint_name="h-optimus", region_name="us-east-1")
For whole-slide inference, see the SDK guide.

Next steps

SDK guide

Whole-slide inference: tiling, tissue masking, bulk RNA, and output formats.

Choosing a model

H-Optimus vs. M-Optimus, side by side.