AWS & SageMaker
Managed cloud endpoints. Best for production pipelines and scale.
On-premise
Self-hosted container. Best for data residency and air-gapped sites.
Hugging Face
Direct model weights for non-commercial academic research (H-Optimus only).
Not sure which model? See Choosing a model. H-Optimus-1 extracts histology features; M-Optimus-1 adds spatial gene-expression prediction (and also produces embeddings).
Runnable notebooks
Prefer to start from a notebook?
h1-jumpstart (H-Optimus) and m-jumpstart (M-Optimus) are end-to-end, runnable examples.Pick your platform
- AWS & SageMaker
- On-premise
- Hugging Face (academic)
The fastest path to a managed endpoint. Full steps in AWS & SageMaker.
Subscribe on AWS Marketplace
Subscribe to H-Optimus-1 or M-Optimus-1 to obtain the Model Package ARN.
Deploy a real-time endpoint
We recommend deploying a real-time endpoint on
ml.g5.xlarge — the models are compiled for that GPU architecture (NVIDIA A10G, CUDA Compute Capability 8.6). See AWS & SageMaker for the full deploy script.Run inference with the SDK
The SDK routes through the SageMaker For whole-slide inference, see the SDK guide.
/invocations endpoint automatically.Next steps
SDK guide
Whole-slide inference: tiling, tissue masking, bulk RNA, and output formats.
Choosing a model
H-Optimus vs. M-Optimus, side by side.

