Vollo RT Python Example
The Vollo RT Python bindings are provided for convenience, the runtime performance of this API is not a priority.
Here is a minimal way to use the Vollo RT Python bindings:
import vollo_rt
import torch
import os
with vollo_rt.VolloRTContext() as ctx:
ctx.add_accelerator(0)
if ctx.accelerator_block_size(0) == 32:
ctx.load_program(f"{os.environ["VOLLO_SDK"]}/example/identity_b32.vollo")
else:
ctx.load_program(f"{os.environ["VOLLO_SDK"]}/example/identity_b64.vollo")
input = torch.rand(*ctx.model_input_shape()).bfloat16()
output = ctx.run(input)
torch.testing.assert_close(input, output)
print("Success!")