Vollo RT Python Example

The Vollo RT Python bindings are provided for convenience, the runtime performance of this API is not a priority.

Here is a minimal way to use the Vollo RT Python bindings:

import vollo_rt
import torch
import os

with vollo_rt.VolloRTContext() as ctx:
  ctx.add_accelerator(0)

  if ctx.accelerator_block_size(0) == 32:
	  ctx.load_program(f"{os.environ["VOLLO_SDK"]}/example/identity_b32.vollo")
  else:
	  ctx.load_program(f"{os.environ["VOLLO_SDK"]}/example/identity_b64.vollo")

  input = torch.rand(*ctx.model_input_shape()).bfloat16()
  output = ctx.run(input)

  torch.testing.assert_close(input, output)
  print("Success!")