NKIPy Documentation

NKIPy Documentation#

Welcome to the NKIPy documentation! NKIPy provides a NumPy-like tensor-level programming layer on top of NKI (Neuron Kernel Interface) for AWS Trainium.

For an overview of the project, installation instructions, and basic usage examples, see the README.

Key Features#

  • NumPy-like API: Write kernels using familiar NumPy syntax with Python control flow

  • HLO Lowering: Tensor operations are traced and lowered to HLO for compilation

  • Neuron Compiler Integration: Direct integration with neuronx-cc for generating NKI or executables

Runtime#

For kernel execution, NKIPy uses Spike, a lightweight Pythonic runtime layer for AWS Neuron. See the Spike README for details on the runtime architecture and API.