Architecture overview

Architecture components

Coral NPU is a complete reference design for neural processing units (NPUs) based on the open RISC-V standard. Coral NPU is a composition consisting of three core components:

  • A scalar core for traditional CPU functions
  • A vector execution unit for additional computational features
  • A matrix execution unit for ML model operations

Kelvin architecture components

Together, these three components provide the complete feature set and performance required for a Coral NPU-based NPU. Any chip vendor designing a discrete NPU or integrating Coral NPU into a System-on-Chip (SoC) will need to incorporate all three components for a complete solution.

The main features and functions offered by each of the components are the following:

  • Scalar core:

    • Serves as the in-order, non-speculative frontend processor.
    • Drives the command queues for the vector and matrix execution units.
    • Fully compliant with the open RISC-V 32-bit base ISA standard (RV32I).
    • Features 31 general-purpose scalar registers, each 32 bits wide.
    • Offers a C-programmable interface for managing loops, control flow, flexible type encodings, and instruction compression for the SIMD/vector backend.
  • Vector execution unit:

    • Performs a wide range of vector and machine learning (ML) computations, including array operations, ML activation functions, and reductions.
    • Based on a Single Instruction, Multiple Data (SIMD) design.
    • Decoupled from the scalar frontend by a FIFO command queue, which buffers vector instructions.
    • Equipped with 32 vector registers, each 256 bits wide (capable of holding eight 32-bit integers).
    • Natively supports 8-bit, 16-bit, and 32-bit data widths.
  • Matrix execution unit:

    • Accelerates matrix multiply-accumulate (Matmul) operations essential for ML, such as matrix multiplication and convolutions.
    • Features an outer-product engine capable of 256 multiply-accumulate (MAC) operations per cycle.
    • Status: Currently under development and evaluation as part of the RISC-V matrix extension task group

Refer to the Architecture Basics page for more details of each of these features.

Platform roadmap

Coral NPU's goal is to provide the industry with an NPU reference implementation, based on RISC-V standards, that is easy to adopt and integrate. This approach provides SoC vendors the flexibility to choose their own customized component, such as custom matrix execution unit, alongside Coral NPU’s validated IP.

The initial Coral NPU release includes our RISC-V compliant scalar core and vector execution unit. A future release will introduce our RISC-V matrix execution unit, which will provide a complete, end-to-end, open-source stack for a fully RISC-V compliant NPU. Below is a high-level roadmap for planned features and enhancements to the platform:

Kelvin platform roadmap

  • Milestone 1 - Full RISC-V compliant scalar core
  • Milestone 2 - Full RISC-V compliant vector execution unit
  • Milestone 3 - Floating point and expanded VLEN for LLM support
  • Milestone 4 - Full RISC-V compliant matrix execution unit
  • Horizon - CHERI (Capability Hardware Enhanced RISC Instructions) and 64-bit support