Magyar English

Four Cores, One Chip

The CFPU core family: Nano, Actor, Rich, Seal
April 18, 2026 · Hocza József Szabolcs
Seal Core — the security core at the center of the CFPU chip

Dual specialization

The CFPU is not a single core multiplied by thousands. Different workloads demand different processor geometries — which is why the CFPU core family branches in two directions from a single common base: the Nano Core.

The first branch grows toward general-purpose programming: Nano → Actor → Rich. Each step adds more SRAM and a richer ISA — Actor is optimized for object handling and GC, Rich for floating-point computation.

The second branch diverges toward security: Nano → Seal. The Seal Core contains cryptographic primitives (SHA-256, WOTS+, Merkle), code authentication, and eFuse management — not a compute core, but a security gatekeeper.

           Nano Core (CIL-T0, base)
              /              \
     Actor (+GC+Obj)     Seal (+Crypto)
        |
     Rich (+FPU)

This dual specialization is unique to the CFPU: no other processor family offers both directions — programming and security — on the same interconnect, with the same message format, parameterized from the same RTL.

ML/SNN inference is handled not by programmable cores but by MAC Slices — FSM-driven compute units even simpler than Nano. These are specified in the CFPU-ML-Max document.

The four core types

Nano Core — spike / sensor

The smallest and most numerous core. Integer arithmetic only, 4–64 KB SRAM, minimal ISA (48 CIL-T0 opcodes), mailbox-based messaging. Designed for sensors, edge processing, and SNN spike propagation — where precision is less important than speed and energy efficiency.

Actor Core — general actor

The CFPU's primary workhorse. Full CIL ISA with object support, GC, generics, and exception handling, 64–256 KB SRAM, mailbox-based messaging. Symphact actors, server logic, general-purpose parallel processing. In most CFPU configurations, this is the dominant core type. In the CFPU-ML chip, Actor Cores handle non-MAC operations (LayerNorm, Softmax, Residual).

Rich Core — FP actor

An extension of the Actor Core with an IEEE-754 floating-point unit (power-gated). 128–512 KB SRAM. Numerical computation, scientific workloads, financial models, complex business logic. The baseline for all external CPU comparisons.

Seal Core — authentication

A security core placed at the chip center (integrated with the L3 crossbar in monolithic CFPU chips, on the IOD in chiplet CFPU-ML). Code authentication (SHA-256, WOTS+, Merkle), key management, eFuse, TRNG, isolation boundary enforcement. Not a compute core — a security gatekeeper. Scales without saturation to ~30,000 cores.

Core comparison table

Property Nano Actor Rich Seal
ISA CIL-T0 (48 opcodes, int32) Full CIL (obj, GC, generics) Full CIL + FPU CIL-Seal (CIL-T0 subset + crypto)
FPU ✓ (IEEE-754, power-gated)
GC + Obj ✓ (bump alloc, mark-sweep)
Exception handling Trap only Full (throw/catch/finally) Full Trap only
Crypto HW ✓ (SHA-256, WOTS+, Merkle, eFuse, TRNG)
Mailbox dedicated inbox
SRAM range 4–64 KB 64–256 KB 128–512 KB ~32 KB
Logic (5nm) 0.005 mm² 0.010 mm² 0.012 mm² ~0.11 mm²
Power gating Per-core clock gating Per-core clock gating FPU separate domain Wake-on-demand

Core count comparison (5nm, chiplet, 18-tine package)

Using the same chiplet layout as CFPU-ML-Max: 85 mm² tine die (5nm), ~83 mm² usable per tine, 18 tines per package (SoIC + CoWoS). Logic includes +25% design margin, SRAM uses ISSCC reference density (0.021 mm²/Mbit).

Core type SRAM Core size (5nm) Cores / tine Cores / package (18 tine) Primary use
Nano 4 KB ~0.007 mm² ~10,400 ~187,000 Spike / sensor / edge
Actor 64 KB ~0.023 mm² ~3,170 ~57,000 General actor / Symphact
Rich 256 KB ~0.057 mm² ~1,280 ~23,000 FP compute / scientific code
MAC Slice 8 KB (M SKU) ~0.015 mm² ~5,264 ~94,752 ML / SNN inference

The Seal Core resides on the IOD (or at the center of a monolithic chip), in 1+ instances — therefore not included in the table. Core sizes include logic (+25% margin) and SRAM; per-tine counts assume ~12% routing/infrastructure overhead. The MAC Slice row is from the CFPU-ML-Max specification.

CFPU product family

Variant Primary units Target platform Notes
CFPU-N Seal + Nano IoT / edge sensor Maximum core count, minimum power
CFPU-A Seal + Actor Server / cloud Symphact reference platform
CFPU-R Seal + Rich Scientific / HPC FPU + GC, competes with discrete CPUs
CFPU-ML Seal + MAC Slice + Actor ML/SNN inference Chiplet: tine dies + IOD
CFPU-H Seal + Actor + Nano Hybrid server Heterogeneous supervisor + worker
CFPU-X Mixed (any combination) Research / prototype Application-specific

CFPU-ML: the ML/SNN inference chip

The CFPU-ML variant is optimized for ML/SNN inference. It consists not of programmable cores but of MAC Slices (FSM-driven compute units) + Actor Cores + Seal Core. A MAC Slice is not a CIL processor — it has no program counter, it only streams multiply-accumulate operations. Control is handled by Actor Cores.

Detailed chiplet specification, MAC Slice architecture, SKU family, competitive comparison, and model coverage: CFPU-ML-Max: ML/SNN Inference Accelerator →

Open source

The CLI-CPU project is fully open source. The complete design process, every decision and rationale, is publicly available.

The CFPU is not a single core — it is four specialized core types + MAC Slice compute units operating on the same chip (or chiplet package), over the same message network. Specialization does not fragment the system: it unifies it.