Archive

Tokenizers

ZML is a production inference stack, purpose-built to decouple AI workloads from proprietary hardware.

01

engineering

10x faster tokenization

Integrating with the IREE tokenizer for a 10x uplift in tokenization performance.

Apr 7, 2026 monitoring / iree