01
engineering
10x faster tokenization
Integrating with the IREE tokenizer for a 10x uplift in tokenization performance.
Archive
ZML is a production inference stack, purpose-built to decouple AI workloads from proprietary hardware.
engineering
Integrating with the IREE tokenizer for a 10x uplift in tokenization performance.