Pickle files are cringe, but they're also basically unavoidable when working wit...

osanseviero · 2023-09-18T16:17:24

You should check out safetensors. They are used widely in diffusion models and LLMs https://huggingface.co/blog/safetensors-security-audit

jklehm · 2023-09-18T16:12:28

ONNX[0], model-as-protosbufs, continuing to gain adoption will hopefully solve this issue.

bunderbunder · 2023-09-18T16:23:56

ONNX is cool, but it still only supports a minority of scikit-learn components. Some of them simply aren't compatible with ONNX's basic design.

mxz3000 · 2023-09-18T16:15:04

at work we use the ONNX serialisation format for all of our prod models. Those get loaded by the ONNX runtime for inference. works great.

perhaps it's be viable to add support for the ONNX format even for use cases like model checkpointing during training, etc ?