Llamafile is great and I love it. I run all my models with it and it's super portable; I've tested it on Windows and Linux, on a powerful PC and on an SBC, and it worked great without too many issues.
It takes about a month for features from llama.cpp to trickle in. Also, figuring out the best balance of context length, VRAM usage, and desired speed takes a while before it gets intuitive. See the sketch below for the knobs I mean.
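As a rough sketch of that tuning, llamafile passes through the usual llama.cpp flags (-c for context size, -ngl for GPU layer offload); the model filename here is just a placeholder:

```sh
# Placeholder model name; -c sets the context length, -ngl sets how many
# layers get offloaded to the GPU. Dropping -c or offloading fewer layers
# cuts VRAM use, at the cost of context window or generation speed.
./Meta-Llama-3-8B-Instruct.Q4_0.llamafile -ngl 999 -c 4096 -p "Hello"
```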
https://github.com/Mozilla-Ocho/llamafile