(Optional) If you are running decoding with gemma-2 models, you will also need to install flashinfer:
    python -m pip install flashinfer -i https://flashinfer.ai/whl ...
[@risingsunomi] I first learned about Exo after beginning work on my own distributed inference idea in Ziglang. They appeared in my feed on X as I was searching online for resources, and I became ...