Popular repositories
-
glm4-7flash-opus-colab (Public)
Ready-to-run Colab notebook for running GLM-4.7-Flash Finetuned on Claude Opus 4.5 xHigh-Reasoning (GGUF) with llama.cpp, featuring GPU/CPU split loading, streaming chat, multi-chat manager, and a Gradi…
Jupyter Notebook
-
preman (Public)
An out-of-core inference engine designed specifically for MoE models, built on a Predictor-Manager (PreMan) architecture.
Python