Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
Repositories
Showing 10 of 89 repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic/vllm’s past year of commit activity - lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
neuralmagic/lmms-eval’s past year of commit activity - DeepEP Public Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
neuralmagic/DeepEP’s past year of commit activity - mini-swe-agent Public Forked from SWE-agent/mini-swe-agent
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
neuralmagic/mini-swe-agent’s past year of commit activity - nyann_poker Public
neuralmagic/nyann_poker’s past year of commit activity - model-validation-configs Public
neuralmagic/model-validation-configs’s past year of commit activity - GuardBench Public Forked from eldarkurtic/GuardBench
A Python library for guardrail models evaluation with vLLM support.
neuralmagic/GuardBench’s past year of commit activity - vllm-fork Public Forked from tlrmchlsmth/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic/vllm-fork’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…