Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
-
deepsparse
deepsparse Public archiveSparsity-aware deep learning inference runtime for CPUs
Repositories
Showing 10 of 89 repositories
- lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
neuralmagic/lmms-eval's past year of commit activity - mini-swe-agent Public Forked from SWE-agent/mini-swe-agent
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo--but scores >74% on SWE-bench verified!
neuralmagic/mini-swe-agent's past year of commit activity - nyann_poker Public
neuralmagic/nyann_poker's past year of commit activity - model-validation-configs Public
neuralmagic/model-validation-configs's past year of commit activity - GuardBench Public Forked from eldarkurtic/GuardBench
A Python library for guardrail models evaluation with vLLM support.
neuralmagic/GuardBench's past year of commit activity - vllm-fork Public Forked from tlrmchlsmth/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic/vllm-fork's past year of commit activity
Top languages
Loading...
Most used topics
Loading...