@neuralmagic

Neural Magic

Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM.

Pinned

  1. nm-vllm-certs Public

    General information, model certifications, and benchmarks for nm-vllm enterprise distributions

    11 stars · 2 forks

  2. deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python · 3.1k stars · 185 forks

  3. sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python · 2.1k stars · 153 forks

  4. docs Public

    Top-level directory for documentation and general content

    MDX · 121 stars · 7 forks

  5. sparsezoo Public

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python · 384 stars · 27 forks

  6. guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python · 300 stars · 39 forks

Repositories

Showing 10 of 72 repositories
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 12 stars · Apache-2.0 · 7,605 forks · 0 issues · 9 pull requests · Updated May 21, 2025
  • compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    Python · 113 stars · Apache-2.0 · 11 forks · 5 issues · 14 pull requests · Updated May 21, 2025
  • research Public

    Repository to enable research flows

    Python · 0 stars · 0 forks · 0 issues · 1 pull request · Updated May 21, 2025
  • yolov5 Public Forked from ultralytics/yolov5

    YOLOv5 in PyTorch > ONNX > CoreML > TFLite

    Python · 19 stars · GPL-3.0 · 17,120 forks · 0 issues · 3 pull requests · Updated May 21, 2025
  • speculators Public
    Python · 2 stars · Apache-2.0 · 0 forks · 19 issues · 4 pull requests · Updated May 20, 2025
  • guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python · 300 stars · Apache-2.0 · 39 forks · 35 issues (2 issues need help) · 16 pull requests · Updated May 20, 2025
  • model-validation-configs
    0 stars · 0 forks · 0 issues · 9 pull requests · Updated May 20, 2025
  • sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python · 2,135 stars · Apache-2.0 · 153 forks · 1 issue · 3 pull requests · Updated May 20, 2025
  • lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval

    Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

    Python · 0 stars · 280 forks · 0 issues · 7 pull requests · Updated May 19, 2025
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    Python · 0 stars · Apache-2.0 · 1,027 forks · 0 issues · 3 pull requests · Updated May 18, 2025
