npu-optimization

Here is 1 public repository matching this topic...

Tensorbit-Labs / tensorbit-core

High-performance C++ engine for second-order Hessian pruning. The surgical foundation of the Tensorbit Labs P-D-Q pipeline for ultra-efficient edge inference with LLMs and Vision Transformers.

sparsity cpp inference-engine model-compression edge-ai llm llm-optimization llm-infrastructure npu-optimization hessian-pruning tensorbit

Updated May 2, 2026
C++
