Quantization and inference library for running LLMs locally on modern consumer-class GPUs.
0.0.25
x86_64-linux
x86_64-windows