AI学习开发AI学习开发Github库

LLaMA CPP

Inference of LLaMA model in pure C/C++, The main goal of llama.cpp is to run the LLaMA model using 4-bit integer quantization on a MacBook.

标签: