GGUF quantization of LLMs with llama cpp