Faster LLM Inference: Speeding Up Falcon 7B (with QLoRA Adapter) Prediction Time