Can anyone please explain how to run inference with this model? Also, can it be fine-tuned on a small GPU (14 GB) using PEFT? Any guidance would be very helpful.