Casefleet
Repositories
Showing 10 of 11 repositories
exllama (public, forked from turboderp/exllama)
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
People
This organization has no public members.