DavidLanz - Overview
Popular repositories
-
Forked from dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
-
Fine-tune the recent Llama-2-7b model on a single GPU and turn it into a chatbot. I will leverage the PEFT library from the Hugging Face ecosystem, as well as QLoRA for more memory-efficient fine-tuning.
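
To give a rough sense of why adapter-based fine-tuning fits on a single GPU, here is a minimal sketch that counts the trainable parameters a LoRA adapter adds. It is not taken from the repository: the rank of 16 and the choice to adapt only the query and value projections are assumptions for illustration, and the helper name `lora_trainable_params` is hypothetical. The hidden size (4096) and layer count (32) are those of Llama-2-7b.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


HIDDEN_SIZE = 4096      # Llama-2-7b hidden dimension
NUM_LAYERS = 32         # Llama-2-7b transformer layers
RANK = 16               # assumption: LoRA rank chosen for illustration
ADAPTED_PER_LAYER = 2   # assumption: adapt only the query and value projections

# One adapter on a 4096 x 4096 projection matrix.
per_matrix = lora_trainable_params(HIDDEN_SIZE, HIDDEN_SIZE, RANK)

# All adapters across the model.
total = per_matrix * ADAPTED_PER_LAYER * NUM_LAYERS

# Size of the frozen projection matrix that each adapter sits beside.
full_matrix = HIDDEN_SIZE * HIDDEN_SIZE

print(f"trainable params per adapted matrix: {per_matrix:,}")
print(f"trainable params in total:           {total:,}")
print(f"adapter / full matrix ratio:         {per_matrix / full_matrix:.4%}")
```

Under these assumptions only a few million parameters receive gradients, while the base weights stay frozen (and, with QLoRA, quantized to 4 bits), which is what keeps optimizer state and gradient memory small.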