The LLaMA models deliver impressive performance despite their relatively small size, with the 13B model outperforming GPT-3 (175B) on most benchmarks!
In this video, I go over how you can make these models even more powerful by finetuning them on your own dataset! (A minimal code sketch is included below the timestamps.)
GitHub:
Model Link:
LLaMA PR:
LLaMA Paper:
GPT3 Paper:
Discord:
#ai #chatgpt #docker #gpt3 #machinelearning #nlp #llama #gpt4 #wandb #llm
Timestamps
00:00 – Intro
00:16 – Model Metrics And Explanation
01:24 – GitHub Repo
01:39 – Finetuning Process Differences
02:36 – Setup Walkthrough
04:23 – Running Docker Image
06:00 – Looking At Run Flags
08:05 – Getting The Model Weights
10:11 – 3090 Server Performance
11:16 – A100 Server Performance
11:40 – WandB Loss Graphs
12:09 – Finetuned Model Inference
13:07 – Outro
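
For reference, here is a minimal finetuning sketch using Hugging Face Transformers. This is an illustration only, not the repo's actual training script: the weight path, dataset file, and hyperparameters below are assumptions, and the Docker-based setup and run flags shown in the video may differ.

from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_path = "./llama-13b-hf"  # hypothetical path to weights converted to HF format

tokenizer = AutoTokenizer.from_pretrained(model_path)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA tokenizers ship without a pad token

model = AutoModelForCausalLM.from_pretrained(model_path)

# "Your own dataset": assumed here to be one training example per line in train.txt.
dataset = load_dataset("text", data_files={"train": "train.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

train_data = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

# Causal-LM collator: labels are shifted copies of the inputs, no masking objective.
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

args = TrainingArguments(
    output_dir="./llama-finetuned",
    per_device_train_batch_size=1,   # a 13B model leaves little headroom per GPU
    gradient_accumulation_steps=16,
    num_train_epochs=3,
    learning_rate=2e-5,
    fp16=True,
    logging_steps=10,
    report_to="wandb",               # produces loss graphs like the ones in the video
)

Trainer(
    model=model,
    args=args,
    train_dataset=train_data,
    data_collator=collator,
).train()

Note that a 13B model in fp16 will not fit on a single 3090; expect to shard it across GPUs or use an A100-class card, as the performance comparison in the video suggests.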