I'm trying to export the llama7b model on my local machine (an RTX 3060 with 12 GB of VRAM, which isn't enough) using the export_meta_llama_bin.py script, and I'm getting a CUDA out-of-memory error.
I see that in generation.py from the llama module (in Meta's repo), CUDA usage is hardcoded.
Does anyone know how to make it run on the CPU with minimal script modification?
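For context, here's the kind of patch I've been experimenting with. This is a minimal sketch only: it assumes the usual CUDA-specific lines in Meta's `generation.py` (`Llama.build`) are the `"nccl"` process group, `torch.cuda.set_device`, and the `torch.cuda.HalfTensor` default tensor type; the exact lines may differ in your checkout, and `ckpt_path` below is a placeholder.

```python
# Hypothetical CPU patch sketch -- not from the repo; verify the
# corresponding lines in your copy of Meta's llama/generation.py.
import os
import torch
import torch.distributed as dist

def init_cpu_process_group():
    """Replace the hardcoded 'nccl' backend (which needs a GPU)
    with 'gloo' for a single-process CPU run."""
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29500")
    os.environ.setdefault("RANK", "0")
    os.environ.setdefault("WORLD_SIZE", "1")
    dist.init_process_group("gloo")

init_cpu_process_group()

# In Llama.build, drop the CUDA-only lines:
#   torch.cuda.set_device(local_rank)
#   torch.set_default_tensor_type(torch.cuda.HalfTensor)
# and use a CPU default instead:
torch.set_default_tensor_type(torch.FloatTensor)  # fp32 on CPU

# Also make sure the checkpoint is mapped to CPU when loaded:
# checkpoint = torch.load(ckpt_path, map_location="cpu")
```

One caveat: loading the 7B weights in fp32 needs roughly 28 GB of system RAM, so this may still fail on a smaller machine even after the CUDA calls are patched out.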