How do I set it up so I can run it with 24 GB of GPU memory?

I use these settings:
# --gradient_checkpointing \
   --mixed_precision fp16 \
   --use_8bit_adam \
   --set_grads_to_none \
but I get this error:
 return Variable._execution_engine.run_backward(  # Calls into the C++ engine to run the backward pass
torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.25 GiB. GPU 0 has a total capacity of 23.69 GiB of which 313.69 MiB is free. Including non-PyTorch memory, this process has 23.38 GiB memory in use. Of the allocated memory 22.93 GiB is allocated by PyTorch, and 128.80 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.  See documentation for Memory Management  (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)

Can you help me? Thank you!
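
For reference, the figures in the error message above can be put side by side. This is only a rough sketch: the numbers are copied from the traceback, the variable names are made up for illustration, and the "best case" assumes all memory that PyTorch has reserved but not allocated could be reused for the failing request.

```python
# Figures copied from the error message above (1 GiB = 1024 MiB).
MIB_PER_GIB = 1024

requested_mib = 1.25 * MIB_PER_GIB   # "Tried to allocate 1.25 GiB"
free_mib = 313.69                    # "313.69 MiB is free"
reserved_unallocated_mib = 128.80    # "reserved by PyTorch but unallocated"

# Best case (an assumption): all reserved-but-unallocated memory is reusable.
best_case_available_mib = free_mib + reserved_unallocated_mib
shortfall_mib = requested_mib - best_case_available_mib

print(f"requested:         {requested_mib:.2f} MiB")
print(f"available at best: {best_case_available_mib:.2f} MiB")
print(f"shortfall:         {shortfall_mib:.2f} MiB")
```

The shortfall stays positive even in the best case, so `PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True` alone probably would not be enough here. Note also that `--gradient_checkpointing` is commented out in the settings above; gradient checkpointing trades extra compute for lower activation memory, so enabling it may be the first thing to try.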

How do I set it up so I can run it with 24GB? #15
