-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Open
Labels
bugSomething isn't workingSomething isn't working
Description
Related to Model/Framework(s)
SSD/PyTorch
Describe the bug
DLL 2024-09-05 08:00:14.587129 - PARAMETER dataset path : /coco epochs : 65 batch size : 64│F1Help F2Setup F3SearchF4FilterF5List F6SortByF7Nice -F8Nice +F9Kill F10Quit
eval batch size : 32 no cuda : False seed : None checkpoint path : None mode : training├────────────────────────────────────────────────────────────────────────────────────────────── eval on epochs : [21, 31, 37, 42, 48, 53, 59, 64] lr decay epochs : [43, 54] learning rat│ksv@neroksv:~$
e : 0.0026 momentum : 0.9 weight decay : 0.0005 lr warmup : 300 backbone : resnet50 back│
bone path : None num workers : 8 AMP : True precision : amp │
Using seed = 6772 │
Loading annotations into memory... │
Done (t=0.03s) │
Creating index... │
terminate called after throwing an instance of 'simdjson::simdjson_error' │
what(): The JSON element does not have the requested type. │
Aborted (core dumped)
To Reproduce
Steps to reproduce the behavior:
- Install https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Detection/SSD#quick-start-guide
- Set COCO_DIR to my custom dataset
- Change
https://github.com/NVIDIA/DeepLearningExamples/blob/729963dd47e7c8bd462ad10bfac7a7b0b604e6dd/PyTorch/Detection/SSD/ssd/model.py#L61C9-L61C54
to my class count + 1 - Launch python main.py --backbone resnet50 --warmup 300 --bs 64 --d│ 565 root 0.0 0.0 0:00.58 ├─ /lib/systemd/systemd-journald ata /coco
Expected behavior
A clear and concise description of what you expected to happen.
Environment
Please provide at least:
- Container version (22.10-py3):
- GPUs in the system: (1 NVIDIA RTX A5000 - 24GB):
- CUDA driver version (545.23.08):
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working