runtime error
Exit code: 1. Reason:
model-00003-of-00003.safetensors: 100%|█████████▉| 3.59G/3.59G [00:06<00:00, 552MB/s]
Downloading shards: 100%|██████████| 3/3 [00:22<00:00, 7.49s/it]
Loading checkpoint shards: 100%|██████████| 3/3 [00:00<00:00, 54471.48it/s]
generation_config.json: 100%|██████████| 119/119 [00:00<00:00, 1.01MB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 15, in <module>
    model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, torch_dtype=torch.float16, device_map="auto")
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4302, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 500, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.