runtime error

urn F.layer_norm( File "/opt/conda/lib/python3.9/site-packages/torch/nn/functional.py", line 2515, in layer_norm return torch.layer_norm(input, normalized_shape, weight, bias, eps, torch.backends.cudnn.enabled) RuntimeError: "LayerNormKernelImpl" not implemented for 'Half' 2023-10-29T08:03:54.468454Z ERROR warmup{max_input_length=1024 max_prefill_tokens=4096 max_total_tokens=2048}:warmup: text_generation_client: router/client/src/lib.rs:33: Server error: "LayerNormKernelImpl" not implemented for 'Half' Error: Warmup(Generation("\"LayerNormKernelImpl\" not implemented for 'Half'")) 2023-10-29T08:03:54.553953Z ERROR text_generation_launcher: Webserver Crashed 2023-10-29T08:03:54.553979Z  INFO text_generation_launcher: Shutting down shards 2023-10-29T08:03:55.452679Z  INFO shard-manager: text_generation_launcher: Shard terminated rank=0 Error: WebserverFailed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 curl: (7) Failed to connect to 127.0.0.1 port 8080: Connection refused Warning: Transient problem: connection refused Will retry in 10 seconds. 2 Warning: retries left. 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 curl: (7) Failed to connect to 127.0.0.1 port 8080: Connection refused Warning: Transient problem: connection refused Will retry in 10 seconds. 1 Warning: retries left. 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 curl: (7) Failed to connect to 127.0.0.1 port 8080: Connection refused

Container logs:

Fetching error logs...