2026-06-11 20:07:35.841 1 storage.initializer INFO [initializer-entrypoint:():16] Initializing, args: (src_uri, dest_path): [('hf://sshleifer/tiny-gpt2', '/mnt/models')] 2026-06-11 20:07:35.841 1 storage.initializer INFO [kserve_storage.py:download():161] Copying contents of hf://sshleifer/tiny-gpt2 to local Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/wPaCkH-WbT7GsmxMKKrNZTV4nSM=.ae8c63daedbd4206d7d40126955d4e6ab1c80f8f.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_328169d6-366b-406b-96de-702506369c85'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/8_PA_wEVGiVa2goH2H4KQOQpvVY=.2c81a6c4c984e95a45338c64a7445c1f0f88077f.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_d80e6dfc-88bd-4f81-a9c1-539aa88e070e'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/gPcsVCQDYDHk-_n0G9uADl7PXIM=.3cd4987249615ca870f2c18e657a62f0962aecad0eac790dd89362227310fd30.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_da2657fb-e28c-4f2d-b4fa-cdffe31f1e93'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/PtHk0z_I45atnj23IIRhTExwT3w=.226b0752cac7789c48f0cb3ec53eda48b7be36cc.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_db5c7311-94f5-426d-b83d-20ba64762d55'. Continuing without setting permissions. {"timestamp":"2026-06-11T20:08:29.024165Z","level":"WARN","fields":{"message":"Reqwest(reqwest::Error { kind: Request, url: \"https://us.aws.cdn.hf.co/xorbs/default/25327c62b388c329c0a8956b9eaaf146ba4141c4b3a02f20634e52aa0be9a7ba?repo_id=621ffdc136468d709f18090c&user_id=public&X-Xet-Session-Id=01KTW4R9N2P62XXE4SH0BJ5RXF&Expires=1781212059&Policy=eyJTdGF0ZW1lbnQiOlt7IlJlc291cmNlIjoiaHR0cHM6Ly91cy5hd3MuY2RuLmhmLmNvL3hvcmJzL2RlZmF1bHQvMjUzMjdjNjJiMzg4YzMyOWMwYTg5NTZiOWVhYWYxNDZiYTQxNDFjNGIzYTAyZjIwNjM0ZTUyYWEwYmU5YTdiYVxcP3JlcG9faWQ9NjIxZmZkYzEzNjQ2OGQ3MDlmMTgwOTBjJnVzZXJfaWQ9cHVibGljJlgtWGV0LVNlc3Npb24tSWQ9MDFLVFc0UjlOMlA2MlhYRTRTSDBCSjVSWEYiLCJDb25kaXRpb24iOnsiRGF0ZUxlc3NUaGFuIjp7IkVwb2NoVGltZSI6MTc4MTIxMjA1OX0sIkJ5dGVSYW5nZSI6eyJFeHBlY3RlZEhlYWRlciI6ImJ5dGVzPTM5MTA1NS04MzA4MTIifX19XX0_&Signature=MEUCIC9S9xsIrUFOVlfqfoXxxMC4Q4AtbWgeXLvm8WL1v13OAiEA-rWPUtR%7ENEKz65G9WFfyfvDRMESWcuZUce9nuNLj4U8_&Key-Pair-Id=01KAYHXK2CBJSW0YZTMNXK9W1M\", source: hyper_util::client::legacy::Error(Connect, Ssl(Error { code: ErrorCode(5), cause: None }, X509VerifyResult { code: 0, error: \"ok\" })) }). Retrying..."},"filename":"/home/runner/work/xet-core/xet-core/cas_client/src/http_client.rs","line_number":200} {"timestamp":"2026-06-11T20:08:29.024206Z","level":"WARN","fields":{"message":"Retry attempt #0. Sleeping 825.007996ms before the next attempt"},"filename":"/root/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/reqwest-retry-0.7.0/src/middleware.rs","line_number":171} Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/Q1p2l2BzM1m6P5jKvr8WTq1TUio=.b706b24034032bdfe765ded5ab6403d201d295a995b790cb24c74becca5c04e6.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_cac3b990-1adb-425e-98f0-3242b7e08287'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/ahkChHUJFxEmOdq5GDFEmerRzCY=.817762d631ad6f9c799f6b9dc713c46420e65546.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_8cb8ac3e-7cd4-40e2-a7e5-24938023d3f6'. Continuing without setting permissions. {"timestamp":"2026-06-11T20:08:35.057779Z","level":"WARN","fields":{"message":"Status Code: 408. Retrying...","request_id":""},"filename":"/home/runner/work/xet-core/xet-core/cas_client/src/http_client.rs","line_number":194} {"timestamp":"2026-06-11T20:08:35.057806Z","level":"WARN","fields":{"message":"Retry attempt #0. Sleeping 454.602467ms before the next attempt"},"filename":"/root/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/reqwest-retry-0.7.0/src/middleware.rs","line_number":171} Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/a7eHxRFT3OeMBIFg52k2nfj5m7w=.09166c56d46ec8eca5f1e46e4e4a62265fca5975a70a2b263c1795291a685cf7.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_ed8dd7d7-8ae0-4159-8ca9-95730d983a9b'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/vzaExXFZNBay89bvlQv-ZcI6BTg=.be4d21d94f3b4687e5a54d84bf6ab46ed0f8defd.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_5dee72cc-56f3-443e-b28c-57c0307aa378'. Continuing without setting permissions. Could not set the permissions on the file '/mnt/models/.cache/huggingface/download/j3m-Hy6QvBddw8RXA1uSWl1AJ0c=.b00361fece0387ca34b4b8b8539ed830d644dbeb.incomplete'. Error: [Errno 13] Permission denied: '/mnt/tmp_8befd3f1-6fe3-4d94-87fe-723beb3648fc'. Continuing without setting permissions. 2026-06-11 20:08:35.678 1 storage.initializer INFO [kserve_storage.py:download():229] Successfully copied hf://sshleifer/tiny-gpt2 to /mnt/models 2026-06-11 20:08:35.678 1 storage.initializer INFO [kserve_storage.py:download():230] Model downloaded in 59.8365899800001 seconds. I0611 20:08:36.466324 1 config.go:602] "Configuration:" =< { "IP": "10.134.0.37", "PodName": "e2e-unconfigured-facebook-opt-125m-simulated-kserve-5b477cpczrx", "PodNameSpace": "llm", "VllmDevMode": false, "block-size": 16, "data-parallel-rank": -1, "data-parallel-size": 1, "dataset-in-memory": false, "dataset-path": "", "dataset-table-name": "llmd", "dataset-url": "", "default-embedding-dimensions": 384, "ec-transfer-config": "", "enable-kvcache": false, "enable-prefix-caching": false, "enable-request-id-headers": false, "enable-sleep-mode": false, "enforce-eager": false, "event-batch-size": 16, "failure-injection-rate": 0, "failure-types": null, "fake-metrics": null, "fake-metrics-refresh-interval": 100000000, "global-cache-hit-threshold": 0, "hash-seed": "", "inter-token-latency": 0, "inter-token-latency-std-dev": 0, "kv-cache-size": 1024, "kv-cache-transfer-latency": 0, "kv-cache-transfer-latency-std-dev": 0, "kv-cache-transfer-time-per-token": 0, "kv-cache-transfer-time-std-dev": 0, "latency-calculator": "", "lora-modules": null, "max-cpu-loras": 1, "max-loras": 1, "max-model-len": 1024, "max-num-seqs": 5, "max-tool-call-array-param-length": 5, "max-tool-call-integer-param": 100, "max-tool-call-number-param": 100, "max-waiting-queue-length": 1000, "min-tool-call-array-param-length": 1, "min-tool-call-integer-param": 0, "min-tool-call-number-param": 0, "mm-encoder-only": false, "mm-processor-kwargs": "", "mode": "random", "model": "facebook/opt-125m", "object-tool-call-not-required-field-probability": 50, "port": 8000, "prefill-overhead": 0, "prefill-time-per-token": 0, "prefill-time-std-dev": 0, "seed": 1781208516465880300, "self-signed-certs": false, "served-model-name": [ "facebook/opt-125m" ], "ssl-certfile": "/var/run/kserve/tls/tls.crt", "ssl-keyfile": "/var/run/kserve/tls/tls.key", "time-factor-under-load": 1, "time-to-first-token": 0, "time-to-first-token-std-dev": 0, "tool-call-not-required-param-probability": 50, "uds-socket-path": "/tmp/tokenizer/tokenizer-uds.socket", "zmq-endpoint": "tcp://127.0.0.1:5557" } > I0611 20:08:36.498447 1 tokenizer.go:104] "Model is not a real HF model, using simulated tokenizer" model="facebook/opt-125m" I0611 20:08:36.502481 1 context.go:138] "No dataset path or URL provided, using random text for responses" I0611 20:08:36.502564 1 communication.go:49] "Starting communication layer" I0611 20:08:36.502579 1 simulator.go:188] "Start processing routine" I0611 20:08:36.502821 1 http_server_tls.go:44] "HTTPS server starting with certificate files" cert="/var/run/kserve/tls/tls.crt" key="/var/run/kserve/tls/tls.key" I0611 20:08:36.502917 1 grpc.go:126] "Server starting" protocol="gRPC" port=8000 I0611 20:08:36.503450 1 http.go:96] "Server starting" protocol="HTTPS" port=8000