|
2024-04-13 23:56:06,889 INFO StreamThr :132 [internal.py:wandb_internal():86] W&B internal server running at pid: 132, started at: 2024-04-13 23:56:06.888350 |
|
2024-04-13 23:56:06,890 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status |
|
2024-04-13 23:56:07,299 INFO WriterThread:132 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/run-7t5z0ip3.wandb |
|
2024-04-13 23:56:07,300 DEBUG SenderThread:132 [sender.py:send():379] send: header |
|
2024-04-13 23:56:07,303 DEBUG SenderThread:132 [sender.py:send():379] send: run |
|
2024-04-13 23:56:07,449 INFO SenderThread:132 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files |
|
2024-04-13 23:56:07,449 INFO SenderThread:132 [sender.py:_start_run_threads():1124] run started: 7t5z0ip3 with start time 1713052566.889519 |
|
2024-04-13 23:56:07,457 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: check_version |
|
2024-04-13 23:56:07,457 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: check_version |
|
2024-04-13 23:56:07,553 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: run_start |
|
2024-04-13 23:56:07,564 DEBUG HandlerThread:132 [system_info.py:__init__():26] System info init |
|
2024-04-13 23:56:07,564 DEBUG HandlerThread:132 [system_info.py:__init__():41] System info init done |
|
2024-04-13 23:56:07,564 INFO HandlerThread:132 [system_monitor.py:start():194] Starting system monitor |
|
2024-04-13 23:56:07,564 INFO SystemMonitor:132 [system_monitor.py:_start():158] Starting system asset monitoring threads |
|
2024-04-13 23:56:07,565 INFO HandlerThread:132 [system_monitor.py:probe():214] Collecting system info |
|
2024-04-13 23:56:07,565 INFO SystemMonitor:132 [interfaces.py:start():190] Started cpu monitoring |
|
2024-04-13 23:56:07,566 INFO SystemMonitor:132 [interfaces.py:start():190] Started disk monitoring |
|
2024-04-13 23:56:07,566 INFO SystemMonitor:132 [interfaces.py:start():190] Started gpu monitoring |
|
2024-04-13 23:56:07,567 INFO SystemMonitor:132 [interfaces.py:start():190] Started memory monitoring |
|
2024-04-13 23:56:07,568 INFO SystemMonitor:132 [interfaces.py:start():190] Started network monitoring |
|
2024-04-13 23:56:07,585 DEBUG HandlerThread:132 [system_info.py:probe():150] Probing system |
|
2024-04-13 23:56:07,587 DEBUG HandlerThread:132 [gitlib.py:_init_repo():56] git repository is invalid |
|
2024-04-13 23:56:07,587 DEBUG HandlerThread:132 [system_info.py:probe():198] Probing system done |
|
2024-04-13 23:56:07,587 DEBUG HandlerThread:132 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-13T23:56:07.585134', 'startedAt': '2024-04-13T23:56:06.882235', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '89baed725890', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.138, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.138, 'min': 0.0, 'max': 0.0}, {'current': 2000.138, 'min': 0.0, 'max': 0.0}, {'current': 2000.138, 'min': 0.0, 'max': 0.0}, {'current': 2000.138, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5565.551467895508}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}} |
|
2024-04-13 23:56:07,587 INFO HandlerThread:132 [system_monitor.py:probe():224] Finished collecting system info |
|
2024-04-13 23:56:07,587 INFO HandlerThread:132 [system_monitor.py:probe():227] Publishing system info |
|
2024-04-13 23:56:07,587 DEBUG HandlerThread:132 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment |
|
2024-04-13 23:56:08,451 INFO Thread-12 :132 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/conda-environment.yaml |
|
2024-04-13 23:56:22,601 ERROR HandlerThread:132 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds |
|
Traceback (most recent call last): |
|
File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda |
|
subprocess.call( |
|
File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call |
|
return p.wait(timeout=timeout) |
|
File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait |
|
return self._wait(timeout=timeout) |
|
File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait |
|
raise TimeoutExpired(self.args, timeout) |
|
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds |
|
2024-04-13 23:56:22,604 DEBUG HandlerThread:132 [system_info.py:_save_conda():222] Saving conda packages done |
|
2024-04-13 23:56:22,605 INFO HandlerThread:132 [system_monitor.py:probe():229] Finished publishing system info |
|
2024-04-13 23:56:22,612 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:22,612 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: keepalive |
|
2024-04-13 23:56:22,612 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:22,612 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: keepalive |
|
2024-04-13 23:56:22,612 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:22,613 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: keepalive |
|
2024-04-13 23:56:22,613 DEBUG SenderThread:132 [sender.py:send():379] send: files |
|
2024-04-13 23:56:22,613 INFO SenderThread:132 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now |
|
2024-04-13 23:56:22,823 INFO wandb-upload_0:132 [upload_job.py:push():131] Uploaded file /tmp/tmpqfnl0l2kwandb/4o1292rv-wandb-metadata.json |
|
2024-04-13 23:56:23,454 INFO Thread-12 :132 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/wandb-metadata.json |
|
2024-04-13 23:56:23,594 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: python_packages |
|
2024-04-13 23:56:23,595 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: python_packages |
|
2024-04-13 23:56:23,598 DEBUG SenderThread:132 [sender.py:send():379] send: telemetry |
|
2024-04-13 23:56:23,610 DEBUG SenderThread:132 [sender.py:send():379] send: config |
|
2024-04-13 23:56:23,612 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:56:23,610 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: internal_messages |
|
2024-04-13 23:56:23,613 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: stop_status |
|
2024-04-13 23:56:23,614 DEBUG SenderThread:132 [sender.py:send():379] send: telemetry |
|
2024-04-13 23:56:23,614 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: stop_status |
|
2024-04-13 23:56:23,654 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:56:23,654 WARNING SenderThread:132 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen) |
|
2024-04-13 23:56:23,654 DEBUG SenderThread:132 [sender.py:send():379] send: telemetry |
|
2024-04-13 23:56:24,454 INFO Thread-12 :132 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/requirements.txt |
|
2024-04-13 23:56:24,455 INFO Thread-12 :132 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/output.log |
|
2024-04-13 23:56:26,455 INFO Thread-12 :132 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/output.log |
|
2024-04-13 23:56:26,621 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: pause |
|
2024-04-13 23:56:26,621 INFO HandlerThread:132 [handler.py:handle_request_pause():708] stopping system metrics thread |
|
2024-04-13 23:56:26,621 INFO HandlerThread:132 [system_monitor.py:finish():203] Stopping system monitor |
|
2024-04-13 23:56:26,621 DEBUG SystemMonitor:132 [system_monitor.py:_start():172] Starting system metrics aggregation loop |
|
2024-04-13 23:56:26,621 DEBUG SystemMonitor:132 [system_monitor.py:_start():179] Finished system metrics aggregation loop |
|
2024-04-13 23:56:26,621 DEBUG SystemMonitor:132 [system_monitor.py:_start():183] Publishing last batch of metrics |
|
2024-04-13 23:56:26,622 INFO HandlerThread:132 [interfaces.py:finish():202] Joined cpu monitor |
|
2024-04-13 23:56:26,623 INFO HandlerThread:132 [interfaces.py:finish():202] Joined disk monitor |
|
2024-04-13 23:56:26,632 INFO HandlerThread:132 [interfaces.py:finish():202] Joined gpu monitor |
|
2024-04-13 23:56:26,633 INFO HandlerThread:132 [interfaces.py:finish():202] Joined memory monitor |
|
2024-04-13 23:56:26,633 INFO HandlerThread:132 [interfaces.py:finish():202] Joined network monitor |
|
2024-04-13 23:56:28,407 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:33,408 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:38,415 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:38,596 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: stop_status |
|
2024-04-13 23:56:38,596 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: stop_status |
|
2024-04-13 23:56:38,599 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: internal_messages |
|
2024-04-13 23:56:39,505 INFO Thread-12 :132 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/config.yaml |
|
2024-04-13 23:56:43,649 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:48,649 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:53,596 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: stop_status |
|
2024-04-13 23:56:53,597 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: stop_status |
|
2024-04-13 23:56:53,637 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: internal_messages |
|
2024-04-13 23:56:53,682 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:56:58,684 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:03,684 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:08,596 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: stop_status |
|
2024-04-13 23:57:08,597 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: stop_status |
|
2024-04-13 23:57:08,637 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: internal_messages |
|
2024-04-13 23:57:09,648 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:14,649 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:15,685 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: resume |
|
2024-04-13 23:57:15,686 INFO HandlerThread:132 [handler.py:handle_request_resume():699] starting system metrics thread |
|
2024-04-13 23:57:15,686 INFO HandlerThread:132 [system_monitor.py:start():194] Starting system monitor |
|
2024-04-13 23:57:15,686 INFO SystemMonitor:132 [system_monitor.py:_start():158] Starting system asset monitoring threads |
|
2024-04-13 23:57:15,686 INFO SystemMonitor:132 [interfaces.py:start():190] Started cpu monitoring |
|
2024-04-13 23:57:15,687 INFO SystemMonitor:132 [interfaces.py:start():190] Started disk monitoring |
|
2024-04-13 23:57:15,687 INFO SystemMonitor:132 [interfaces.py:start():190] Started gpu monitoring |
|
2024-04-13 23:57:15,688 INFO SystemMonitor:132 [interfaces.py:start():190] Started memory monitoring |
|
2024-04-13 23:57:15,689 INFO SystemMonitor:132 [interfaces.py:start():190] Started network monitoring |
|
2024-04-13 23:57:19,113 DEBUG SenderThread:132 [sender.py:send():379] send: config |
|
2024-04-13 23:57:19,114 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:19,115 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:19,115 WARNING SenderThread:132 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen) |
|
2024-04-13 23:57:20,115 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:20,519 INFO Thread-12 :132 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/output.log |
|
2024-04-13 23:57:21,519 INFO Thread-12 :132 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/output.log |
|
2024-04-13 23:57:23,599 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: internal_messages |
|
2024-04-13 23:57:23,600 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: stop_status |
|
2024-04-13 23:57:23,601 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: stop_status |
|
2024-04-13 23:57:25,058 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: partial_history |
|
2024-04-13 23:57:25,060 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:25,060 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:25,060 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:25,060 DEBUG SenderThread:132 [sender.py:send():379] send: metric |
|
2024-04-13 23:57:25,060 DEBUG SenderThread:132 [sender.py:send():379] send: history |
|
2024-04-13 23:57:25,061 DEBUG SenderThread:132 [sender.py:send_request():406] send_request: summary_record |
|
2024-04-13 23:57:25,061 INFO SenderThread:132 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end |
|
2024-04-13 23:57:25,521 INFO Thread-12 :132 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/wandb-summary.json |
|
2024-04-13 23:57:25,913 DEBUG HandlerThread:132 [handler.py:handle_request():146] handle_request: status_report |
|
2024-04-13 23:57:26,521 INFO Thread-12 :132 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_235606-7t5z0ip3/files/output.log |
|
|