|
|
#430 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
153D₁₆ Posts |
I had been reliably getting a new session on one account, and it was iffy on the other. Shortly before noon both terminated early, and I haven't been able to get a session on either for hours, so I can't test any scripts at the moment.
|
|
|
|
|
|
#431 |
|
Feb 2005
Colorado
2·7·47 Posts |
|
|
|
|
|
|
#432 |
|
Jun 2019
Boston, MA
3·13 Posts |
So my Kaggle commits just finished after running 9 hours, and no output tab exists! I was running a single long job with no intention of it finishing, but was hoping to collect the checkpoint file and resume with another session... it says exited with error code 137 in the log.
The run info section also says output size 0 (see attached). Does it matter that I'm moving my input and exe to /usr/loca/bin/ and running everything there? Is there somewhere else I need to move/run files in order for them to be captured as output? |
|
|
|
|
|
#433 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,437 Posts |
No VM backend at all. With gpu accelerator, or without.
Last fiddled with by kriesel on 2019-10-22 at 22:28 |
|
|
|
|
|
#434 | |
|
Feb 2005
Colorado
658₁₀ Posts |
Quote:
|
|
|
|
|
|
|
#435 |
|
Jun 2019
Boston, MA
47₈ Posts |
Here's what I see when I click "1 commit" -- nothing is clearly labelled as tabs, and clicking on any of the columns returns me to the same page I showed in my prior post.
|
|
|
|
|
|
#436 | |
|
Feb 2005
Colorado
2·7·47 Posts |
Quote:
The difference is that I have a green check mark on the far left, indicating a successful run, instead of a red X. If your Version 1 isn't clickable, then that must be why. |
|
|
|
|
|
|
#437 | |
|
Jun 2019
Boston, MA
3·13 Posts |
Quote:
In post 418, axn said I should have an output tab regardless of whether the kernel is killed after 9 hours or completes... I lost 9 hours of GPU quota and the kernel ran, so why no output...?
Also, can anyone confirm my reading of the log? It looks like it failed with error code 137 within 5 seconds, but it gives no explanation and didn't stop running. Supposedly that's a memory error code, but I'm running something that requires little memory and that I know works from trying it in draft mode.
Finally, I'd like to test something, but I need help from a Python-savvy user out there: is there a way to issue a keyboard interrupt (i.e. Ctrl+C) after a certain time delay? My thought is to have my code interrupt my script before the kernel times out; maybe that will result in a successful "complete" status, since all of the code cells will run, hopefully giving me some output... thoughts?
Last fiddled with by mnd9 on 2019-10-23 at 01:15 Reason: More info |
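One way to approximate the delayed Ctrl+C asked about here is to launch the program as a child process from Python, wait for a fixed delay, and then send it SIGINT, which is the signal Ctrl+C delivers on Linux. This is only a minimal sketch and not something confirmed in the thread: `run_then_interrupt`, the command, and both timeouts are hypothetical names and values, and whether Kaggle then reports the commit as complete is untested.

```python
import signal
import subprocess


def run_then_interrupt(cmd, delay_seconds, grace_seconds=30):
    """Run cmd; if it is still alive after delay_seconds, send it SIGINT.

    Returns the child's exit code (negative if it was ended by a signal).
    """
    proc = subprocess.Popen(cmd)
    try:
        # Normal case: the program finishes on its own before the deadline.
        proc.wait(timeout=delay_seconds)
    except subprocess.TimeoutExpired:
        # Deadline reached: deliver the same signal Ctrl+C would.
        proc.send_signal(signal.SIGINT)
        try:
            # Give the program time to write its checkpoint and exit.
            proc.wait(timeout=grace_seconds)
        except subprocess.TimeoutExpired:
            # It ignored SIGINT; force it to stop so the cell can end.
            proc.kill()
            proc.wait()
    return proc.returncode
```

In a notebook cell this might be called as `run_then_interrupt(["./my_program"], 8.5 * 3600)`, where `./my_program` is a placeholder and the delay is chosen to end comfortably before the 9-hour limit mentioned above.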
|
|
|
|
|
|
#438 | ||
|
Feb 2005
Colorado
292₁₆ Posts |
Quote:
Quote:
|
||
|
|
|
|
|
#439 | |
|
Jun 2003
5,087 Posts |
Quote:
EDIT:- /kaggle/working
Last fiddled with by axn on 2019-10-23 at 02:50 |
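If the program has to run somewhere other than `/kaggle/working`, a final notebook cell could copy its checkpoint files there before the session ends. The sketch below assumes, going only by the reply above, that files in `/kaggle/working` are what get collected as output; `collect_outputs`, the run directory, and the file patterns are hypothetical.

```python
import shutil
from pathlib import Path


def collect_outputs(run_dir, output_dir="/kaggle/working", patterns=("*",)):
    """Copy files matching any pattern from run_dir into output_dir.

    Returns the names of the copied files, in the order they were copied.
    """
    run_dir = Path(run_dir)
    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for pattern in patterns:
        for src in sorted(run_dir.glob(pattern)):
            if src.is_file():
                # copy2 also preserves timestamps, which helps when
                # deciding which checkpoint is the most recent one.
                shutil.copy2(src, output_dir / src.name)
                copied.append(src.name)
    return copied
```

A call such as `collect_outputs("/path/to/run_dir", patterns=("p*", "results*"))` would pick up checkpoint and results files by name prefix; both the path and the patterns here are placeholders to adapt.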
|
|
|
|
|
|
#440 |
|
Jun 2003
13DF₁₆ Posts |
No need. This is what I'm doing. If my run completes in 9 hours, I get results.json.txt; otherwise I get the pXXXXX files. I also get the mprime executable and all the accompanying text files, because I run the program directly from the default folder.
|
|
|
|