Completed WU still running!

Fehler und Wünsche zum Projekt yoyo@home
Bugs and wishes for the project yoyo@home
Nachricht
Autor
Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Completed WU still running!

#1 Ungelesener Beitrag von Dunckx » 09.09.2018 13:29

Well, here's a new one I haven't seen before. A completed WU is still running!

I noticed yesterday there was a WU which claimed that 00:00:00 remained and yet was still running. Today the time remaining has disappeared, but it's still there, crunching away.
Completed WU still running 090918.jpg
After 1 day 23 hours 42 minutes it's still going!
Completed WU still running 090918.jpg (230.58 KiB) 7762 mal betrachtet
Do I kill it, or let it run?

Dunckx

Benutzeravatar
gemini8
Vereinsvorstand
Vereinsvorstand
Beiträge: 5898
Registriert: 31.05.2011 10:30
Wohnort: Hannover

Re: Completed WU still running!

#2 Ungelesener Beitrag von gemini8 » 09.09.2018 14:47

I've had something similar as well quite a while ago.
Does that WU still consume CPU-time?
If so, let it run, as it's only the estimate which is off.
If it doesn't, then try quitting and restarting Boinc to make it run again.
If that doesn't help, please wait for what yoyo will tell about the situation. ;-)
Gruß, Jens
- - - - - -
Lowend-User und Teilzeit-Cruncher

Bild Bild Bild
Bild

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#3 Ungelesener Beitrag von Dunckx » 09.09.2018 15:58

Thanks for the feedback Jens. What bothers me is that the WU has apparently not checkpointed since it started, now over two days ago. I have another WU which has 9 hours+ elapsed and last checkpointed over two hours ago, but the other six running have checkpoints within the last half hour.

I thought they were meant to write a checkpoint file every half hour or so, but maybe I got that wrong. It is still using CPU time though.

Thanks for getting back to me.
Dunckx

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8045
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: Completed WU still running!

#4 Ungelesener Beitrag von yoyo » 09.09.2018 20:24

Can you post the hostID?
Have you throttled CPU usage?

The P1 and P2 WUs do not have checkpoint. The P2 which you have runs on an Intel processor only round about 1 hour, but consumes much RAM.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#5 Ungelesener Beitrag von Dunckx » 09.09.2018 21:00

Host ID is 119264. CPU isn't throttled, but I suspended the task until I heard from you. I've now set it to resume and it's waiting to run, it should start up again in around 15 minutes after the next (small) WU completes. I've seen the P2 take all the RAM my system has to the point of waiting for memory, but yes, they do tend to finish in little over an hour. I've never seen any kind of WU take two days on this PC, this is a first, assuming all is as it should be. Now two days, three hours and fortyone minutes...

Thanks for coming back to me.
Dunckx

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#6 Ungelesener Beitrag von Dunckx » 10.09.2018 09:53

Now two days, sixteen hours and thirtythree minutes...

Dunckx

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#7 Ungelesener Beitrag von Dunckx » 11.09.2018 10:20

After three days and nearly seventeen hours, it is still running and still using 6GB of RAM. Also, it has now missed the deadline, which was in the early hours of this morning.
Completed WU still running 110918.jpg
Missed the deadline!
Completed WU still running 110918.jpg (234.67 KiB) 7694 mal betrachtet
So, there's no point in letting it still run. Is it OK to kill it, is there any info you want from this run, any xml files, logs?
Dunckx

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8045
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: Completed WU still running!

#8 Ungelesener Beitrag von yoyo » 11.09.2018 10:42

Can you send the content of the slot directory to me, yoyo(a)mailueberfall.de?
Afterwards abort the WU.
Maybe I see something in the logs.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#9 Ungelesener Beitrag von Dunckx » 11.09.2018 17:50

OK, I'm emailing part of the slot directory, three files couldn't be copied because they were in use.

I then went to close BOINC and the pc crashed with a BSOD "not shut down properly" error.
Completed WU still running past the deadline.jpg
Completed WU still running past the deadline.jpg (231.57 KiB) 7676 mal betrachtet
Eventually I got everything up and running again, interestingly, the same WU has restarted from scratch despite being past the deadline already. It claims that it will take about an hour plus forty minutes, so I'm inclined to let it run and see what happens. I may be going mad (Einstein's definition of madness - to do the same thing repeatedly and expect it to turn out differently this time) but we shall see.

Shouldn't it check the deadline before it starts up, or quit if it overruns?
Dunckx

Dunckx
PDA-Benutzer
PDA-Benutzer
Beiträge: 45
Registriert: 12.11.2014 09:26

Re: Completed WU still running!

#10 Ungelesener Beitrag von Dunckx » 11.09.2018 18:48

Well, I'm not mad yet. :roll2:

This time was different. The WU has finished after 45 minutes (not three days plus...) Too late for the deadline, pity it couldn't do this at the first attempt!

I will get my pc to upload the results now. Hope you find something useful from this and the slot directory.
Dunckx

Antworten

Zurück zu „Fehler, Wünsche / Bugs, Wishes“