Long running work unit

Everything about the project RNA World
Nachricht
Autor
trindol

Re: Long running work unit

#661 Ungelesener Beitrag von trindol » 24.10.2013 16:39

6077994
6083144

I am concerned with these two. When they were at 300 hours, the host lost power. The units restarted automatically when power was restored. Now they are at 2400 hours. Both have been at 100% for weeks. Usually this host is much faster even with XXL work units. Do you think these units will be ok despite the crash? Thanks.

Benutzeravatar
Michael H.W. Weber
Vereinsvorstand
Vereinsvorstand
Beiträge: 22431
Registriert: 07.01.2002 01:00
Wohnort: Marpurk
Kontaktdaten:

Re: Long running work unit

#662 Ungelesener Beitrag von Michael H.W. Weber » 26.10.2013 12:28

The crash just caused a regular restart.
By the way: We have a VM-based checkpointing system now.

Michael.
Fördern, kooperieren und konstruieren statt fordern, konkurrieren und konsumieren.

http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B

Bild Bild Bild

trindol

Re: Long running work unit

#663 Ungelesener Beitrag von trindol » 27.10.2013 22:29

Those two work units need an extension please.

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8048
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: Long running work unit

#664 Ungelesener Beitrag von yoyo » 28.10.2013 06:09

extended.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

rebel9
Taschenrechner
Taschenrechner
Beiträge: 8
Registriert: 13.11.2013 20:21

Re: Long running work unit

#665 Ungelesener Beitrag von rebel9 » 13.11.2013 20:31

I also have this one:

http://www.rnaworld.de/rnaworld/result. ... d=14920814

which is overdue. It's been running for 9.5 days and been running at high priority for some time. It has been pegged at the 98.765 limit for ages and the time left has been at around 7 hrs for as long as I can remember.

What should I do with it?

Thanks.

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8048
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: Long running work unit

#666 Ungelesener Beitrag von yoyo » 14.11.2013 12:22

This task isn't overdue. According to the server the deadline is 9 Dec 2013, 20:00:49 UTC. So let it run.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

rebel9
Taschenrechner
Taschenrechner
Beiträge: 8
Registriert: 13.11.2013 20:21

Re: Long running work unit

#667 Ungelesener Beitrag von rebel9 » 14.11.2013 19:28

Hi,

All my BOINC clients are showing that the deadline was 9/11/2013. I hadn't noticed that the site showed a different date and I'm assuming that's the correct one as it's the only active one and the only VBox WU. If so, there's a problem with your WUs. I'm happy to let it run for now, though.

Thanks.

ChristianB
Admin
Admin
Beiträge: 1920
Registriert: 23.02.2010 22:12

Re: Long running work unit

#668 Ungelesener Beitrag von ChristianB » 14.11.2013 19:39

rebel9 hat geschrieben:Hi,

All my BOINC clients are showing that the deadline was 9/11/2013. I hadn't noticed that the site showed a different date and I'm assuming that's the correct one as it's the only active one and the only VBox WU. If so, there's a problem with your WUs. I'm happy to let it run for now, though.

Thanks.
There is no problem per se with our WUs. Just the one you have is from a batch where automatic deadline extension and the progress indicator aren't working. The deadline was extended on the server side. The newer tasks don't have this problem so you may cancel this WU and get a new one in the future (that has automatic deadline extension and a working progress bar) if you want.

The deadline on the Client won't change because BOINC doesn't support updating the deadline. Also we are still trying to find bugs in the VM and create new WUs only if needed for the moment. All our tasks are known to exceed the initial deadline but as long as you let it finish you will successfully contribute to RNA World.

rebel9
Taschenrechner
Taschenrechner
Beiträge: 8
Registriert: 13.11.2013 20:21

Re: Long running work unit

#669 Ungelesener Beitrag von rebel9 » 15.11.2013 19:54

OK, that's good to know, thanks.

candido

Re: Long running work unit

#670 Ungelesener Beitrag von candido » 23.11.2013 19:01

I have this task that has been runing for 280 hours and is showing 30% compelete.

wu 6330294
cms_GA-p[b-Lin64f-2]_1_Tolumonas-auensis-DSM-9187_CP001616.cir.EMBL_RF00028_Intron_gpI_1358679723_2015

The deadline is 01/December/2013.
Could you please extende the deadline?
Thanks

Jacob Klein
Brain-Bug
Brain-Bug
Beiträge: 564
Registriert: 26.07.2013 15:41

Re: Long running work unit

#671 Ungelesener Beitrag von Jacob Klein » 05.12.2013 20:39

Could you please carefully extend the deadline for task 14921181 (from 12/12/2013 to maybe 1/1/2014)?
http://www.rnaworld.de/rnaworld/result. ... d=14921181

Status:
It is a VM task, I am 740 hours in, it is running high-priority nearly 24/7, and because it is one of the old v1.03 tasks, I believe automatic deadline extension is not working.
I am carefully babysitting it - monitoring stderr.txt, and when I see it grow to a couple MB, I close BOINC, wait for the processes to stop, delete the content of stderr.txt, and restart BOINC.
I'm hopeful it will complete successfully, eventually; estimated runtime on reference system is 1447 hours.

Thanks,
Jacob

ChristianB
Admin
Admin
Beiträge: 1920
Registriert: 23.02.2010 22:12

Re: Long running work unit

#672 Ungelesener Beitrag von ChristianB » 05.12.2013 22:24

extended

Antworten

Zurück zu „RNA World Discussions (english)“