Check pointing?

Everything about the project RNA World
zombie67
Prozessor-Polier
Posts: 119
Joined: 26.07.2007 21:58
Location: Reno, NV

#1 Post by zombie67 » 19.01.2010 00:59

Do the RNA apps checkpoint? And if so, how frequently?
Team: SETI.USA

Ananas
WU-Schieber
Posts: 1184
Joined: 27.04.2008 18:37
Location: Nordlichter Köln

#2 Post by Ananas » 19.01.2010 01:14

The Windows version does not.

If the software package "freeze" is installed (not the Python "freeze" program, I guess), the Linux version saves a process image at each checkpoint so that it can be restarted from there. So these are not really checkpoints, but they serve the same purpose.
vi BOINC/checkin_notes
:1,$s/bug/feature/g
:wq!

Do biologists actually tell each other small-RNA jokes?

Roland Schneider
Vereinsvorstand
Posts: 919
Joined: 12.11.2003 20:41
Location: Stavenhagen

#3 Post by Roland Schneider » 19.01.2010 01:27

No, there is currently no checkpointing for the RNA apps.

The only system where checkpointing can be enabled manually is 32-bit Linux, but it is not that easy, and you need superuser access to do it.

yoyo
Vereinsvorstand
Posts: 8048
Joined: 17.12.2002 14:09
Location: Berlin

#4 Post by yoyo » 19.01.2010 08:28

Ananas wrote: The Windows version does not.

If the software package "freeze" is installed (not the Python "freeze" program, I guess), the Linux version saves a process image at each checkpoint so that it can be restarted from there. So these are not really checkpoints, but they serve the same purpose.

Freeze is inside the RNA application and is called frequently. Whether it works depends on the configuration of the Linux kernel. You can see the freeze calls in the stderr output of the results.
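
For anyone who wants to check this on their own results, here is a minimal Python sketch that counts freeze calls in a result's stderr text. The two line formats are copied from the stderr excerpt quoted further down in this thread; other wrapper versions may log differently, and `summarize_freeze_calls` is just a hypothetical helper name.

```python
import re

# Line formats taken from the stderr excerpt quoted in this thread.
# Assumption: other wrapper versions may word these lines differently.
START_RE = re.compile(r"^wrapper: starting \./freeze (\S+) (\d+)", re.IGNORECASE)
NOT_FOUND_RE = re.compile(r"^sh: \./freeze: not found")


def summarize_freeze_calls(stderr_text):
    """Count freeze invocations and how many 'not found' errors were logged."""
    calls = 0
    not_found = 0
    for line in stderr_text.splitlines():
        line = line.strip()
        if START_RE.match(line):
            calls += 1
        elif NOT_FOUND_RE.match(line):
            not_found += 1
    return {"calls": calls, "not_found": not_found}


sample = """Wrapper: writing checkpoint image
wrapper: starting ./freeze chkpoint 4367
sh: ./freeze: not found
"""
print(summarize_freeze_calls(sample))  # {'calls': 1, 'not_found': 1}
```

If `not_found` equals `calls`, freeze never ran for that result, so no process image was written.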
yoyo
HELP out in the Rechenkraft-WiKi, here is what needs doing.

#5 Post by Ananas » 19.01.2010 10:13

yoyo wrote: ...
Freeze is inside the RNA application and is called frequently. ...

Why does it sometimes say "not found" then?

#6 Post by yoyo » 19.01.2010 13:57

Ananas wrote:
yoyo wrote: ...
Freeze is inside the RNA application and is called frequently. ...
Why does it sometimes say "not found" then?

This should be the case only on linux64 and Windows. For these we do not have a freeze app.
yoyo

Michael H.W. Weber
Vereinsvorstand
Posts: 22435
Joined: 07.01.2002 01:00
Location: Marpurk

#7 Post by Michael H.W. Weber » 19.01.2010 17:18

In my case, validation errors occur exclusively on my single 32-bit Linux box, and there are a lot of notes from the "freeze" program in my (non-validated) results. So I conclude that this "freeze thing" has to do with the validation issues. However, not all WUs on this machine are faulty (they are actually a minority). I do not know why the error occurs only occasionally. :roll: It would surely be very helpful if we could nail down the cause. :D

Michael.
Support, cooperate and construct instead of demanding, competing and consuming.

#8 Post by yoyo » 19.01.2010 19:47

I don't think that freeze is the root cause of the results not validating. The freeze output only shows that freeze works and what it does. But you can probably switch freeze off: just delete freeze in the slot directory.
yoyo

#9 Post by Ananas » 19.01.2010 20:11

yoyo wrote:
Ananas wrote: ...
Why does it sometimes say "not found" then?
This should be the case only on linux64 and Windows. For these we do not have a freeze app.
yoyo

http://www.rnaworld.de/rnaworld/result. ... tid=120325
Wrapper: writing checkpoint image
wrapper: starting ./freeze chkpoint 4367
sh: ./freeze: not found
Linux 2.6.24-19-generic (nothing about x64, it is Saenger's box)


A possible reason: the wrapper is not in the right directory when it calls freeze, i.e. the cwd is not where freeze was extracted.
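
To illustrate what I mean (a guess at the mechanism, not a diagnosis): a relative command like `./freeze` is looked up against the current working directory of the calling process, so the same call can succeed in one directory and fail in another. A small Python sketch, where the directory names and the stand-in file are made up for the demonstration:

```python
import os
import tempfile


def resolve_from_cwd(command, cwd):
    """Return the absolute path a relative command such as './freeze'
    refers to when the calling process has `cwd` as working directory."""
    return os.path.normpath(os.path.join(cwd, command))


def would_be_found(command, cwd):
    """True if the relative command points at an existing file from `cwd`."""
    return os.path.isfile(resolve_from_cwd(command, cwd))


with tempfile.TemporaryDirectory() as base:
    slot_dir = os.path.join(base, "slot")    # hypothetical directory holding freeze
    other_dir = os.path.join(base, "other")  # hypothetical wrong working directory
    os.makedirs(slot_dir)
    os.makedirs(other_dir)
    with open(os.path.join(slot_dir, "freeze"), "w") as fh:
        fh.write("#!/bin/sh\n")              # stand-in file, not the real freeze binary

    found_in_slot = would_be_found("./freeze", slot_dir)
    found_elsewhere = would_be_found("./freeze", other_dir)
    print(found_in_slot, found_elsewhere)    # True False
```

The second case is the situation in which a shell would report `./freeze: not found`.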

#10 Post by yoyo » 19.01.2010 21:19

As far as I know, Saenger's PC runs 64-bit Linux but identifies itself as linux-generic, so it downloaded the 64-bit app.
yoyo

Uwe Sänger Herzke
Block-Bunkerer
Posts: 1326
Joined: 31.05.2006 14:33
Location: Bremen

#11 Post by Uwe Sänger Herzke » 19.01.2010 21:28

The first lines in BOINC after start-up on my machine are these:

Code:

Sa 16 Jan 2010 13:51:49 CET		Starting BOINC client version 6.10.17 for x86_64-pc-linux-gnu
Sa 16 Jan 2010 13:51:49 CET		Config: use at most 4 CPUs
Sa 16 Jan 2010 13:51:49 CET		log flags: file_xfer, sched_ops, task
Sa 16 Jan 2010 13:51:49 CET		Libraries: libcurl/7.18.0 OpenSSL/0.9.8g zlib/1.2.3.3 c-ares/1.5.1
Sa 16 Jan 2010 13:51:49 CET		Data directory: /home/saenger/BOINC/BOINC
Sa 16 Jan 2010 13:51:49 CET		Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad  CPU   Q9450  @ 2.66GHz [Family 6 Model 23 Stepping 7]
Sa 16 Jan 2010 13:51:49 CET		Processor: 6.00 MB cache
Sa 16 Jan 2010 13:51:49 CET		Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr sse4_1
Sa 16 Jan 2010 13:51:49 CET		OS: Linux: 2.6.24-19-generic
Sa 16 Jan 2010 13:51:49 CET		Memory: 3.87 GB physical, 5.68 GB virtual
Sa 16 Jan 2010 13:51:49 CET		Disk: 132.04 GB total, 67.03 GB free
Sa 16 Jan 2010 13:51:49 CET		Local time is UTC +1 hours
Sa 16 Jan 2010 13:51:49 CET		No usable GPUs found
Sa 16 Jan 2010 13:51:50 CET		Not using a proxy
I don't know where to change the naming of the OS, but it's 64-bit.
Edit: 2.66 GHz is wrong as well; it's 3.2 GHz.
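
The kernel release string `2.6.24-19-generic` carries no architecture, but the client's startup line in the log above does name its platform. Here is a minimal Python sketch for reading it out of such a log. `client_platform` and `looks_64bit` are hypothetical helper names, and the prefix test is only a heuristic I am assuming for x86 machines:

```python
import re

# Pattern for the client's startup line as it appears in the log above.
STARTUP_RE = re.compile(r"Starting BOINC client version (\S+) for (\S+)")


def client_platform(log_text):
    """Return (client_version, platform_name) from the startup line, or None."""
    for line in log_text.splitlines():
        match = STARTUP_RE.search(line)
        if match:
            return match.group(1), match.group(2)
    return None


def looks_64bit(platform_name):
    # Assumption: an 'x86_64' prefix marks a 64-bit x86 client build.
    return platform_name.startswith("x86_64")


sample_log = (
    "Sa 16 Jan 2010 13:51:49 CET\t\tStarting BOINC client version 6.10.17 for x86_64-pc-linux-gnu\n"
    "Sa 16 Jan 2010 13:51:49 CET\t\tOS: Linux: 2.6.24-19-generic\n"
)
result = client_platform(sample_log)
print(result, looks_64bit(result[1]))  # ('6.10.17', 'x86_64-pc-linux-gnu') True
```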
Greetings from Sänger

vaughan

#12 Post by vaughan » 21.01.2010 23:15

Checkpointing would be nice. :)

I have a couple of tasks running that are going to run past the deadline.
The task names are incredibly long, and I don't know how to cut & paste them, but the first is:
CMS_MW1_Anaeromxyobacter-sp.-Fw109-5 etc etc
The second is:
CMS_MW1_Clostridium-E88 etc etc
Both are about 4% done after 40 hours, with an estimated 78 hours remaining.
This is on an Intel C2D E8600 CPU @ stock 3.33 GHz with 8 GB RAM, running Windows 7 64-bit Ultimate.

Perhaps it's time to abort them.
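
A rough sanity check on those numbers, assuming progress is linear (I don't know whether it is for these tasks): 4% in 40 hours extrapolates to far more than the 78 hours the client shows. A small Python sketch of the arithmetic, with `linear_estimate` as a made-up helper name:

```python
def linear_estimate(elapsed_hours, percent_done):
    """Extrapolate total and remaining runtime, assuming a constant progress rate."""
    if not 0 < percent_done <= 100:
        raise ValueError("percent_done must be in (0, 100]")
    total_hours = elapsed_hours * 100 / percent_done
    return total_hours, total_hours - elapsed_hours


total, remaining = linear_estimate(elapsed_hours=40, percent_done=4)
print(f"total ~{total:.0f} h, remaining ~{remaining:.0f} h")  # total ~1000 h, remaining ~960 h
```

So either the progress figure or the remaining-time estimate is far off, and without checkpointing any interruption would throw away the hours already spent.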
