Odd, I've been given a load of points for the 30% completed task which I aborted when it went back to 0% after a reboot. Does this mean they got useful data or just felt sorry for me?Michael H.W. Weber hat geschrieben:No, the server does in this respect actually not know anything about what the clients are doing. It only receives a trickle-up message how far the individual client is progressing to auto-adjust the tasks deadline on the server-side (run time prediction is actually very difficult for these types of calculations).Peter Hucker hat geschrieben:Also, when I said "why can't the checkpoints be used", what I actually meant was doesn't the server have a copy of where I got up to, so the unit can be handed to the next user half done?
.
Long running work unit
-
- Mikrocruncher
- Beiträge: 30
- Registriert: 19.08.2017 13:56
Re: Long running work unit
-
- Brain-Bug
- Beiträge: 564
- Registriert: 26.07.2013 15:41
Re: Long running work unit
Speed is an 8-core, 16-thread, PC.
I have determined that these RNA World VM Tasks do not hyperthread well with each other at all (verified by looking at "retired instructions" from Intel Performance Counter). So I will only allow 8 to run at the same time, while the remaining threads work on non-VM workloads.
Speed has 6 RNA World tasks currently in progress (5 in BOINC, and 1 that I'm doing locally).
He was already searching for 1 more -- Now he is searching for 2!
I have determined that these RNA World VM Tasks do not hyperthread well with each other at all (verified by looking at "retired instructions" from Intel Performance Counter). So I will only allow 8 to run at the same time, while the remaining threads work on non-VM workloads.
Speed has 6 RNA World tasks currently in progress (5 in BOINC, and 1 that I'm doing locally).
He was already searching for 1 more -- Now he is searching for 2!
-
- Mikrocruncher
- Beiträge: 30
- Registriert: 19.08.2017 13:56
Re: Long running work unit
How are you getting so many?Jacob Klein hat geschrieben:Speed is an 8-core, 16-thread, PC.
I have determined that these RNA World VM Tasks do not hyperthread well with each other at all (verified by looking at "retired instructions" from Intel Performance Counter). So I will only allow 8 to run at the same time, while the remaining threads work on non-VM workloads.
Speed has 6 RNA World tasks currently in progress (5 in BOINC, and 1 that I'm doing locally).
He was already searching for 1 more -- Now he is searching for 2!
-
- Brain-Bug
- Beiträge: 564
- Registriert: 26.07.2013 15:41
Re: Long running work unit
For starters, I have set all my other CPU-capable projects to be "backup projects", using 0 Resource Share. Then I set my cache/buffer settings high enough such that I'm always needing work. Then BOINC asks RNA World for work, as often as it automatically can, according to resource backoffs and project backoffs. I also have additional tactics. I can answer more questions privately, if you'd like.Peter Hucker hat geschrieben:How are you getting so many?
-
- Mikrocruncher
- Beiträge: 30
- Registriert: 19.08.2017 13:56
Re: Long running work unit
I see. I've got all my projects with different weights. I guess my machines can't get RNA then settle for something else. Not to worry, I have other projects I like doing too, I was just wondering. One of the four machines is asking every 5 hours, the others aren't asking at all just now.
-
- Brain-Bug
- Beiträge: 564
- Registriert: 26.07.2013 15:41
Re: Long running work unit
I'm a BOINC Alpha tester, attached to all 61 projects. I routinely do work for about 10 of them, and RNA World is one of my top 3 favorite projects. But yeah, I make sure to keep the threads fully loaded, unless I have a PC in "Get 'er done" mode due to upcoming VirtualBox incompatibilities. 2 of my 4 PCs are like that, currently. You might like following the other thread I chime in on, here: viewtopic.php?f=75&t=16160
-
- Admin
- Beiträge: 1920
- Registriert: 23.02.2010 22:12
Re: Long running work unit
No we don't get the science back but we are rewarding the effort put into trying to tackle a monster by granting partial credit for failed tasks.Peter Hucker hat geschrieben:Odd, I've been given a load of points for the 30% completed task which I aborted when it went back to 0% after a reboot. Does this mean they got useful data or just felt sorry for me?Michael H.W. Weber hat geschrieben:No, the server does in this respect actually not know anything about what the clients are doing. It only receives a trickle-up message how far the individual client is progressing to auto-adjust the tasks deadline on the server-side (run time prediction is actually very difficult for these types of calculations).Peter Hucker hat geschrieben:Also, when I said "why can't the checkpoints be used", what I actually meant was doesn't the server have a copy of where I got up to, so the unit can be handed to the next user half done?
.
-
- Mikrocruncher
- Beiträge: 30
- Registriert: 19.08.2017 13:56
Re: Long running work unit
It's a pity you get nothing out of it. Is this a feature missing from BOINC? The inability to upload partial workunits? It must happen often that someone fails to complete a workunit as they go on holiday etc. Or of course in my case a computer error.
- Michael H.W. Weber
- Vereinsvorstand
- Beiträge: 22419
- Registriert: 07.01.2002 01:00
- Wohnort: Marpurk
- Kontaktdaten:
Re: Long running work unit
Well, to solve this, there is actually the concept of checkpoints.Peter Hucker hat geschrieben:It's a pity you get nothing out of it. Is this a feature missing from BOINC? The inability to upload partial workunits?
Michael.
Fördern, kooperieren und konstruieren statt fordern, konkurrieren und konsumieren.
http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B
http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B
-
- Mikrocruncher
- Beiträge: 30
- Registriert: 19.08.2017 13:56
Re: Long running work unit
That's what I thought, so why weren't they used?Michael H.W. Weber hat geschrieben:Well, to solve this, there is actually the concept of checkpoints.Peter Hucker hat geschrieben:It's a pity you get nothing out of it. Is this a feature missing from BOINC? The inability to upload partial workunits?
Michael.
- Michael H.W. Weber
- Vereinsvorstand
- Beiträge: 22419
- Registriert: 07.01.2002 01:00
- Wohnort: Marpurk
- Kontaktdaten:
Re: Long running work unit
I think we discussed your specific problem in detail above.
Michael.
Michael.
Fördern, kooperieren und konstruieren statt fordern, konkurrieren und konsumieren.
http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B
http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B
-
- Admin
- Beiträge: 1920
- Registriert: 23.02.2010 22:12
Re: Long running work unit
BOINC actually does support that but the RNA science app does not. It doesn't do checkpoints so we have to use VirtualBox in order to get checkpointing but we can't upload the VBox checkpoint as it only works on the host it was created on. It would also be too large.Peter Hucker hat geschrieben:It's a pity you get nothing out of it. Is this a feature missing from BOINC? The inability to upload partial workunits? It must happen often that someone fails to complete a workunit as they go on holiday etc. Or of course in my case a computer error.