Jacob Klein hat geschrieben: ↑09.12.2020 17:29
I'm sorry, but it sounds like it's not recoverable. What I typically do is .... every week, close BOINC, make sure that all VirtualBox processes are also closed (either gracefully on their own, or by force), then open VirtualBox to make a Clone of the VM, to save as a backup. Then close VirtualBox, and wait for its processes to close. Then start BOINC again. I also have my BOINC setup to not auto-start with Windows, since it will auto-report failures, even though I might have a chance to restore things offline, and test offline, to correct things offline, before allowing it to start online.
I know this is information that is ... too late, for your scenario. Again, sorry.
But now might be a good time to familiarize yourself with VirtualBox, and cloning, etc. For the next time that you get a monster!
Thankyou for the information.
I have been creating VM's with Virtual Box for years. My 3 servers are running 8 to 12, 4-core, minimalist OS (Windows 7 or Anti-X), BOINC installs that I update from an original then propagate the new version to the servers. Customers have hired me to transfer some of their older machines, with games or research, to VM's so they can run it from their new workstations.
I did a superb job of keeping this laptop running uninterrupted long stretches; it had only restarted 5 times during the life of this 376day WU and was only hibernated for a few hours in power failures during thunderstorms.
It's a 2008 model and was eventually going to have a failure.
For a BOINC project, it is beyond the scope of my responsibilities to clone or backup a work unit (VM). It is the responsibility of the project developers and managers to assure a WU is either a small slice of work so that the loss of a single WU is insignificant to the user/project, that the WU is failed if it appears the client computer is incapable of completing in a reasonable time (deadlines), or that the WU is properly hardened against loss if it typically takes longer than 24-72 hours to complete.
Honestly, life is way too short, with way too many other joys and responsibilities, to add additional burden to users of protecting WU's.
I do not think it possible to get another of these WU's. If one did arrive, the main computing equipment would not be able to complete it as they have no battery backups and a random power glitch (3 last year: 2 from high winds, 1 from snow falling off the lines) could kill this WU ( I always shut them down if storms are coming) also because of climate change, the machines are off in the summer months. It's not ethical to use coal fired plant electricity (35% of our area still is coal) to cool 64 cores in the summer but they are excellent home heating the rest of the year.
Will these last 30 WU's, that are taking 400+ days, still be relevant to the science?
Is a competing research team going to beat the project to paper publication?
Really would like to have completed this WU as a badge of honor and for Free-DC and WUProps badges.