ECM Wrapper; ecm.exe not in RAM

Alles zum Projekt yoyo@home
Everything about the project yoyo@home
Nachricht
Autor
mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

ECM Wrapper; ecm.exe not in RAM

#1 Ungelesener Beitrag von mister.marmot » 01.11.2018 00:28

Noticing that the ECM_wrapper is idle on many virtual machines, mostly on one host server, but it happens on all three of my servers at some point during the day.
The ecm.exe application is done with a phase, or crashed, and the wrapper doesn't bring ecm.exe in quickly.

Shouldn't the wrapper check every few seconds if ecm.exe is in RAM and promptly restart it when it is out?

Thought it might be connected to not enough RAM in the VM so increased RAM to 5120 MB for 4 WU's; there is available free RAM but the issue continued to happen.

Siever and Cruncher work fine.

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#2 Ungelesener Beitrag von mister.marmot » 03.11.2018 18:45

This computer will be removed from ECM WU's, it runs Siever and Cruncher fine.

Work unit log:

Result ID 54395780
Name ecm_ru_1540832720_10_463.c391_12930_0
Workunit 41389575
Created 29 Oct 2018 17:07:46 UTC
Sent 30 Oct 2018 14:25:31 UTC
Received 2 Nov 2018 19:24:02 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 4 Nov 2018 14:25:31 UTC
CPU time 92744.2344
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x772932FF read attempt to address 0x00000004

Engaging BOINC Windows Runtime Debugger...

No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x770B32FF read attempt to address 0x20762D24

Engaging BOINC Windows Runtime Debugger...

wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x770B32FF read attempt to address 0x00000004

Engaging BOINC Windows Runtime Debugger...

No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#3 Ungelesener Beitrag von mister.marmot » 03.11.2018 18:53

This is a different host server and these are a couple of it's logs (the maximum RAM available for each WU is 1GB on this machine, not the 11GB in the maxmem)


Result ID 54367497
Name ecm_ru_1540688129_10_449.c429_3640_0
Workunit 41369370
Created 28 Oct 2018 1:46:16 UTC
Sent 28 Oct 2018 18:48:09 UTC
Received 2 Nov 2018 11:09:14 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 2 Nov 2018 18:48:09 UTC
CPU time 144077.0938
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>
Validate state Initial
Claimed credit 793.7802217481235
Granted credit 647.485966293932
application version 705.02

====================================================

Result ID 54508846
Name ecm_xy_1541136656_C253_148_97_5310_0
Workunit 41466408
Created 2 Nov 2018 5:37:25 UTC
Sent 2 Nov 2018 18:06:15 UTC
Received 2 Nov 2018 21:48:21 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 7 Nov 2018 18:06:15 UTC
CPU time 10653.125
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>
Validate state Initial
Claimed credit 58.69246596928843
Granted credit 129.566730755246
application version 705.02

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#4 Ungelesener Beitrag von mister.marmot » 22.11.2018 11:17

I've caught this issue on all three servers now. 2 more wrappers sitting idle in RAM with no ecm.exe this morning.

No other work unit from any other project has a wrapper that sits idle while there is no verified PID for the working application.

It seems an easy fix to add a check into the wrapper to verify a handshake to the ecm.exe application or increase the polling speed if verification check already exists.

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8045
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: ECM Wrapper; ecm.exe not in RAM

#5 Ungelesener Beitrag von yoyo » 22.11.2018 19:05

The wrapper doesn't poll if ecm is running or not. No wrapper does it, also not the wrapper for the other projects.
The wrapper just starts ecm and waits until the process has finished, afterwards it checks if it finished by error or with success.

So if the wrapper doesn't continue it means the ecm process hasn't finished.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

Benutzeravatar
gemini8
Vereinsvorstand
Vereinsvorstand
Beiträge: 5898
Registriert: 31.05.2011 10:30
Wohnort: Hannover

Re: ECM Wrapper; ecm.exe not in RAM

#6 Ungelesener Beitrag von gemini8 » 22.11.2018 19:09

From what I read I get that there is no ECM working, just the wrapper waiting.
Gruß, Jens
- - - - - -
Lowend-User und Teilzeit-Cruncher

Bild Bild Bild
Bild

Benutzeravatar
yoyo
Vereinsvorstand
Vereinsvorstand
Beiträge: 8045
Registriert: 17.12.2002 14:09
Wohnort: Berlin
Kontaktdaten:

Re: ECM Wrapper; ecm.exe not in RAM

#7 Ungelesener Beitrag von yoyo » 22.11.2018 19:26

I have read it, but this doesn't fit to the source code.
I rechecked the source.
ECM is started, the pid is stored and every second it is checked if this pid is alife or not:

Code: Alles auswählen

bool TASK::poll(int& status) {
    if (!suspended) wall_cpu_time += POLL_PERIOD;
#ifdef _WIN32
    unsigned long exit_code;
    if (GetExitCodeProcess(pid_handle, &exit_code)) {
        if (exit_code != STILL_ACTIVE) {
            status = exit_code;
            final_cpu_time = cpu_time();
            return true;
        }
    }
#else
    int wpid, stat;
    struct rusage ru;

    wpid = wait4(pid, &status, WNOHANG, &ru);
    if (wpid) {
        final_cpu_time = (float)ru.ru_utime.tv_sec + ((float)ru.ru_utime.tv_usec)/1e+6;
        return true;
    }
#endif
    return false;
}
So, if the wrapper is still running, the process is still available.

I never saw such an issue.
HILF mit im Rechenkraft-WiKi, dies gibts zu tun.
Wiki - FAQ - Verein - Chat

Bild Bild

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#8 Ungelesener Beitrag von mister.marmot » 23.11.2018 18:45

yoyo hat geschrieben: So if the wrapper doesn't continue it means the ecm process hasn't finished.
Just wanted to clarify.

All my VM's are 4 core and what I see is the VM only using 2 cores for an hour+
Open the VM, look at it's process manager and the ecm_xxxx wrapper is in RAM but ecm.exe is not.
I check with Process Hacker and look for hidden processes, but cause there are only 2 cores of 4 being used (by siever or cruncher) it obvious ecm.exe isn't running.
BOINC reports the WU is 98% complete, 99.3% complete, etc.

If I shut down BOINC and restart then the wrapper starts ecm.exe and puts the progress back to 0%.

Maybe ecm.exe has crashed but somehow left a PID in the OS process tables?
How do I know from the logs that ecm.exe crashed because it ran out of RAM (these VM's have 3GB available to run 2x ecm.exe and 2x siever)?

PID cleanup failure... I shut down most all the services to get the OS to run in 64MB RAM but PID cleanup should be a kernel function and untouched by support services.

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#9 Ungelesener Beitrag von mister.marmot » 23.11.2018 19:10

Going through my http://www.rechenkraft.net/yoyo/results.php?userid= and it's really, really hard to tell if I've had any successful ECM results.

How can I easily tell there have been ECM WU finishing?

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#10 Ungelesener Beitrag von mister.marmot » 15.12.2018 02:00

yoyo hat geschrieben:
So, if the wrapper is still running, the process is still available.

I never saw such an issue.
Nevertheless, it is happening within all 26 of my VM's on 4 different servers that attempted to run ECM. They run other WU's successfully including YAFU's ECM phases.

The ECM wrapper will sit for hours without ECM.exe in the process list.

Since the YAFU wrapper has no problems keeping the ECM.exe in RAM, on the same VM's, then check to see the difference in the code between the wrappers.

mister.marmot
PDA-Benutzer
PDA-Benutzer
Beiträge: 34
Registriert: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

#11 Ungelesener Beitrag von mister.marmot » 04.04.2019 00:55

Have an update to this issue.

Virtual Box will allow the rtc (real time clock) of the guest OS to drift.
On my VM's the time could fall behind up to 8 minutes.

BOINC.exe will remove a work unit from RAM if it sees the rtc set to a past time till the clock catches up to where it was.
BOINC puts either of the errors in the log:
no heartbeat detected
exited with zero status
See: https://boinc.berkeley.edu/dev/forum_th ... 7348#47348

There is a way to force Oracle Virtual Box to update rtc more often:
timesync-interval=1000, default is 10000
timesync-min-adjust=100, same as the default
timesync-set-on-restore=1, same as the default
timesync-set-threshold=1000, default is 1200000

See discussion: https://forums.virtualbox.org/viewtopic ... 80#p361824

But I didn't try it yet.
Solved my problem by moving all ECM work to host machine.

Maybe in October 2019 I will try ECM in VM again with new settings.

The question is why does BOINC.exe remove ecm.exe from RAM when detecting "no heartbeat" error instead of removing ecm_wrapper and all sub processes?

Benutzeravatar
Michael H.W. Weber
Vereinsvorstand
Vereinsvorstand
Beiträge: 22417
Registriert: 07.01.2002 01:00
Wohnort: Marpurk
Kontaktdaten:

Re: ECM Wrapper; ecm.exe not in RAM

#12 Ungelesener Beitrag von Michael H.W. Weber » 04.04.2019 07:24

Just a short note: None of the Yoyo@home subprojects uses Virtualbox.

Michael.
Fördern, kooperieren und konstruieren statt fordern, konkurrieren und konsumieren.

http://signature.statseb.fr I: Kaputte Seite A
http://signature.statseb.fr II: Kaputte Seite B

Bild Bild Bild

Antworten

Zurück zu „Number crunching“