ECM Wrapper; ecm.exe not in RAM

Everything about the project yoyo@home

by mister.marmot » 01.11.2018 00:28

I'm noticing that the ECM wrapper sits idle on many virtual machines. It happens mostly on one host server, but all three of my servers show it at some point during the day.
The ecm.exe application has either finished a phase or crashed, and the wrapper doesn't start ecm.exe again quickly.

Shouldn't the wrapper check every few seconds whether ecm.exe is still in RAM and promptly restart it when it is not?
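To make the question concrete, here is a minimal sketch of the kind of polling loop meant here. It is not the BOINC wrapper's actual code; the function name `supervise`, the 5-second interval, and the restart limit are all assumptions chosen for illustration, and the child process is a stand-in rather than a real ecm command line.

```python
import subprocess
import sys
import time


def supervise(cmd, poll_seconds=5.0, max_restarts=3):
    """Run cmd, poll it every poll_seconds, restart it if it exits non-zero.

    Returns the number of restarts that were needed (hypothetical helper).
    """
    restarts = 0
    proc = subprocess.Popen(cmd)
    while True:
        code = proc.poll()  # None while the child is still running
        if code is None:
            time.sleep(poll_seconds)
            continue
        if code == 0:
            return restarts  # clean finish, nothing to restart
        if restarts >= max_restarts:
            raise RuntimeError(f"child failed {restarts + 1} times, giving up")
        restarts += 1
        proc = subprocess.Popen(cmd)  # bring the child back promptly


if __name__ == "__main__":
    # Stand-in child that exits cleanly; a real setup would pass the ecm command.
    print(supervise([sys.executable, "-c", "pass"], poll_seconds=0.1))
```

With a loop like this the gap between a crash and the restart is bounded by the poll interval, which is the behaviour I would expect from the wrapper.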

I thought it might be caused by too little RAM in the VM, so I increased the RAM to 5120 MB for 4 WUs. There is free RAM available, but the issue still occurs.
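A quick worst-case check of that RAM figure, as a sketch only. The logs further down in this thread show ecm being started with `-maxmem 1800`; the calculation assumes that this caps each task at 1800 MB, that all four tasks reach their peak at the same time, and that the OS needs nothing. The helper name `memory_budget` is made up for this example.

```python
def memory_budget(vm_ram_mb, tasks, maxmem_mb):
    """Return (worst-case demand in MB, shortfall in MB or 0 if it fits)."""
    demand = tasks * maxmem_mb
    return demand, max(0, demand - vm_ram_mb)


# 4 WUs at -maxmem 1800 inside a 5120 MB VM (all assumptions, see above).
demand, shortfall = memory_budget(vm_ram_mb=5120, tasks=4, maxmem_mb=1800)
print(demand, shortfall)  # 7200 2080
```

Under those assumptions the four tasks could ask for more than the VM has, even though free RAM is visible most of the time. Whether that ever happens in practice depends on how much memory ecm really uses for these inputs.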

Siever and Cruncher work fine.
mister.marmot
Taschenrechner
Posts: 8
Joined: 15.07.2016 00:53

Re: ECM Wrapper; ecm.exe not in RAM

by mister.marmot » 03.11.2018 18:45

This computer will be removed from ECM WUs; it runs Siever and Cruncher fine.

Work unit log:

Result ID 54395780
Name ecm_ru_1540832720_10_463.c391_12930_0
Workunit 41389575
Created 29 Oct 2018 17:07:46 UTC
Sent 30 Oct 2018 14:25:31 UTC
Received 2 Nov 2018 19:24:02 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 4 Nov 2018 14:25:31 UTC
CPU time 92744.2344
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x772932FF read attempt to address 0x00000004

Engaging BOINC Windows Runtime Debugger...

No heartbeat from core client for 30 sec - exiting
[previous line repeated 411 more times]
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x770B32FF read attempt to address 0x20762D24

Engaging BOINC Windows Runtime Debugger...

wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x770B32FF read attempt to address 0x00000004

Engaging BOINC Windows Runtime Debugger...

No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>
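To get an overview of a stderr dump like the one above without reading every line, a small counter helps. This is a sketch: the marker strings are copied from the log, while the labels and the function name `summarize` are my own, and the sample text is a shortened excerpt rather than the full output.

```python
from collections import Counter

# Substrings copied from the stderr output, mapped to short (made-up) labels.
MARKERS = {
    "No heartbeat from core client": "heartbeat_lost",
    "wrapper: starting": "wrapper_start",
    "wrapper: running ecm (-resume": "ecm_resume",
    "wrapper: delete file checkpoint_file": "checkpoint_deleted",
    "Unhandled Exception Detected": "crash",
    "called boinc_finish": "finished",
}


def summarize(stderr_text):
    """Count how often each known marker appears in a stderr dump."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for marker, label in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


# Shortened excerpt of the log, used only to show the output shape.
sample = """wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
Unhandled Exception Detected...
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
called boinc_finish
"""

if __name__ == "__main__":
    print(dict(summarize(sample)))
```

Running it over the complete stderr of each result makes it easy to compare hosts by the number of heartbeat losses, crashes, and restarts from a deleted checkpoint.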

Re: ECM Wrapper; ecm.exe not in RAM

by mister.marmot » 03.11.2018 18:53

This is a different host server, and these are a couple of its logs (the maximum RAM available for each WU is 1 GB on this machine, not the 11 GB in the maxmem).


Result ID 54367497
Name ecm_ru_1540688129_10_449.c429_3640_0
Workunit 41369370
Created 28 Oct 2018 1:46:16 UTC
Sent 28 Oct 2018 18:48:09 UTC
Received 2 Nov 2018 11:09:14 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 2 Nov 2018 18:48:09 UTC
CPU time 144077.0938
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 260000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>
Validate state Initial
Claimed credit 793.7802217481235
Granted credit 647.485966293932
application version 705.02

====================================================

Result ID 54508846
Name ecm_xy_1541136656_C253_148_97_5310_0
Workunit 41466408
Created 2 Nov 2018 5:37:25 UTC
Sent 2 Nov 2018 18:06:15 UTC
Received 2 Nov 2018 21:48:21 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID xxxxxx
Report deadline 7 Nov 2018 18:06:15 UTC
CPU time 10653.125
stderr out
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
wrapper: starting
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
No heartbeat from core client for 30 sec - exiting
wrapper: starting
wrapper: running ecm (-resume checkpnt -param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
wrapper: running ecm (-param 0 -v -timestamp -chkpnt checkpnt -inp in -maxmem 1800 110000000)
wrapper: delete file checkpoint_file
called boinc_finish

</stderr_txt>
]]>
Validate state Initial
Claimed credit 58.69246596928843
Granted credit 129.566730755246
application version 705.02
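
The restart pattern in a log like the two above can be tallied mechanically rather than by eye. The following is a minimal sketch, assuming the text between <stderr_txt> and </stderr_txt> has been saved into a string. The struct and function names are hypothetical, and the matched phrases are only the ones that appear in these logs.

```cpp
#include <cstddef>
#include <sstream>
#include <string>

// Hypothetical summary of one <stderr_txt> block from a wrapper result.
struct WrapperLogSummary {
    std::size_t wrapper_starts = 0;    // lines starting with "wrapper: starting"
    std::size_t ecm_runs = 0;          // lines starting with "wrapper: running ecm"
    std::size_t ecm_resumes = 0;       // ecm runs whose arguments contain "-resume"
    std::size_t heartbeat_losses = 0;  // "No heartbeat from core client ..." lines
    bool finished = false;             // true if "called boinc_finish" was logged
};

// Returns true if `line` begins with `prefix`.
static bool starts_with(const std::string& line, const std::string& prefix) {
    return line.compare(0, prefix.size(), prefix) == 0;
}

// Counts the wrapper events in a stderr text, one line at a time.
WrapperLogSummary summarize_wrapper_log(const std::string& text) {
    WrapperLogSummary summary;
    std::istringstream in(text);
    std::string line;
    while (std::getline(in, line)) {
        if (starts_with(line, "wrapper: starting")) {
            ++summary.wrapper_starts;
        } else if (starts_with(line, "wrapper: running ecm")) {
            ++summary.ecm_runs;
            if (line.find("-resume") != std::string::npos) ++summary.ecm_resumes;
        } else if (starts_with(line, "No heartbeat from core client")) {
            ++summary.heartbeat_losses;
        } else if (starts_with(line, "called boinc_finish")) {
            summary.finished = true;
        }
    }
    return summary;
}
```

Applied to the second log above (result 54508846), this would count two wrapper starts, six ecm runs of which one used -resume, one heartbeat loss, and a final boinc_finish.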

Re: ECM Wrapper; ecm.exe not in RAM

Post by mister.marmot » 22.11.2018 11:17

I've caught this issue on all three servers now. Two more wrappers were sitting idle in RAM with no ecm.exe this morning.

No other work unit from any other project has a wrapper that sits idle while there is no verified PID for the working application.

It seems like an easy fix to add a check to the wrapper that verifies a handshake with the ecm.exe application, or to increase the polling rate if such a check already exists.

Re: ECM Wrapper; ecm.exe not in RAM

Post by yoyo » 22.11.2018 19:05

The wrapper doesn't poll whether ecm is running. No wrapper does that, including the wrappers of other projects.
The wrapper just starts ecm and waits until the process has finished; afterwards it checks whether it finished with an error or with success.

So if the wrapper doesn't continue it means the ecm process hasn't finished.
yoyo
Vereinsvorstand
 
Posts: 7660
Joined: 17.12.2002 14:09
Location: Berlin

Re: ECM Wrapper; ecm.exe not in RAM

Post by gemini8 » 22.11.2018 19:09

From what I read, I gather that no ECM process is working, just the wrapper waiting.
Regards, Jens
- - - - - -
Low-end user and part-time cruncher

gemini8
Vereinsmitglied
 
Posts: 2655
Joined: 31.05.2011 10:30
Location: Hannover

Re: ECM Wrapper; ecm.exe not in RAM

Post by yoyo » 22.11.2018 19:26

I have read it, but it doesn't fit the source code.
I rechecked the source.
ECM is started, the PID is stored, and every second the wrapper checks whether this PID is still alive:

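Code:
// Called once per POLL_PERIOD; returns true once the child process has ended.
bool TASK::poll(int& status) {
    if (!suspended) wall_cpu_time += POLL_PERIOD;
#ifdef _WIN32
    unsigned long exit_code;
    // Query the exit code through the stored process handle.
    if (GetExitCodeProcess(pid_handle, &exit_code)) {
        // STILL_ACTIVE means the process has not terminated yet.
        if (exit_code != STILL_ACTIVE) {
            status = exit_code;
            final_cpu_time = cpu_time();
            return true;
        }
    }
#else
    int wpid, stat;
    struct rusage ru;

    // WNOHANG: return 0 immediately if the child is still running.
    wpid = wait4(pid, &status, WNOHANG, &ru);
    if (wpid) {
        // Child has ended (or the wait failed): record its user CPU time.
        final_cpu_time = (float)ru.ru_utime.tv_sec + ((float)ru.ru_utime.tv_usec)/1e+6;
        return true;
    }
#endif
    return false;
}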


So, if the wrapper is still running, the process is still available.

I never saw such an issue.
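
For readers following the non-Windows branch above, here is a minimal sketch of the kind of loop that could drive such a poll: start a child, then check it with a non-blocking wait at a fixed interval until it ends. It is POSIX-only, and the names `start_child` and `wait_for_child` are hypothetical, not taken from the BOINC wrapper source. It uses `waitpid` instead of `wait4` because the CPU-time accounting is left out.

```cpp
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

#include <chrono>
#include <thread>

// Hypothetical helper: run `sh -c command` in a child process.
// Returns the child's PID, or -1 if fork() fails.
pid_t start_child(const char* command) {
    pid_t pid = fork();
    if (pid == 0) {
        execl("/bin/sh", "sh", "-c", command, static_cast<char*>(nullptr));
        _exit(127);  // only reached if exec itself fails
    }
    return pid;
}

// Hypothetical wait loop: check the child every `period` without blocking.
// Returns the child's exit code, or -1 if it did not exit normally
// or could not be waited for.
int wait_for_child(pid_t pid, std::chrono::milliseconds period) {
    if (pid <= 0) return -1;
    for (;;) {
        int status = 0;
        pid_t wpid = waitpid(pid, &status, WNOHANG);
        if (wpid == pid) {
            // The child has ended; report a normal exit code if there is one.
            return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
        }
        if (wpid == -1) return -1;  // the wait failed, e.g. not our child
        // wpid == 0: the child is still running, so sleep and poll again.
        std::this_thread::sleep_for(period);
    }
}
```

A call such as `wait_for_child(start_child("exit 3"), std::chrono::seconds(1))` polls once per second and returns as soon as a poll sees that the child has ended.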

Re: ECM Wrapper; ecm.exe not in RAM

Post by mister.marmot » 23.11.2018 18:45

yoyo wrote: So if the wrapper doesn't continue it means the ecm process hasn't finished.


Just wanted to clarify.

All my VMs have 4 cores, and what I see is a VM using only 2 cores for an hour or more.
When I open the VM and look at its process manager, the ecm_xxxx wrapper is in RAM but ecm.exe is not.
I check with Process Hacker and look for hidden processes, but because only 2 of the 4 cores are in use (by Siever or Cruncher), it is obvious that ecm.exe isn't running.
BOINC reports the WU as 98% complete, 99.3% complete, etc.

If I shut down BOINC and restart it, the wrapper starts ecm.exe and resets the progress to 0%.

Maybe ecm.exe has crashed but somehow left a PID in the OS process tables?
How do I know from the logs whether ecm.exe crashed because it ran out of RAM (these VMs have 3 GB available to run 2x ecm.exe and 2x siever)?

Could it be a PID cleanup failure? I shut down almost all the services to get the OS to run in 64 MB of RAM, but PID cleanup should be a kernel function, unaffected by support services.
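
On the idea of an exited process leaving an entry behind: on POSIX systems this is normal until the parent collects the exit status, and a short sketch makes it visible. The code below is an illustration of that POSIX behaviour only and says nothing about what actually happens in these Windows VMs. The names `pid_in_table` and `observe_exited_child` are hypothetical. The rough Windows analogue is that a process object persists for as long as a handle to it remains open.

```cpp
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

// Returns true if signal 0 can be delivered to `pid`, i.e. the PID
// still refers to an entry in the process table.
bool pid_in_table(pid_t pid) {
    return kill(pid, 0) == 0;
}

struct Observation {
    bool present_before_reap;  // child has exited, parent has not collected it
    bool present_after_reap;   // parent has collected the exit status
};

// Fork a child that exits at once, then look at its PID before and
// after the parent reaps it.
Observation observe_exited_child() {
    Observation obs{false, false};
    pid_t pid = fork();
    if (pid < 0) return obs;  // fork failed, nothing to observe
    if (pid == 0) _exit(0);   // child: exit immediately

    // Block until the child has exited, but leave it unreaped (WNOWAIT),
    // so it stays in the process table as a zombie.
    siginfo_t info;
    if (waitid(P_PID, static_cast<id_t>(pid), &info, WEXITED | WNOWAIT) != 0) {
        return obs;
    }
    obs.present_before_reap = pid_in_table(pid);

    // Reap the child; its process-table entry is released.
    waitpid(pid, nullptr, 0);
    obs.present_after_reap = pid_in_table(pid);
    return obs;
}
```

The point of the design is the gap between exit and reaping: a liveness check based only on "does the PID exist" can report an exited child as present, whereas a check that reads the exit status, as the wrapper's poll does, does not depend on that.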

Re: ECM Wrapper; ecm.exe not in RAM

Post by mister.marmot » 23.11.2018 19:10

I'm going through my results at http://www.rechenkraft.net/yoyo/results.php?userid= and it's really hard to tell whether I've had any successful ECM results.

How can I easily tell whether any ECM WUs have finished?

Re: ECM Wrapper; ecm.exe not in RAM

Post by mister.marmot » 15.12.2018 02:00

yoyo wrote:
So, if the wrapper is still running, the process is still available.

I never saw such an issue.


Nevertheless, it is happening on all 26 of my VMs, across 4 different servers, that attempted to run ECM. They run other WUs successfully, including YAFU's ECM phases.

The ECM wrapper will sit for hours without ECM.exe in the process list.

Since the YAFU wrapper has no problem keeping ECM.exe in RAM on the same VMs, please check the differences in the code between the two wrappers.