Dear Sirs,
We have analysis of blast ALE methodology. We have a problem MPP version of LS-Dyna-Linux based cluster. When we restart the dump file, we have a error as you follows;
"memory=40000m memory2=1000m r=d3dump01 X=6000
**************** SOLVER FINISHED ***************** Solver finished with return code: 0
slurmstepd: error: Detected 1 oom-kill event(s) in step 1821.batch cgroup. Some of your processes may have been killed by the cgroup out-of-memory handler."
What do you think about this? Do you have any solve about this problem.
Thank You..
Best Regards...
Oof, I'm not sure. I would suggest to try and initialize it locally on your computer as a test (if possible). Do you have support for the Linux based cluster or is it an in-house thing? I would contact support if you do and if you can initiate the simulation locally.
If you get an error when trying to initialize the simulation locally, then it should provide more detail into what the issue with the model is and I will be able to help you from there.
Typically I have seen this error for clusters when you don't specify enough RAM, I know for my MPP simulations I set Memory = 500M and Memory2 = 500M and changed the requested RAM usage until the program ran. For my setup I needed a minimum of 125GB of RAM.