- Posts: 8
- Thank you received: 0
If you have problems during the execution of MRCC, please attach the output with an adequate description of your case as well as the followings:
This information really helps us during troubleshooting
- the way mrcc was invoked
- the way build.mrcc was invoked
- the output of build.mrcc
- compiler version (for example: ifort -V, gfortran -v)
- blas/lapack versions
- as well as gcc and glibc versions
This information really helps us during troubleshooting
Error at termination of mrcc
- rkanters
- Topic Author
- Offline
- New Member
Less
More
10 years 3 months ago - 10 years 3 months ago #23
by rkanters
Error at termination of mrcc was created by rkanters
I am still playing a bit with MRCC and ran into a case where I get:
Fatal error in mrcc.
Program will stop.
************************ 2014-08-21 09:56:53 *************************
Error at the termination of mrcc.
**********************************************************************
I have attached the output file and MINP.
What did I do wrong here?
Thanks,
René
PS I compiled with Intel -pOMP using intel 11 compilers on a RH 6.5 system.
PPS I just thought of adding the stderr output from SGE to this email, which may be helpful?:
forrtl: Invalid argument
forrtl: severe (39): error during read, unit 21, file /tmp/574.1.anynode.q/fort.21
Image PC Routine Line Source
mrcc 00000000005BD42D Unknown Unknown Unknown
mrcc 00000000005BBF35 Unknown Unknown Unknown
mrcc 0000000000569920 Unknown Unknown Unknown
mrcc 000000000050D41A Unknown Unknown Unknown
mrcc 000000000050CC10 Unknown Unknown Unknown
mrcc 00000000005287B7 Unknown Unknown Unknown
mrcc 0000000000526A08 Unknown Unknown Unknown
mrcc 00000000004F1FA3 Unknown Unknown Unknown
mrcc 00000000004D5F89 Unknown Unknown Unknown
mrcc 00000000004CFA58 Unknown Unknown Unknown
mrcc 00000000004C7A80 Unknown Unknown Unknown
mrcc 0000000000435B84 Unknown Unknown Unknown
mrcc 0000000000409CAF Unknown Unknown Unknown
mrcc 000000000040414C Unknown Unknown Unknown
libc.so.6 0000003C84E1ED1D Unknown Unknown Unknown
mrcc 0000000000404049 Unknown Unknown Unknown
Fatal error in mrcc.
Program will stop.
************************ 2014-08-21 09:56:53 *************************
Error at the termination of mrcc.
**********************************************************************
I have attached the output file and MINP.
What did I do wrong here?
Thanks,
René
PS I compiled with Intel -pOMP using intel 11 compilers on a RH 6.5 system.
PPS I just thought of adding the stderr output from SGE to this email, which may be helpful?:
forrtl: Invalid argument
forrtl: severe (39): error during read, unit 21, file /tmp/574.1.anynode.q/fort.21
Image PC Routine Line Source
mrcc 00000000005BD42D Unknown Unknown Unknown
mrcc 00000000005BBF35 Unknown Unknown Unknown
mrcc 0000000000569920 Unknown Unknown Unknown
mrcc 000000000050D41A Unknown Unknown Unknown
mrcc 000000000050CC10 Unknown Unknown Unknown
mrcc 00000000005287B7 Unknown Unknown Unknown
mrcc 0000000000526A08 Unknown Unknown Unknown
mrcc 00000000004F1FA3 Unknown Unknown Unknown
mrcc 00000000004D5F89 Unknown Unknown Unknown
mrcc 00000000004CFA58 Unknown Unknown Unknown
mrcc 00000000004C7A80 Unknown Unknown Unknown
mrcc 0000000000435B84 Unknown Unknown Unknown
mrcc 0000000000409CAF Unknown Unknown Unknown
mrcc 000000000040414C Unknown Unknown Unknown
libc.so.6 0000003C84E1ED1D Unknown Unknown Unknown
mrcc 0000000000404049 Unknown Unknown Unknown
Attachments:
Last edit: 10 years 3 months ago by rkanters. Reason: realized that some extra information could be helpful. I put it in the PPS part
Please Log in or Create an account to join the conversation.
- kallay
- Offline
- Administrator
- Mihaly Kallay
10 years 3 months ago #24
by kallay
Best regards,
Mihaly Kallay
Replied by kallay on topic Error at termination of mrcc
Dear René,
I couldn't reproduce the error, however, I am using 4 cores (for the moment I have no access to a machine with 12 cores). Could you please try to run the job on 4 cores to see if the problem is related to the number of cores or to the compiler version? Thank you in advance.
I couldn't reproduce the error, however, I am using 4 cores (for the moment I have no access to a machine with 12 cores). Could you please try to run the job on 4 cores to see if the problem is related to the number of cores or to the compiler version? Thank you in advance.
Best regards,
Mihaly Kallay
Please Log in or Create an account to join the conversation.
- rkanters
- Topic Author
- Offline
- New Member
Less
More
- Posts: 8
- Thank you received: 0
10 years 3 months ago #25
by rkanters
Replied by rkanters on topic Error at termination of mrcc
It failed at the exact same spot. Then I tried without ovirt=MP2 and it still fails (the CC iterations take quite a bit longer) but in that case it gets to Spin case 22 Alpha:1 Beta:2 with 632700 excitations before it fails.
I just did run a regular CCSD calculation without a problem.
René
I just did run a regular CCSD calculation without a problem.
René
Please Log in or Create an account to join the conversation.
- kallay
- Offline
- Administrator
- Mihaly Kallay
10 years 3 months ago #26
by kallay
Best regards,
Mihaly Kallay
Replied by kallay on topic Error at termination of mrcc
Then it is probably a compiler issue. Again, I would suggest upgrading to the last Intel version (14.0.2). We invested much work in adapting our codes to the compiler bugs of that version, and the program is rather stable with it.
You can also try the following things:
1) Increase the memory available to the program.
2) Compile the code without OpenMP parallelization. Then, of course, the program will run on one core.
3) Since you are interested in a full CCSDTQ calculation, you don't need to run the CCSD(T) calculation (where the program crashes). That is, delete rest=2 from the input, then only the CCSDTQ calculation will be performed. Another option is to run a CCSDT calculation first, and then to perform the CCSDTQ calculation with rest=1 in the input.
You can also try the following things:
1) Increase the memory available to the program.
2) Compile the code without OpenMP parallelization. Then, of course, the program will run on one core.
3) Since you are interested in a full CCSDTQ calculation, you don't need to run the CCSD(T) calculation (where the program crashes). That is, delete rest=2 from the input, then only the CCSDTQ calculation will be performed. Another option is to run a CCSDT calculation first, and then to perform the CCSDTQ calculation with rest=1 in the input.
Best regards,
Mihaly Kallay
Please Log in or Create an account to join the conversation.
- rkanters
- Topic Author
- Offline
- New Member
Less
More
- Posts: 8
- Thank you received: 0
10 years 3 months ago #27
by rkanters
Replied by rkanters on topic Error at termination of mrcc
Dear Mihaly,
I really would like to use threading, so I don't like option 2. I think option 1 is out because of cost, but I was wondering whether you think compiling with gnu compilers could work out. Or would it be possible for you to distribute an OMP binary version with statically linked libraries?
Regarding 3) I started to go down that way starting with a CCSD calculation followed by CCSDT and rest=1. Things seem to be going along (still at the CC iterations stage), but I do get the message from xmrcc (for Si4Li isomers that have are C1, I didn't get it for another one that has Cs symmetry):
The number of active orbitals has been changed!
It is dangerous to restart the program!
Just before mrcc kicks in and goes ahead.
By the way, not sure whether that has anything to do with it, but I compiled the mrcc.2014-07-10 version.
Cheers,
René
PS is the output from the program buffered? It seems that it often takes a while before the output file is update (usually with the output for several CC iterations at the same time).
I really would like to use threading, so I don't like option 2. I think option 1 is out because of cost, but I was wondering whether you think compiling with gnu compilers could work out. Or would it be possible for you to distribute an OMP binary version with statically linked libraries?
Regarding 3) I started to go down that way starting with a CCSD calculation followed by CCSDT and rest=1. Things seem to be going along (still at the CC iterations stage), but I do get the message from xmrcc (for Si4Li isomers that have are C1, I didn't get it for another one that has Cs symmetry):
The number of active orbitals has been changed!
It is dangerous to restart the program!
Just before mrcc kicks in and goes ahead.
By the way, not sure whether that has anything to do with it, but I compiled the mrcc.2014-07-10 version.
Cheers,
René
PS is the output from the program buffered? It seems that it often takes a while before the output file is update (usually with the output for several CC iterations at the same time).
Please Log in or Create an account to join the conversation.
- kallay
- Offline
- Administrator
- Mihaly Kallay
10 years 3 months ago #28
by kallay
Best regards,
Mihaly Kallay
Replied by kallay on topic Error at termination of mrcc
Dear René,
We have created statically linked binaries, please find them in the download area.
I wouldn't recommend the GNU compiler, the GNU-compiled executables are rather slow.
The output is not buffered, the I/O buffers are flushed in each iteration.
We have created statically linked binaries, please find them in the download area.
I wouldn't recommend the GNU compiler, the GNU-compiled executables are rather slow.
The output is not buffered, the I/O buffers are flushed in each iteration.
Best regards,
Mihaly Kallay
The following user(s) said Thank You: rkanters
Please Log in or Create an account to join the conversation.
Time to create page: 0.044 seconds