Skip to Content.
Sympa Menu

forum - Re: [abinit-forum] Errors using band parallelisation

forum@abinit.org

Subject: The ABINIT Users Mailing List ( CLOSED )

List archive

Re: [abinit-forum] Errors using band parallelisation


Chronological Thread 
  • From: gilles zerah <gilles.zerah@wanadoo.fr>
  • To: forum@abinit.org
  • Subject: Re: [abinit-forum] Errors using band parallelisation
  • Date: Thu, 17 May 2007 16:42:52 +0200

Dear Andreas,
I looked into your case.
It yields ngfft 40 40 320 and 63 bands. You might try another band/fft parallelization.
This approach relies on MPI_ALLTOALL, so you need a good network
and a good implementation of MPI.

To try this different approach of the band parallelisation:

-compile a recent version (e.g. 5.3.4) with MPI_FFT flag instead MPI

-perform a permutation on your data (acell, kptrlatt, xred etc..) to bring z in place of x

-use the flags indicated under the keyword npband.
You might want to try different combinations of npband and npfft.

Tell us what happens,

Have fun,

Gilles
Dear abiniters,

I hope you can help me with some issues with the band-parallelisation using the variables "wfoptalg" and "nbdblock".
Calculations without this band-parallelisation run normally with nkpt = 6, but take really long. To speed my calculations up, I chose to use the band parallelisation option of abinit,
wfoptalg 1
nbdblock 2,
which makes it, in my case, possible to use 12 processors.

Sadly this parallelisations doesn´t work properly.
The error-message I get is
"forrtl: severe (174): SIGSEGV, segmentation fault occurred"


So I tested the differences in the three following cases:

So, without parallelisation
# wfoptalg 1
# nbdblock 2
-> calculation is running

With parallelisation-variables, but calculating one band per block (equals no parallelisation)
wfoptalg 1
nbdblock 1
-> calculation is running

With parallelisation, two bands per block
wfoptalg 1
nbdblock 2
-> error-message "forrtl: severe (174): SIGSEGV, segmentation fault occurred"
(This message appears for every processor)


I hope you can help me to get this working,

best regards
Andreas Boehner


PS:
INPUT-FILE:
chkprim 0
tsmear 0.01
occopt 7
fband 0.5
prtden 1
#irdwfk 1

*wfoptalg 1
nbdblock 2*

# Positions-Optimierung
# ionmov 3
# ecutsm 0.5
# ntime 10


# EZ
acell 7.48 7.48 58.588468

rprim
1 0 0
0 1 0
0 0 1

# Atomsorte
ntypat 1
znucl 24

# Aufbau des Gitters
natom 18
typat 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 xred
0 0 0 # Ebene 1 Flaeche
0.5 0.5 0
0.5 0 0.0451627 # Ebene 2 Innenzentriete Atome
0 0.5 0.0451627
0 0 0.0903255 # Ebene 3
0.5 0.5 0.0903255
0.5 0 0.1354882 # Ebene 4
0 0.5 0.1354882
0 0 0.180651 # Ebene 5
0.5 0.5 0.180651
0.5 0 0.2258138 # Ebene 6 0 0.5 0.2258138
0 0 0.2709765 # Ebene 7
0.5 0.5 0.2709765
0.5 0 0.3161393 # Ebene 8
0 0.5 0.3161393
0 0 0.3613021 # Ebene 9 Cr-bcc Ende
0.5 0.5 0.3613021

# Konvergenzparamter
ecut 35
kptopt 1
#prtkpt 1
kptrlatt # 6 kpt kptrlen = 44.88xxx
6 0 0 0 6 0 0 0 1

nshiftk 1
shiftk 0.5 0.5 0.5

# SCF
nstep 30
toldfe 1.0d-6














Archive powered by MHonArc 2.6.16.

Top of Page