
forum - Re: [abinit-forum] problems with abinip parallel run

forum@abinit.org

Subject: The ABINIT Users Mailing List ( CLOSED )

  • From: "Anglade Pierre-Matthieu" <anglade@gmail.com>
  • To: forum@abinit.org
  • Subject: Re: [abinit-forum] problems with abinip parallel run
  • Date: Fri, 29 Aug 2008 23:49:41 +0200

Hi,

Unfortunately, I don't have any better ideas than Matthieu. One thing
puzzles me, though: ABINIT reports that it was built for Linux, yet your
log file looks as if it had been processed on Windows. That seems odd to
me. Could it be related to your problem?

regards

PMA

On Fri, Aug 29, 2008 at 10:44 PM, Alessandro Fortunelli
<fortunelli@ipcf.cnr.it> wrote:
> Dear Matthieu,
>
> I tried all of your suggestions, but I could not solve the
> problem: I confirmed that my jobs do not run in parallel, but I do not
> understand why. It is possibly something connected with mpich, so I was
> looking for somebody who had already run into the same problem.
>
> Thanks anyway for your suggestions,
>
> Alessandro
>
>
> On 29/8/2008, "matthieu verstraete"
> <matthieu.jean.verstraete@gmail.com> wrote:
>
>>your number of kpoints is equal to mkmem, so your abinip is certainly not
>>running on 2 processors...
>>
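The mkmem argument above can be made concrete with a small sketch. It assumes that k-points are divided as evenly as possible among the processes, so that each process keeps at most ceil(nkpt / nproc) k-points in memory, and it ignores spin polarization. The function names are hypothetical and are not part of ABINIT.

```python
import math


def expected_mkmem(nkpt: int, nproc: int) -> int:
    """Most k-points one process would hold, assuming an even split (an assumption)."""
    if nkpt < 1 or nproc < 1:
        raise ValueError("nkpt and nproc must be positive")
    return math.ceil(nkpt / nproc)


def looks_serial(nkpt: int, mkmem: int, nproc: int) -> bool:
    """True when the reported mkmem exceeds what an even split over nproc would give."""
    return nproc > 1 and mkmem > expected_mkmem(nkpt, nproc)
```

With nkpt and mkmem read from the output file, `looks_serial(nkpt, mkmem, 2)` returning True would be consistent with a run that is not actually distributed over 2 processors.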
>>try with mynodes=
>>growth1
>>growth1
>>
>>for instance, or even better, using 2 machines so you can be sure both are
>>being used
>>
>>Are you sure of the syntax for mpiexec? I usually use mpirun (which has the
>>syntax you give), and there may be a difference...
>>
>>
>>> >Do you get any message in your log or output file related to this problem
>>> ?
>>>
>>> Well, as you can see from the attached log and output files, there are
>>> indications that the system is using only one cpu (e.g., only node 0 is
>>> mentioned), but I cannot understand wh
>>
>>the main log is only written to by the mother process. Check the other _LOG_
>>files. Again if they are not there you are not running in parallel...
>>
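Checking for the per-process log files can be scripted. This is a minimal sketch that only assumes the extra logs have "_LOG_" somewhere in their file name, as mentioned above; the exact naming scheme is not stated in this thread, and `find_process_logs` is a hypothetical helper.

```python
from pathlib import Path
from typing import List


def find_process_logs(run_dir: str) -> List[str]:
    """Return the sorted names of regular files in run_dir whose name contains '_LOG_'."""
    return sorted(
        p.name
        for p in Path(run_dir).iterdir()
        if p.is_file() and "_LOG_" in p.name
    )
```

An empty list for the run directory would suggest that only the main process wrote a log.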
>>
>>>
>>> ------------------------------------------------------------------------------
>>>
>>> I followed the run with the top command for the two minutes or so that
>>> it lasted, and I always saw ONLY ONE abinip running. The CPU time was
>>> identical to that of abinis.
>>
>>In a short run, much of the time may be spent with only one processor
>>running actively. You might try making the run a bit longer (increase ecut
>>or nkpt, for example). In top, press "u" and then enter your user name to
>>see all of your processes (including inactive ones). You should have 2 or 4
>>abinip processes (the doubling can happen on some platforms because of the
>>way the MPI implementation works; the extra 2 do nothing and just sit
>>there. Does anyone know why this happens?)
>>
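As an alternative to watching top interactively, the process count can be taken from `ps`. This is a sketch under the assumption that a POSIX-style `ps` accepting `-u USER -o comm=` is available; both function names are hypothetical.

```python
import getpass
import subprocess
from typing import Optional


def count_processes(ps_output: str, name: str = "abinip") -> int:
    """Count lines of `ps -o comm=` style output whose command name equals `name`."""
    return sum(
        1
        for line in ps_output.splitlines()
        # Some systems print the full path, so compare only the last component.
        if line.strip().rsplit("/", 1)[-1] == name
    )


def count_running(name: str = "abinip", user: Optional[str] = None) -> int:
    """Run ps for one user (default: current user) and count processes called `name`."""
    user = user or getpass.getuser()
    result = subprocess.run(
        ["ps", "-u", user, "-o", "comm="],
        capture_output=True, text=True, check=False,
    )
    return count_processes(result.stdout, name)
```

Calling `count_running()` while the job is active should give 2 (or 4 on platforms where the doubling described above occurs) if the run is really parallel.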
>>good luck
>>
>>
>>Matthieu
>>
>>
>
> Alessandro Fortunelli
> IPCF-CNR, via G. Moruzzi, 1
> 56124 - Pisa -Italy
> e-mail: fortunelli@ipcf.cnr.it
> tel. +39-050-3152447
> fax +39-050-3152442
> cel. +39-349-2987108
> Home-page: http://h2.ipcf.cnr.it/alex/af.html
>



--
Pierre-Matthieu Anglade


