[FLASH-USERS] Details of speedup after restart
sanjib gupta
guptasanjib at lanl.gov
Tue Jun 5 18:57:43 CDT 2007
Hi,
I am attaching 2 log files - the initial run on 128 processors, then
immediately killing the job and restarting from the first checkpoint
file "hc-rt-hdf5_chk_0000"
notice about 4 timesteps per second initially, then ~30 timesteps/sec
after restart.
On 64 processors I noticed the gain was higher , but my resolution was
lower (half the number of nblocky, same nblockx, this is a 2D run)-
sorry did not keep the logfiles.
However this "gain" cannot be predicted......sometimes I don't get it on
the first restart, so I restart a couple of times!
As you'all can guess, this plays havoc with any benchmarking efforts
.......and we do intend to showcase our results from FLASH soon ... :-)
We compile with intel fortran 9.1.033 and openmpi 1.1 on a linux cluster
....and hdf5 version 1.6.5 ......Makefile.h is attached.
Architecture - 64 bit AMD Opteron
running FC3 linux + BProcV4 (cluster OS) with kernel
Thanks much for your help/insight/suggestions,
Sanjib.
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: hc-rt-firstrun_06_05_07.log
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0002.pl
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: hc-rt-restart_06_05_07.log
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0003.pl
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: Makefile.h
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0001.h
More information about the flash-users
mailing list