[FLASH-USERS] Details of speedup after restart

sanjib gupta guptasanjib at lanl.gov
Tue Jun 5 18:57:43 CDT 2007


Hi,

I am attaching 2 log files - the initial run on 128 processors, then 
immediately killing the job and restarting from the first checkpoint 
file "hc-rt-hdf5_chk_0000"
notice about 4 timesteps per second initially, then ~30 timesteps/sec 
after restart.

On 64 processors I noticed the gain was higher , but my resolution was 
lower (half the number of nblocky, same nblockx, this is a 2D run)- 
sorry did not keep the logfiles.

However this "gain" cannot be predicted......sometimes I don't get it on 
the first restart, so I restart a couple of times!
As you'all can guess, this plays havoc with any benchmarking efforts 
.......and we do intend to showcase our results from FLASH soon ...   :-)

We compile with intel fortran 9.1.033 and openmpi 1.1 on a linux cluster 
....and hdf5 version 1.6.5 ......Makefile.h is attached.
Architecture - 64 bit AMD Opteron
running FC3 linux  + BProcV4 (cluster OS) with kernel 
Thanks much for your help/insight/suggestions,
Sanjib.
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: hc-rt-firstrun_06_05_07.log
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0002.pl 
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: hc-rt-restart_06_05_07.log
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0003.pl 
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: Makefile.h
Url: http://flash.uchicago.edu/pipermail/flash-users/attachments/20070605/172918eb/attachment-0001.h 


More information about the flash-users mailing list