[Zinc-fans] requesting peer suggestions on docking and duplication

John J. Irwin jji at cgl.ucsf.edu
Fri May 16 23:14:07 PDT 2008


Hi Martin

That's interesting. We like to keep database tranches small so we can 
run dock in parallel (course grained parallel, completely separate 
processes) on dozens or even hundreds of cores, and then combine the 
results together at the end. Most computers nowadays come with 4 or even 
8 effective cores inside, so running dock in smaller chunks makes 
efficient use of your available hardware.

John
UCSF ZINC Team

Martin Stoermer wrote:
> Mornign everyone,
>
> another thing to remember is that if you do concatenate the zinc files 
> (as we do) is that some docking packages will run into memory problems 
> after too many ligands. For example we have had problems with GOLD. 
> When I concatenated all 113 of the ZINC Druglike sdf "chunks" you get 
> a 9.6GB SD file. When I ran a GOLD job on our cluster across 8 nodes 
> if dies after ~600K ligands with error messages like:
>
> libpvm [t40002]: pvm_pklong(): Value too large
> Pvm Function pvm_pklong( (long) start_point, 1, 1), called from 
> GOLD_COMM_SendNextDock, caused an error: Value too large
>
> ******************************************************************************
> warning: Slave process 786433 on host node21.***.***.***.*** failed 
> with following error message:
> Gold internal failure (3.2) in docking ligand 0 in file 
> /chem_db/Drug_like/GOLD/Drug_like_3d.sdf
> ******************************************************************************
> WARNING: pvm task problem 786433
>  Fatal error: 
> Received interrupt [signal number 15]
>
> I can't speak for how Autodock, Dock and the others handle these large 
> datasets, I haven't tried to bludgeon them with 2.5 million compounds 
> yet. Maybe others will have that answer. So it may be more sensible to 
> limit yourself to smaller sets e.g. one concatentated vendor database 
> at a time.
>
> cheers,
> Martin
>
>
>>  
>> The question is, if we merge all the files into one single giga SDF 
>> file, are the ZINC IDs unique to all the entries, meaning, can we 
>> safely traceback the ligand of interest to its representative pH subset?
>>  
>>  
>> With best regards,
>> Elite158
>>  
>> _______________________________________________
>> Zinc-fans mailing list
>> Zinc-fans at docking.org <mailto:Zinc-fans at docking.org>
>> http://blur.compbio.ucsf.edu/mailman/listinfo/zinc-fans
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Zinc-fans mailing list
> Zinc-fans at docking.org
> http://blur.compbio.ucsf.edu/mailman/listinfo/zinc-fans
>   


More information about the Zinc-fans mailing list