[Zinc-fans] total number of compounds in vendors subset -reg

John J. Irwin jji at cgl.ucsf.edu
Wed May 28 07:30:06 PDT 2008


Hi Rafi

Thanks for your email and your interest in ZINC. Sorry to take so long 
to get back to you.

I have recently exported a fresh copy of Sigma Aldrich in ZINC 8 
(http://zinc8.docking.org). There are 17,931 molecules in the source 
catalogs, and 15,186 in ZINC. We downloaded every SDF file we could find 
on the Sigma Aldrich website. I've ordered the CD, and will include any 
additional molecules that may be there.

Previously we included the "rare" library from Sigma Aldrich, based 
on files we received perhaps 5 years ago. There were nearly 200,000 of 
these. Since these are no longer available on the Sigma Aldrich website, 
they have been removed from ZINC. I think this change may account for 
some of the discrepancies you saw.
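
To check how many molecules a downloaded file actually contains, a count can
be taken directly from the file. The following is a minimal sketch, not part
of ZINC itself. It assumes the SMILES file holds one molecule per non-blank
line with no header line, and it relies on the mol2 convention that each
molecule record begins with an @<TRIPOS>MOLECULE line. The function names
are hypothetical.

```python
MOL2_RECORD = "@<TRIPOS>MOLECULE"  # line that starts each mol2 molecule record


def count_smiles(lines):
    """Count molecules, assuming one SMILES entry per non-blank line."""
    return sum(1 for line in lines if line.strip())


def count_mol2(lines):
    """Count molecules by counting @<TRIPOS>MOLECULE record headers."""
    return sum(1 for line in lines if line.strip() == MOL2_RECORD)


def count_file(path):
    """Pick a counter from the file extension (assumption: .mol2 or SMILES)."""
    counter = count_mol2 if path.endswith(".mol2") else count_smiles
    with open(path) as handle:
        return counter(handle)
```

Calling count_file on the downloaded file (whatever it was saved as) gives a
number that can be compared with the figures shown in the subset table. If
the SMILES file does have a header line, the count will be one too high.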

Good luck

John
UCSF ZINC Team




rafi A wrote:
> Hello,
>  
>
> Where can we find the total number of compounds in a subset?
>
>  
>
> For example I want to download the vendors/sigma Aldrich subset.
>
>  
>
> In the table, the column "Catalog information: Source entries" shows 295,562.
>
> Another column, "ZINC information: Loaded", shows 115,595. So I expected 
> the total number of molecules to be either 295,000 or 115,000.
>
>  
>
> But when I downloaded the mid pH (SMILES or mol2), it shows only 
> 14,449 molecules.
>
>  
>
> Did I misunderstand something? Or can you tell me where I can find the 
> total number of molecules in a subset before downloading.
>
>  
>
> Thanks in advance.
>
>  
>
> Best regards,
>
> Rafi
>

