Last edited: 18/11/2011
By: VLG
Sourceforge: SA2 website
Help: SA2 forums
Documentation index
This page describes the detailed procedure used to import a new molecule (and eventually some properties) in the database.
The first important thing to know is that screening assistant represents the structure of your molecule using the MDL MOL-formatted connection table. All further information in each SD entry (basically everything that comes after the M_END flag) will be skipped.
Even more importantly, absolutely no modification will be performed on your original connection table! The connection table will be stored as provided, and SA2 don't even add hydrogens, remove salts or perform any other action on your original entry (at least in the current default version of SA2 - see next section). The standardization of each molecules is therefore up to you!
Why is that so? Well, the standardisation of molecules is usually laboratory-specific, and even sometimes problem-specific (e.g. for descriptors calculation). Thus, we decided that SA2 will basically percieve molecules just as provided by you, with absolutely no further processing.
It is however still possible to include your own standardization procedure directely within SA2, but this is to be documented (see the Pre-processing section), and you will need a JAVA-programmer to do so.
SA2 stores unique molecules. We use the IUPAC InChI Identifier to represent each molecule and detect a duplicate molecule. Thus, tautomers will be detected as duplicated molecule (at least as far as the InChI algorithm can detect them...). However, a charged carboxilic acid will be detected as a different molecule compared to its neutral form.
SA2 also takes into account stereochemistry when computing the InChI.
We told you previously that SA2 do not change your molecules in any partucular way. This is clearly true in the current default version of SA2. However, SA2 makes it possible to define (if you know java programming ;)) so-called transformers. A transformer will generated a new SD file based on the original one, and thus eventually perform some custom, developper-defined standardization on each molecule.
As already said, there is currently no transformer available in the default version of SA2, but it's worse to note their existence if you know JAVA, and as they might come up in future major release of SA2 as optional features.
Lets now enter in more details on the import process. Before processing each molecule successively, we do the following:
Once done, we loop over each SDF entry ($$$$ separated), and for each entry, we do the following:
Here is a graphical outline of the import workflow: