Package chemaxon.descriptors
Class PFParameters
java.lang.Object
chemaxon.descriptors.MDParameters
chemaxon.descriptors.PFParameters
Manages 2D pharmacophore fingerprint parameters. This class has multiple
roles: 1. reads, stores and provides parameters for
PharmacophoreFingerprint
;
and 2. acts as a shared memory area of all PharmacophoreFingerprint
instances.
This latter serves pure running-time and storage efficiency needs by removing
redundant data from PharmacophoreFingerprint
objects and easing the communication
(that is passing data) between parts of the program. Example for these
include the following data members: compressedData, xyz, asymmetricXYZSum.
Most of these members have public access for the sake of ease. (The
use of private access members and the introduction of corresponding
setter/getter methods would not increase the safety (or any other quality)
quality of the code at all (as they would not add any further functionality,
than the mere change or retrieval of the value, which can now be done
directly).)- Since:
- JChem 2.2
-
Field Summary
Modifier and TypeFieldDescriptionfloat
Stores last calculatred asymmetric Tanimoto sum.protected chemaxon.pharmacophore.ColorDecoder
atom set colorsprotected byte[][]
Buffer for compressed data, used inMolecularDescriptor.toData( final int[] )
.static final float
static final float
static final int
static final int
default valuesstatic final String
protected ArrayList
fuzziness factors of parametrized metricsprotected float[][]
fuzzi exponent for the FBPA metric, calculated from the fuzziness factorprotected float[][]
Fuzzy smoothing values for gaussian smoothing.protected String[]
Pharmacophore point type names.protected float[][][]
Fuzzy smoothing values for smoothing with user defined smoothing vector.float[]
ThecalculateXYZ()
methods (in PharmacophoreFingerprint sub-classes) calculate three numbers (denoted by x,y and z), but methods in java cannot have output parameters.Fields inherited from class chemaxon.descriptors.MDParameters
asymmetryFactors, cellSize, cellwiseWeights, configFilePath, currentMetricIndex, data, decForm, DEFAULT_ASYMMETRY_FACTOR, DEFAULT_OUTPUT_PRECISION, DEFAULT_SCALE_FACTOR, DEFAULT_WEIGHT, defaultWeight, document, generator, internalSize, length, md, metricIndexes, normalized, outputPrecision, parametrizedMetricNodes, parametrizedMetrics, parametrizedMetricsNode, scaleFactors, screeningConfigurationNode, similarityNode, standardizer, standardizerConfigurationNode, thresholds, tverskyA, tverskyB, weights
-
Constructor Summary
ConstructorDescriptionCreates an empty object.PFParameters
(File configFile) Creates a new object based on a given configuration file.PFParameters
(String XMLconfig) Creates a new object based on a given configuration string. -
Method Summary
Modifier and TypeMethodDescriptionprotected int
appendParametrizedMetric
(String name, String metric) Extends internal data with a new parametrized metric.void
Sets parameters from an XML file.void
fromString
(String parameterString) Sets parameters from a string representation.protected String[]
generate
(Molecule m, MolecularDescriptor cd) Calls the correspondingMolecularDescriptorGenerator
and generates the descriptor for the given molecule.Color[]
Gets the coloring of atoms as defined in the XML configuration file.int[]
Gets the individual atom colors by pharmcophore point type.String[]
Gets the name of atom sets (pharmacophore point type set), as defined in the configuration file/string.float[]
getCustomFuzzyIncrements
(int fa, int fb) Gets fuzzy increments for the fuzzy smoothing with a user-defined custom distribution (fuzzy smoothing vector).Gets the default XML configuration string.static String
Gets the default configuration of the standardizer.float[][]
Gets the precalculated fuzzy exponent used in the Fuzzy Bipolar Autocorrelogram (FBPA) metric.float
float
Gets the fuzzy smoothing factor used in the construction of fuzzy fingerprints.float[][]
Gets fuzzy increments for the fuzzy smoothing with a normal distribution.boolean
int
Gets the miximum distance of pharmacophore points that is considered.int
Gets the minimum distance of pharmacophore points that is considered.int
Gets the number of histograms bars, that is that number of distance values disguished.int
The number of pharmacophore features (aka types, properties) used in the fingerprints.protected int
getNumberOfWeights
(int parametrizedMetricIndex) Gets the number of weight factors used by the specified metric.chemaxon.pharmacophore.PSymbols
Gets the pharmacophore type symbols used in the current configuration.int
Gets the resolution of histograms.getSymbol
(int i) Gets the symbol associated with the specified Pharmacophore.int
getSymbolIndex
(String symbol) Gets the index associated with the specified Pharmacophore symbol.protected boolean
importNodes
(org.dom4j.Document doc, boolean merge) Imports nodes from the specifiedDocument
into the current (main)Document
.protected void
init()
Initializes the object.protected void
Creates and initializes aPFGenerator
instance.protected void
Initializes those data members that depend on the XML configuration but are not directly taken from it.boolean
Gets whether the fuzzy fingerprints are used or not.boolean
Gets if fuzzy smmoothing is based on normal distribution.boolean
protected void
processDocument
(boolean all) Searches the DOM tree for relevant nodes and sets internal variables to some these nodes for the sake of easier information processing.protected void
Processes allParametrizedMetric
nodes in the DOM tree.protected void
readMetricWeights
(org.dom4j.Element parametrizedMetric, int metricIndex) protected void
readValues
(boolean all) Picks attribute values from the document tree that are relevant to the actualMDParameters
sub-class.void
setCurrentParametrizedMetric
(int metricIndex) Selects a metric to be used.void
setFuzzinessFactor
(float alpha) Sets the value of the fuzziness factor, used in fuzzy bipolar pharmacophore autocorrelogram (FBPA) based convolution product in non-parametrized (generic) FBPA metric.void
setFuzzyCutOff
(float fco) void
setFuzzyLowerBound
(int flb) void
setFuzzySmoothingFactor
(float fsf) Sets the value of the fuzzy smoothing factor used in the generation of fuzzy fingerprints.void
setIgnoreRotatableBonds
(boolean irb) void
setParameters
(File parametersFile) Sets parameters from an XML file representation overwriting all previous settings with the new ones.void
setParameters
(String parametersString) Sets parameters from an XML string representation overwriting all previous parameters settings with the new ones.void
setPMAPTagName
(String tagName) Sets the name of the pharmacophore map tag to be considered in the input SDfile.void
setSymmetricalFuzzy
(boolean sf) void
setUsePMAP
(String pmapTagName) Pharmacophore map data in imported Molecule-s should be used.void
setWeights
(float[] featureWeights, float[] distanceWeights) Sets the cell-wise weight factors for the current parametrized metric.toString()
Returns the XML configuration in a string.int
tripletIndex
(int f1, int f2, int d) Calculates the index of a feature pair and distance triplet in the linearised representation of histograms.Methods inherited from class chemaxon.descriptors.MDParameters
addParameters, addParameters, addParametrizedMetric, addParametrizedMetricNode, addParametrizedMetricsNode, checkDocumentVersion, getAsymmetryFactor, getCellSize, getCurrentMetricIndex, getData, getDecForm, getDescriptorTypeName, getInternalMetricIndex, getInternalSize, getLength, getMetricIndex, getMetricName, getMetricName, getNumberOfMetrics, getNumberOfWeights, getScaleFactor, getScalingHypothesis, getScreeningConfigurationString, getThreshold, getThreshold, getTverskyAlpha, getTverskyBeta, getWeights, isAsymmetric, isCellwiseWeights, isNormalized, isScaled, isStandardizationMandatory, isWeighted, readFromXmlFile, readFromXmlString, setAsymmetryFactor, setCellSize, setCellwiseWeights, setCreateStatistics, setLength, setNormalized, setOutputPrecision, setScaleFactor, setScalingHypothesis, setThreshold, setWeights, standardize, toString, writeMetricParameter
-
Field Details
-
DEFAULT_RESOLUTION
public static final int DEFAULT_RESOLUTIONdefault values- See Also:
-
DEFAULT_FUZZINESS_FACTOR
public static final float DEFAULT_FUZZINESS_FACTOR- See Also:
-
DEFAULT_FUZZY_CUTOFF
public static final float DEFAULT_FUZZY_CUTOFF- See Also:
-
DEFAULT_FUZZY_LOWER_BOUND
public static final int DEFAULT_FUZZY_LOWER_BOUND- See Also:
-
DEFAULT_XML_CONFIG
- See Also:
-
fuzzinessFactors
fuzziness factors of parametrized metrics -
fuzzyExp
protected float[][] fuzzyExpfuzzi exponent for the FBPA metric, calculated from the fuzziness factor -
colDec
protected chemaxon.pharmacophore.ColorDecoder colDecatom set colors -
pptNames
Pharmacophore point type names. These are similar to the above coloring scheme, though instead of colors, this array contains the IDs of pharmacophore point types as defined in the XML configuration. Tha names of combined types (e.g. d - (Donor Minus) are obtained by concatenating the corresponding IDs. -
gFuzzyIncrements
protected float[][] gFuzzyIncrementsFuzzy smoothing values for gaussian smoothing. The first index is the number of rotatable bonds while the second is the distance value. For the sake of efficiency, distances from 0 to maxDist() are stored. -
svFuzzyIncrements
protected float[][][] svFuzzyIncrementsFuzzy smoothing values for smoothing with user defined smoothing vector. The first and the second indexes are the pharmacophore point type indexes the third is the distance value. -
compressedData
protected byte[][] compressedDataBuffer for compressed data, used inMolecularDescriptor.toData( final int[] )
. Element 'i' is either null or is an array of 'i' bytes. -
xyz
public float[] xyzThecalculateXYZ()
methods (in PharmacophoreFingerprint sub-classes) calculate three numbers (denoted by x,y and z), but methods in java cannot have output parameters. We could (1) pass an array of three elements, but in that case the array was either a data member of PharmacophoreFingerprint, which has to be avoided in order to keep the size of a descriptor as small as possible; or alternatively (2) the array could be allocated in the caller ofcalculateXYZ()
each time, which clearly had time and storage overhead. Keeping a preallocated instance in parameters is efficient though apparently not nice in terms of object oriented principles. -
asymmetricXYZSum
public float asymmetricXYZSumStores last calculatred asymmetric Tanimoto sum.
-
-
Constructor Details
-
PFParameters
public PFParameters()Creates an empty object. UsesetConfig( String paramString )
to initialize it properly. -
PFParameters
Creates a new object based on a given configuration file.- Parameters:
configFile
- XML configuration file- Throws:
MDParametersException
- missing or bad XML configuration
-
PFParameters
Creates a new object based on a given configuration string.- Parameters:
XMLconfig
- XML configuration string- Throws:
MDParametersException
- missing or bad XML configuration
-
-
Method Details
-
fromString
Sets parameters from a string representation. This method processes an XML format, but derived classes may overload this and define their own format.- Overrides:
fromString
in classMDParameters
- Parameters:
parameterString
- parameters in string- Throws:
MDParametersException
- when the parameter string is not well-formed
-
fromFile
Sets parameters from an XML file. Derived classes may overload this and define their own format (even other than XML).- Overrides:
fromFile
in classMDParameters
- Parameters:
parameterFile
- initialized configuration file- Throws:
MDParametersException
- failed to process parameter file
-
toString
Returns the XML configuration in a string.- Overrides:
toString
in classMDParameters
- Returns:
- configuration string in XML format
-
setParameters
Sets parameters from an XML string representation overwriting all previous parameters settings with the new ones.- Overrides:
setParameters
in classMDParameters
- Parameters:
parametersString
- parameters in string- Throws:
MDParametersException
- when the parameter string is not well-formed
-
setParameters
Sets parameters from an XML file representation overwriting all previous settings with the new ones.- Overrides:
setParameters
in classMDParameters
- Parameters:
parametersFile
- parameter configuration file- Throws:
MDParametersException
- when the parameter string is not well-formed
-
init
protected void init()Initializes the object. Only those fundamental data members are set that do not depend on the XML configuration. -
initGenerator
Creates and initializes aPFGenerator
instance. The generator is stored in theMDParameters
in order to spare memory space: thePFParameter
object is shared among multiple instances ofPharmacophoreFingerprint
, thus they refer to the same generator object too via theparameters
field.- Throws:
MDParametersException
-
initParameters
protected void initParameters()Initializes those data members that depend on the XML configuration but are not directly taken from it.- Overrides:
initParameters
in classMDParameters
-
generate
Calls the correspondingMolecularDescriptorGenerator
and generates the descriptor for the given molecule. Note, that it is the responsibility of subclasses to initialize the generator upfront any call to this function.- Parameters:
m
- a molecular structurecd
- the chemical descriptor generated for the given molecule, an output parameter- Returns:
- names of Molecule Property-s (SDfile tags) set by the generator
- Throws:
MDGeneratorException
- when failed to generate descriptor
-
setCurrentParametrizedMetric
public void setCurrentParametrizedMetric(int metricIndex) Selects a metric to be used. When a metric is selected all properties and related parameters can be retrieved with the getter methods of this class.- Overrides:
setCurrentParametrizedMetric
in classMDParameters
- Parameters:
metricIndex
- index of a parametrized metric to be used
-
setFuzzySmoothingFactor
public void setFuzzySmoothingFactor(float fsf) Sets the value of the fuzzy smoothing factor used in the generation of fuzzy fingerprints. The smoothing process fits a Guassian curve to all values to be put into histogram bins, and this value is the standard deviation of the corresponding normal distribution.- Parameters:
fsf
- the fuzzy smoothing factor
-
setFuzzyCutOff
public void setFuzzyCutOff(float fco) -
setFuzzyLowerBound
public void setFuzzyLowerBound(int flb) -
setSymmetricalFuzzy
public void setSymmetricalFuzzy(boolean sf) -
setIgnoreRotatableBonds
public void setIgnoreRotatableBonds(boolean irb) -
setFuzzinessFactor
public void setFuzzinessFactor(float alpha) Sets the value of the fuzziness factor, used in fuzzy bipolar pharmacophore autocorrelogram (FBPA) based convolution product in non-parametrized (generic) FBPA metric.- Parameters:
alpha
- fuzziness factor used in the convolution product
-
setUsePMAP
Pharmacophore map data in imported Molecule-s should be used. The generator will not generate PMAP data, but will use the exsiting one.- Parameters:
pmapTagName
- name of the SDfile tag storing the PMAP data
-
setPMAPTagName
Sets the name of the pharmacophore map tag to be considered in the input SDfile.- Parameters:
tagName
- name of the SDfile tag that stores pharmacophore map
-
setWeights
public void setWeights(float[] featureWeights, float[] distanceWeights) Sets the cell-wise weight factors for the current parametrized metric.- Parameters:
featureWeights
- weights for each pharmacophore typedistanceWeights
- weights for each point pair distance
-
getNumberOfFeatures
public int getNumberOfFeatures()The number of pharmacophore features (aka types, properties) used in the fingerprints.- Returns:
- number of pharmacophore features
-
getPSymbols
public chemaxon.pharmacophore.PSymbols getPSymbols()Gets the pharmacophore type symbols used in the current configuration.- Returns:
- pharmacophore symbols
-
getSymbol
Gets the symbol associated with the specified Pharmacophore.- Parameters:
i
- index of the pharmacophore property (feature)- Returns:
- the symbol string of the feature
-
getSymbolIndex
Gets the index associated with the specified Pharmacophore symbol.- Parameters:
symbol
- pharmacophore property (feature) symbol- Returns:
- the index of the feature, -1 if not found
- Throws:
IllegalArgumentException
-
getMinDist
public int getMinDist()Gets the minimum distance of pharmacophore points that is considered.- Returns:
- value of minimal distance
-
getMaxDist
public int getMaxDist()Gets the miximum distance of pharmacophore points that is considered.- Returns:
- value of miximal distance
-
getResolution
public int getResolution()Gets the resolution of histograms. THe resolution is the width of a histogram bin, that is the distance values considered to be indistinguishable.- Returns:
- value of the current resolution
-
getNDists
public int getNDists()Gets the number of histograms bars, that is that number of distance values disguished.- Returns:
- hte number of ditance values in one histogram
-
getNumberOfWeights
Gets the number of weight factors used by the specified metric. This method can be applied to the dissimilarity metrics provided by theMolecularDescriptor
class or its derived classes, but not to parametrized metric.- Overrides:
getNumberOfWeights
in classMDParameters
- Parameters:
parametrizedMetricIndex
- parametrized metric index- Returns:
- number of weights the metric uses
- Throws:
IllegalArgumentException
- if the given parameter is not a valid metric index
-
isFuzzyFingerprint
public boolean isFuzzyFingerprint()Gets whether the fuzzy fingerprints are used or not.- Returns:
- fuzzy flag
-
isGaussianSmoothing
public boolean isGaussianSmoothing()Gets if fuzzy smmoothing is based on normal distribution.- Returns:
- true if smoothing has gaussian
-
getFuzzySmoothingFactor
public float getFuzzySmoothingFactor()Gets the fuzzy smoothing factor used in the construction of fuzzy fingerprints.- Returns:
- the fuzzy smoothing factor
-
isSymmetricalFuzzy
public boolean isSymmetricalFuzzy() -
getIgnoreRotatableBonds
public boolean getIgnoreRotatableBonds() -
getFuzzyLowerBound
public float getFuzzyLowerBound() -
getDefaultStandardizerConfiguration
Gets the default configuration of the standardizer. This method is called if no standardizer configuration is set in the parameters configuration. The default standardization forPharmacophoreFingerprint
includes aromatization and the removal of small disjoint fragments (largest component is kept only).- Returns:
- standardizer confiuration XML string
- Since:
- JChem 2.3
-
getDefaultDocumentFrame
Gets the default XML configuration string. This is needed when the PharmacophoreFingerprint XML configuration is not specified.- Overrides:
getDefaultDocumentFrame
in classMDParameters
- Returns:
- default XML configuration string of the PFParameters class
- Since:
- JChem 2.3
-
getGaussianFuzzyIncrements
public float[][] getGaussianFuzzyIncrements()Gets fuzzy increments for the fuzzy smoothing with a normal distribution. The first index of the raturned array is the number of rotatable bonds, while the second is the distance of the two pharmacophoric points considered. These values represent a normal distribution with a standard deviation equal to the product of the number of rotatable bonds plus one and the fuzzy smoothing factor. Therefore, the more rotatable bonds are found between the two pharmacophoric point, the larger the deviation of the normal distribution is (the wider and flatter the Gaussian curve is). Note, that the normal distribution is standard (its peek is in zero), and that only the right hand side of the curve is returned (the left is a mirror image).- Returns:
- fuzzy increments
-
getCustomFuzzyIncrements
public float[] getCustomFuzzyIncrements(int fa, int fb) Gets fuzzy increments for the fuzzy smoothing with a user-defined custom distribution (fuzzy smoothing vector).- Parameters:
fa
- pharmacophore point type of the first component of the pairfb
- pharmacophore point type of the second component of the pair- Returns:
- fuzzy increments
-
getFuzzyExponents
public float[][] getFuzzyExponents()Gets the precalculated fuzzy exponent used in the Fuzzy Bipolar Autocorrelogram (FBPA) metric.- Returns:
- fuzzy exponents
-
getAtomSetColors
Gets the coloring of atoms as defined in the XML configuration file.- Returns:
- array of colors of different pharmacophore point types
-
getAtomSetNames
Gets the name of atom sets (pharmacophore point type set), as defined in the configuration file/string.- Returns:
- names of pharmacophore point types
-
getAtomSetIndexes
Gets the individual atom colors by pharmcophore point type. This method is implemented in this class because it calls the pharmacophore fingerprint generator.- Parameters:
m
- a molecule to assign pharmacophore point colors to- Returns:
- array of color indexes indexed by atom indexes
-
processDocument
Description copied from class:MDParameters
Searches the DOM tree for relevant nodes and sets internal variables to some these nodes for the sake of easier information processing.- Overrides:
processDocument
in classMDParameters
- Parameters:
all
- process the complete document or only theScreeningConfiguration
tag- Throws:
MDParametersException
-
readValues
Description copied from class:MDParameters
Picks attribute values from the document tree that are relevant to the actualMDParameters
sub-class.- Overrides:
readValues
in classMDParameters
- Parameters:
all
- process the complete document or only theScreeningConfiguration
tag- Throws:
MDParametersException
-
readMetricParameters
Description copied from class:MDParameters
Processes allParametrizedMetric
nodes in the DOM tree. Reads parameterized metric names and associated parameter setting and stores them in data member for faster and easier access in getter methods.- Overrides:
readMetricParameters
in classMDParameters
- Throws:
MDParametersException
- if one of the nodes is not well-formed
-
readMetricWeights
protected void readMetricWeights(org.dom4j.Element parametrizedMetric, int metricIndex) throws MDParametersException - Overrides:
readMetricWeights
in classMDParameters
- Throws:
MDParametersException
-
appendParametrizedMetric
Description copied from class:MDParameters
Extends internal data with a new parametrized metric. Neither the DOM tree nor the XML document is modified.- Overrides:
appendParametrizedMetric
in classMDParameters
- Parameters:
name
- name of the parametrized metricmetric
- dissimilarity metric name (as defined in its implementor class
-
importNodes
protected boolean importNodes(org.dom4j.Document doc, boolean merge) Description copied from class:MDParameters
Imports nodes from the specifiedDocument
into the current (main)Document
. New nodes can either merged into the existing ones without removing them, or new nodes may overwrite exisiting nodes.- Overrides:
importNodes
in classMDParameters
- Parameters:
doc
- import nodes from this documentmerge
- merge (add new) or overwrite (replace with new) existing nodes
-
tripletIndex
public int tripletIndex(int f1, int f2, int d) Calculates the index of a feature pair and distance triplet in the linearised representation of histograms. Features are ordered as described above.- Parameters:
f1
- index of the first featuref2
- index of the other featured
- distance value- Returns:
- the index of the triplet
-