Package chemaxon.clustering
Class Compare
java.lang.Object
chemaxon.clustering.Common
chemaxon.clustering.Compare
Compares two sets of objects (like compound libraries)
using diversity and dissimilarity calculations.
This class can be used from the command line or through the API.
See the documentation for details.
An example for running Compare:
    ConnectionHandler ch;
    String selectString1;
    String selectString2;
    ...
    Compare cp = new Compare();
    cp.setInput1(ch, selectString1);
    cp.setInput2(ch, selectString2);
    cp.setOutput(System.out);
    cp.setDimensions(0);
    cp.setFpSize(512);
    cp.setStatNeeded(true);
    cp.setOnlyStat(false);
    cp.setStatStream(System.out);
    cp.setIdGeneration(true);
    cp.setThreshold(0.1f);
    cp.setOnlyDissimilarListed(true);
    cp.setDifferentIdsOnly(true);
    cp.run();
Since:
JChem 1.6.2
-
Constructor Summary
Constructors
Compare()
Method Summary
Modifier and Type  Method                                               Description
int                getMaxSimilar()                                      Gets the maximum allowed number of similar objects for a compound.
float              getThreshold()                                       Gets the threshold value.
boolean            isDifferentIdsOnly()                                 Indicates whether only compounds with different IDs are compared.
boolean            isOnlyDissimilarListed()                             Indicates whether only dissimilar objects are printed.
boolean            isOrderSimilarObjects()                              Indicates whether the list of similar objects is ordered by distance (closest first).
boolean            isSimilarListed()                                    Indicates whether similar objects are listed from the first set.
void               run()                                                Starts processing.
void               setDifferentIdsOnly(boolean b)                       Specifies whether compounds with identical IDs should be compared.
void               setInput1(ConnectionHandler conh, String querySQL)   Sets the first compound library (first set) as an SQL query.
void               setInput1(File file)                                 Sets the first compound library (first set) as a file.
void               setInput1(InputStream is)                            Sets the first compound library (first set) as an InputStream.
void               setInput1(String fileName)                           Sets the first compound library (first set) as a file, specified by the file's path.
void               setInput2(ConnectionHandler conh, String querySQL)   Sets the second compound library (second set) as an SQL query.
void               setInput2(File file)                                 Sets the second compound library (second set) as a file.
void               setInput2(InputStream is)                            Sets the second compound library (second set) as an InputStream.
void               setInput2(String fileName)                           Sets the second compound library (second set) as a file, specified by the file's path.
void               setMaxSimilar(int maxSimilar)                        Sets the maximum allowed number of similar objects for a compound.
void               setOnlyDissimilarListed(boolean b)                   Specifies whether only dissimilar objects are to be printed.
void               setOrderSimilarObjects(boolean orderSimilarObjects)  Specifies whether the list of similar objects should be ordered by distance (closest first).
void               setSimilarListed(boolean b)                          Specifies whether similar objects are to be listed from the first set.
void               setThreshold(float threshold)                        Sets the threshold value.
Methods inherited from class chemaxon.clustering.Common
getDimensions, getElapsedTime, getFpSize, getIdGeneration, getStartTime, getStatStream, getWeights, isOnlyStat, isStatNeeded, setDimensions, setFpSize, setIdGeneration, setOnlyStat, setOutput, setOutput, setOutput, setOutput, setStatNeeded, setStatStream, setWeights
-
Constructor Details
-
Compare
public Compare()
-
-
Method Details
-
setInput1
public void setInput1(ConnectionHandler conh, String querySQL) throws SQLException
Sets the first compound library (first set) as an SQL query.
Parameters:
conh - ConnectionHandler object opened to the database
querySQL - an SQL SELECT statement used as data source
Throws:
SQLException
-
setInput2
public void setInput2(ConnectionHandler conh, String querySQL) throws SQLException
Sets the second compound library (second set) as an SQL query.
Parameters:
conh - ConnectionHandler object opened to the database
querySQL - an SQL SELECT statement used as data source
Throws:
SQLException
-
setInput1
public void setInput1(File file) throws FileNotFoundException
Sets the first compound library (first set) as a file.
Parameters:
file - the File to be used for input
Throws:
FileNotFoundException
-
setInput2
public void setInput2(File file) throws FileNotFoundException
Sets the second compound library (second set) as a file.
Parameters:
file - the File to be used for input
Throws:
FileNotFoundException
-
setInput1
public void setInput1(String fileName) throws FileNotFoundException
Sets the first compound library (first set) as a file, specified by the file's path.
Parameters:
fileName - the path of the input file
Throws:
FileNotFoundException
-
setInput2
public void setInput2(String fileName) throws FileNotFoundException
Sets the second compound library (second set) as a file, specified by the file's path.
Parameters:
fileName - the path of the input file
Throws:
FileNotFoundException
-
setInput1
public void setInput1(InputStream is)
Sets the first compound library (first set) as an InputStream.
Parameters:
is - the InputStream to be used for input
-
setInput2
public void setInput2(InputStream is)
Sets the second compound library (second set) as an InputStream.
Parameters:
is - the InputStream to be used for input
-
setThreshold
public void setThreshold(float threshold)
Sets the threshold value.
Parameters:
threshold - the threshold value to be set
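This page does not say which metric the threshold is applied to. As a rough illustration of how a dissimilarity threshold can work, the sketch below assumes Tanimoto dissimilarity on binary fingerprints, and assumes that a compound counts as dissimilar when no member of the other set lies closer than the threshold. Both assumptions, and every name in the block (ThresholdSketch, dissimilarity, isDissimilar, bits), are hypothetical and are not taken from the Compare implementation.

```java
import java.util.BitSet;
import java.util.List;

// Illustrative sketch only; not part of chemaxon.clustering.
class ThresholdSketch {

    // Assumed metric: Tanimoto dissimilarity, 1 - |a AND b| / |a OR b|.
    static float dissimilarity(BitSet a, BitSet b) {
        BitSet and = (BitSet) a.clone();
        and.and(b);
        BitSet or = (BitSet) a.clone();
        or.or(b);
        int union = or.cardinality();
        if (union == 0) {
            return 0f; // two empty fingerprints are treated as identical
        }
        return 1f - (float) and.cardinality() / union;
    }

    // Assumed rule: the query is dissimilar to the set if every member
    // is at least `threshold` away from it.
    static boolean isDissimilar(BitSet query, List<BitSet> set, float threshold) {
        for (BitSet member : set) {
            if (dissimilarity(query, member) < threshold) {
                return false;
            }
        }
        return true;
    }

    // Helper that builds a 512-bit fingerprint with the given bits set.
    static BitSet bits(int... positions) {
        BitSet fp = new BitSet(512);
        for (int p : positions) {
            fp.set(p);
        }
        return fp;
    }

    public static void main(String[] args) {
        List<BitSet> firstSet = List.of(bits(1, 2, 3, 4), bits(10, 11, 12));
        // Identical to a member of the first set: not dissimilar.
        System.out.println(isDissimilar(bits(1, 2, 3, 4), firstSet, 0.1f)); // false
        // Shares no bits with any member: dissimilar.
        System.out.println(isDissimilar(bits(100, 101), firstSet, 0.1f));   // true
    }
}
```

Under these assumptions, a small threshold such as 0.1f flags only compounds that have no near-duplicate in the other set.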
-
getThreshold
public float getThreshold()
Gets the threshold value.
Returns:
the threshold value currently set
-
setOnlyDissimilarListed
public void setOnlyDissimilarListed(boolean b)
Specifies whether only dissimilar objects are to be printed.
Parameters:
b - if set to true, only dissimilar objects are printed; otherwise all objects are listed
-
isOnlyDissimilarListed
public boolean isOnlyDissimilarListed()
Indicates whether only dissimilar objects are printed.
Returns:
true if only dissimilar objects are printed, false otherwise
-
setSimilarListed
public void setSimilarListed(boolean b)
Specifies whether similar objects are to be listed from the first set.
Parameters:
b - if set to true, similar objects are printed from the first set
-
isSimilarListed
public boolean isSimilarListed()
Indicates whether similar objects are listed from the first set.
Returns:
true if similar objects are printed from the first set
-
setDifferentIdsOnly
public void setDifferentIdsOnly(boolean b)
Specifies whether compounds with identical IDs should be compared.
Parameters:
b - if set to true, only compounds with different IDs are compared; otherwise all compounds are compared
-
isDifferentIdsOnly
public boolean isDifferentIdsOnly()
Indicates whether only compounds with different IDs are compared.
Returns:
true if compounds with identical IDs are not compared
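The effect of this flag on which pairs get compared can be sketched as a simple filter. The block below is an illustration under assumptions: IDs are modeled as strings, and the class and method names (DifferentIdsSketch, shouldCompare, countComparedPairs) are hypothetical rather than part of the Compare API.

```java
import java.util.List;

// Illustrative sketch only; not part of chemaxon.clustering.
class DifferentIdsSketch {

    // A pair is skipped only when the flag is on and both IDs match.
    static boolean shouldCompare(String id1, String id2, boolean differentIdsOnly) {
        return !(differentIdsOnly && id1.equals(id2));
    }

    // Counts how many (first set, second set) pairs would be compared.
    static int countComparedPairs(List<String> ids1, List<String> ids2,
                                  boolean differentIdsOnly) {
        int count = 0;
        for (String a : ids1) {
            for (String b : ids2) {
                if (shouldCompare(a, b, differentIdsOnly)) {
                    count++;
                }
            }
        }
        return count;
    }

    public static void main(String[] args) {
        List<String> first = List.of("1", "2", "3");
        List<String> second = List.of("2", "3", "4");
        System.out.println(countComparedPairs(first, second, false)); // 9
        System.out.println(countComparedPairs(first, second, true));  // 7
    }
}
```

A filter like this would matter most when the two inputs overlap, for example when a library is compared against itself, since each compound would otherwise match its own entry at distance zero.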
-
setMaxSimilar
public void setMaxSimilar(int maxSimilar)
Sets the maximum allowed number of similar objects for a compound. If set to 0 (default), the number of similar objects is unlimited.
Parameters:
maxSimilar - the maximum number of similar neighbors
-
getMaxSimilar
public int getMaxSimilar()
Gets the maximum allowed number of similar objects for a compound. If 0 (default), the number of similar objects is unlimited.
Returns:
the maximum number of similar neighbors
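The "0 means unlimited" convention can be sketched as follows. This is an illustration only: the class and method names (MaxSimilarSketch, cap) are hypothetical, and the sketch assumes that when the limit applies, the first neighbors encountered are the ones kept, which this page does not confirm.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; not part of chemaxon.clustering.
class MaxSimilarSketch {

    // Returns at most maxSimilar entries; 0 keeps the whole list.
    static <T> List<T> cap(List<T> similar, int maxSimilar) {
        if (maxSimilar <= 0 || similar.size() <= maxSimilar) {
            return new ArrayList<>(similar);
        }
        // Assumption: keep the first maxSimilar entries in list order.
        return new ArrayList<>(similar.subList(0, maxSimilar));
    }

    public static void main(String[] args) {
        List<String> similar = List.of("a", "b", "c");
        System.out.println(cap(similar, 0)); // [a, b, c]
        System.out.println(cap(similar, 2)); // [a, b]
    }
}
```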
-
setOrderSimilarObjects
public void setOrderSimilarObjects(boolean orderSimilarObjects)
Specifies whether the list of similar objects should be ordered by distance (closest first).
Parameters:
orderSimilarObjects - if set to true, the list of similar objects is ordered by distance in the output. The default value is false.
-
isOrderSimilarObjects
public boolean isOrderSimilarObjects()
Indicates whether the list of similar objects is ordered by distance (closest first).
Returns:
true if the list of similar objects will be ordered by distance in the output. The default value is false.
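"Closest first" ordering amounts to an ascending sort on distance. The sketch below shows one way to express it; the Neighbor type and the closestFirst method are hypothetical names introduced for illustration and do not reflect how Compare stores or prints its results.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Illustrative sketch only; not part of chemaxon.clustering.
class OrderSketch {

    // Hypothetical record of a similar object and its distance.
    static final class Neighbor {
        final String id;
        final float distance;

        Neighbor(String id, float distance) {
            this.id = id;
            this.distance = distance;
        }
    }

    // Returns a copy sorted by ascending distance (closest first).
    static List<Neighbor> closestFirst(List<Neighbor> neighbors) {
        List<Neighbor> sorted = new ArrayList<>(neighbors);
        sorted.sort(Comparator.comparingDouble((Neighbor n) -> n.distance));
        return sorted;
    }

    public static void main(String[] args) {
        List<Neighbor> found = List.of(
                new Neighbor("c7", 0.08f),
                new Neighbor("c2", 0.01f),
                new Neighbor("c5", 0.05f));
        for (Neighbor n : closestFirst(found)) {
            System.out.println(n.id + " " + n.distance);
        }
    }
}
```

The sort is stable, so neighbors at equal distance keep the order in which they were found.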
-
run
Starts processing.
-