Molecules input
Every single virtual screening campaign initiates with the identification of a chemical space that is intended to be explored. This chemical space may vary significantly in size, ranging from a single molecule to a large set of hundred millions compounds.
In this respect, the most simple procedure to read a set of compound into TidyScreen is by using a .csv file structured in the following way:
field_#1: SMILES notation indicating the structure of the molecule/s to be read.
field_#2: name of the molecule/s
field_#3: Flag used for general purposes (such as filtering, etc)
In order to read a .csv file, the corresponding function should be invoked using the generated chemspace_object as follows:
# Step #1: It is assumed that TidyScreen packages have been activated
# Step #2: A project named `tutorial` has been generated
# Step #3: A chemspace object named `tutorial_chemspace` has been created
$ tutorial_chemspace.input_csv($PATH_TO_FILE)
When the above function is executed, the first line of the .csv file is written to the console, asking the user to indicate if the files has a header or not (y/n).
Once the input files containing valid SMILES notations, a table named after the read .csv file will be created in the chemspace.db.
Molecules tables contained within the chemspace.db can be listed as follows:
# Step #1: It is assumed that TidyScreen packages have been activated
# Step #2: A project named `tutorial` has been generated
# Step #3: A chemspace object named `tutorial_chemspace` has been created
$ tutorial_chemspace.list_ligand_tables()
Also a table can be deleted from chemspace.db using:
# Step #1: It is assumed that TidyScreen packages have been activated
# Step #2: A project named `tutorial` has been generated
# Step #3: A chemspace object named `tutorial_chemspace` has been created
$ tutorial_chemspace.delete_table(TABLE_NAME)
In case a 2D depiction of compounds stored in a given table is required, the following function can be invoked:
# Step #1: It is assumed that TidyScreen packages have been activated
# Step #2: A project named `tutorial` has been generated
# Step #3: A chemspace object named `tutorial_chemspace` has been created
$ tutorial_chemspace.depict_ligand_table(TABLE_NAME)
The depiction of a table of molecules will generate a set of .png files that will be stored within the $PROJECT_PATH/chemspace/misc/$TABLE_NAME folder.
Frequently, during virtual screening campaigns a high number of molecules are stored within a table. Consequently, depicting all of them may lead to a large number or .png files. In those cases, the following are usefull
- max_mols_ppage: indicates the number of depictions included per
.pngfile. Default:25 - limit: set the maximum number of molecules to depict. Default:
0; depicts all molecules stored in the table. - random: If set to
Truewill depict the required molecules indicated with limit picking a randomized selection. Default:False