rxn.reaction_preprocessing.config.StandardizeConfig
- class rxn.reaction_preprocessing.config.StandardizeConfig(input_file_path='${rxn_import.output_csv}', annotation_file_paths=<factory>, discard_unannotated_metals=False, output_file_path='${data.proc_dir}/${data.name}.standardized.csv', fragment_bond='${common.fragment_bond}', reaction_column_name='${common.reaction_column_name}', remove_stereo_if_not_defined_in_precursors=False, keep_intermediate_columns='${common.keep_intermediate_columns}')[source]
Bases:
objectConfiguration for the standardization transformation step.
- Fields:
input_file_path: The input CSV (one SMILES per line). output_file_path: The output file path containing the result after standardization. annotation_file_paths: The files to load the annotated molecules from. discard_unannotated_metals: whether reactions containing unannotated
molecules with transition metals must be rejected.
fragment_bond: Token used to denote a fragment bond in the reaction SMILES. reaction_column_name: Name of the reaction column for the data file. remove_stereo_if_not_defined_in_precursors: Remove chiral centers from product. keep_intermediate_columns: Whether the columns generated during preprocessing should be kept.
- Parameters
input_file_path (
str, default:'${rxn_import.output_csv}') –annotation_file_paths (
List[str], default:<factory>) –discard_unannotated_metals (
bool, default:False) –output_file_path (
str, default:'${data.proc_dir}/${data.name}.standardized.csv') –fragment_bond (
FragmentBond, default:'${common.fragment_bond}') –reaction_column_name (
str, default:'${common.reaction_column_name}') –remove_stereo_if_not_defined_in_precursors (
bool, default:False) –keep_intermediate_columns (
bool, default:'${common.keep_intermediate_columns}') –
- __init__(input_file_path='${rxn_import.output_csv}', annotation_file_paths=<factory>, discard_unannotated_metals=False, output_file_path='${data.proc_dir}/${data.name}.standardized.csv', fragment_bond='${common.fragment_bond}', reaction_column_name='${common.reaction_column_name}', remove_stereo_if_not_defined_in_precursors=False, keep_intermediate_columns='${common.keep_intermediate_columns}')
- Parameters
input_file_path (
str, default:'${rxn_import.output_csv}') –annotation_file_paths (
List[str], default:<factory>) –discard_unannotated_metals (
bool, default:False) –output_file_path (
str, default:'${data.proc_dir}/${data.name}.standardized.csv') –fragment_bond (
FragmentBond, default:'${common.fragment_bond}') –reaction_column_name (
str, default:'${common.reaction_column_name}') –remove_stereo_if_not_defined_in_precursors (
bool, default:False) –keep_intermediate_columns (
bool, default:'${common.keep_intermediate_columns}') –
- Return type
None
Methods
__init__([input_file_path, ...])- param input_file_path
Attributes
discard_unannotated_metalsfragment_bondinput_file_pathkeep_intermediate_columnsoutput_file_pathreaction_column_nameremove_stereo_if_not_defined_in_precursorsannotation_file_paths