rxn.reaction_preprocessing.standardizer.Standardizer
- class rxn.reaction_preprocessing.standardizer.Standardizer(annotations, discard_unannotated_metals, reaction_column_name, fragment_bond=None, remove_stereo_if_not_defined_in_precursors=False, keep_intermediate_columns=False)[source]
Bases:
object- Parameters
annotations (
List[MoleculeAnnotation]) –discard_unannotated_metals (
bool) –reaction_column_name (
str) –fragment_bond (
Optional[str], default:None) –remove_stereo_if_not_defined_in_precursors (
bool, default:False) –keep_intermediate_columns (
bool, default:False) –
- __init__(annotations, discard_unannotated_metals, reaction_column_name, fragment_bond=None, remove_stereo_if_not_defined_in_precursors=False, keep_intermediate_columns=False)[source]
Creates a new instance of the Standardizer class.
- Parameters
annotations (
List[MoleculeAnnotation]) – A list of MoleculeAnnotation objects used to perform the substitutions/rejectionsdiscard_unannotated_metals (
bool) – whether reactions containing unannotated molecules with transition metals must be rejected.reaction_column_name (
str) – The name of the DataFrame column containing the reaction SMILES.fragment_bond (
Optional[str], default:None) – the fragment bond used in the dataframe.remove_stereo_if_not_defined_in_precursors (
bool, default:False) – Remove chiral centers from products.keep_intermediate_columns (
bool, default:False) – Whether the columns generated during preprocessing should be kept.
Methods
__init__(annotations, ...[, fragment_bond, ...])Creates a new instance of the Standardizer class.
standardize_file(input_csv, output_csv)Standardize the reactions in a CSV file.
standardize_iterator(csv_iterator)Standardize the reactions in a CSV iterator.
standardize_one(rxn_smiles)- param rxn_smiles
- standardize_file(input_csv, output_csv)[source]
Standardize the reactions in a CSV file.
- Parameters
input_csv (
Path) – CSV with the reactions to standardize.output_csv (
Path) – CSV where to save the standardized reactions.
- Return type
None
- standardize_iterator(csv_iterator)[source]
Standardize the reactions in a CSV iterator.
Same as
standardize_file, except that it acts directly on the iterator.- Parameters
csv_iterator (
CsvIterator) – input CSV iterator for the reactions to standardize.- Return type
CsvIterator- Returns
CsvIterator with reactions after the standardization step.