chemprop.data.datapoints#
Attributes#
Classes#
A |
|
A |
|
A |
|
A |
Module Contents#
- chemprop.data.datapoints.MoleculeFeaturizer#
- class chemprop.data.datapoints.MoleculeDatapoint[source]#
Bases:
_DatapointMixin,_MoleculeDatapointMixinA
MoleculeDatapointcontains a single molecule and its associated features and targets.- V_f: numpy.ndarray | None = None#
A numpy array of shape
V x d_vf, whereVis the number of atoms in the molecule, andd_vfis the number of additional features that will be concatenated to atom-level features before message passing
- E_f: numpy.ndarray | None = None#
A numpy array of shape
E x d_ef, whereEis the number of bonds in the molecule, andd_efis the number of additional features containing additional features that will be concatenated to bond-level features before message passing
- V_d: numpy.ndarray | None = None#
A numpy array of shape
V x d_vd, whereVis the number of atoms in the molecule, andd_vdis the number of additional descriptors that will be concatenated to atom-level descriptors after message passing
- class chemprop.data.datapoints.LazyMoleculeDatapoint[source]#
Bases:
_DatapointMixin,_LazyMoleculeDatapointMixinA
LazyMoleculeDatapointcontains a single SMILES string, and all attributes need to form a rdkit.Chem.Mol object. The molecule is computed lazily when the attribute mol is accessed.- V_f: numpy.ndarray | None = None#
A numpy array of shape
V x d_vf, whereVis the number of atoms in the molecule, andd_vfis the number of additional features that will be concatenated to atom-level features before message passing
- E_f: numpy.ndarray | None = None#
A numpy array of shape
E x d_ef, whereEis the number of bonds in the molecule, andd_efis the number of additional features containing additional features that will be concatenated to bond-level features before message passing
- V_d: numpy.ndarray | None = None#
A numpy array of shape
V x d_vd, whereVis the number of atoms in the molecule, andd_vdis the number of additional descriptors that will be concatenated to atom-level descriptors after message passing
- class chemprop.data.datapoints.MolAtomBondDatapoint[source]#
Bases:
MoleculeDatapointA
MoleculeDatapointcontains a single molecule and its associated features and targets.- E_d: numpy.ndarray | None = None#
A numpy array of shape
E x d_ed, whereEis the number of bonds in the molecule, andd_edis the number of additional descriptors that will be concatenated to edge-level descriptors after message passing
- atom_y: numpy.ndarray | None = None#
A numpy array of shape
V x v_t, whereVis the number of atoms in the molecule, andv_tis the number of atom targets. The order of atoms in the array should match the order of atoms in the mol. Unknown targets are indicated by `nan`s.
- atom_gt_mask: numpy.ndarray | None = None#
Indicates whether the atom targets are an inequality regression target of the form <x
- atom_lt_mask: numpy.ndarray | None = None#
Indicates whether the atom targets are an inequality regression target of the form >x
- bond_y: numpy.ndarray | None = None#
A numpy array of shape
E x e_t, whereVis the number of bonds in the molecule, ande_tis the number of bond targets. The order of bonds in the array should match the order of bonds in the mol. Unknown targets are indicated by `nan`s.
- bond_gt_mask: numpy.ndarray | None = None#
Indicates whether the bond targets are an inequality regression target of the form <x
- bond_lt_mask: numpy.ndarray | None = None#
Indicates whether the bond targets are an inequality regression target of the form >x
- atom_constraint: numpy.ndarray | None = None#
A numpy array of shape
1 x v_tcontaining the values that the atom property predictions should be constrained to sum to, with np.nan indicating no constraint for that property
- bond_constraint: numpy.ndarray | None = None#
A numpy array of shape
1 x e_tcontaining the values that the bond property predictions should be constrained to sum to, with np.nan indicating no constraint for that property