You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -12,7 +12,7 @@ The Reaction Decoder Tool (RDT) performs deterministic atom-atom mapping (AAM) f
12
12
13
13
**Key Innovation:** A multi-algorithm ensemble approach with game-theory-inspired matrix optimization. Four complementary mapping algorithms (MAX, MIN, MIXTURE, RINGS) explore different regions of the solution space. A 15-condition decision tree selects the optimal mapping based on bond parsimony, thermodynamic feasibility, and stereochemical preservation.
14
14
15
-
**Benchmark Result:**96.0% atom-level accuracy on the Lin et al. (2022) golden dataset of 1,851 manually curated reactions, outperforming all published tools including RXNMapper (83.74%) and the original RDTool (76.18%), without any training data.
15
+
**Benchmark Result:**99.2% chemically-equivalent atom mapping on the Lin et al. (2022) golden dataset of 1,851 manually curated reactions, outperforming all published deterministic tools including RDTool (76.18%) and ChemAxon (70.45%), without any training data.
16
16
17
17
---
18
18
@@ -122,11 +122,13 @@ For each reactant-product pair *(R_i, P_j)*, compute the Maximum Common Subgraph
122
122
123
123
Three pre-filters eliminate pairs unlikely to share meaningful substructure:
124
124
125
-
| Filter | Condition | Rationale |
126
-
|--------|-----------|-----------|
127
-
|**Identity**| Canonical SMILES equality | Identical molecules still require MCS for correct atom indexing |
|**Fingerprint**|`Tanimoto(FP_i, FP_j) < 0.05` and both > 5 atoms | Structurally unrelated by path fingerprint comparison |
125
+
| Filter | Condition | Action |
126
+
|--------|-----------|--------|
127
+
|**Identity**| MolGraph canonical SMILES equality (stereo-aware) + equal atom count | Build direct identity mapping (atom *i* → atom *i*) and skip MCS entirely. Avoids symmetry-induced spurious bond changes that SMSD can produce for identical molecules. |
|**Fingerprint**|`Tanimoto(FP_i, FP_j) < 0.05` and both > 5 atoms | Skip pair — structurally unrelated by path fingerprint |
130
+
131
+
**Identity pre-filter detail:** Canonical SMILES are generated via `MolGraph.toCanonicalSmiles()` (SMSD 6.9.1), which encodes tetrahedral chirality (`@`/`@@`) and E/Z double-bond geometry (`/`/`\`). This ensures enantiomers and diastereomers are correctly distinguished and routed to MCS rather than short-circuited.
130
132
131
133
#### 5.2 Tiered Substructure Matching
132
134
@@ -313,22 +315,27 @@ isNeeded(reactant) = EXISTS element e :
313
315
### Golden Dataset (Lin et al. 2022)
314
316
315
317
1,851 manually curated reactions with expert-validated atom-atom mappings.
318
+
Published tools are scored on chemically-equivalent atom mapping — whether the mapping correctly identifies bond changes regardless of atom-index labelling.
319
+
320
+
| Tool | Chemically Equivalent | Bond-Change Exact | Mol-Map Exact | Training Data | Deterministic |
**Note on canonical SMILES:** Identity pre-filtering uses `MolGraph.toCanonicalSmiles()` from SMSD 6.9.1 rather than CDK's `SmilesGenerator`. MolGraph's canonicalisation is stereo-aware and internally consistent with SMSD's MCS atom labelling, reducing the dependency on CDK for this step.
352
+
344
353
---
345
354
346
355
## 7. References
@@ -355,5 +364,5 @@ isNeeded(reactant) = EXISTS element e :
355
364
356
365
---
357
366
358
-
*Reaction Decoder Tool is developed and maintained by BioInception Labs.*
367
+
*Reaction Decoder Tool is developed and maintained by BioInception PVT LTD.*
359
368
*Copyright (C) 2003-2026 Syed Asad Rahman. GNU LGPL v3.0.*
0 commit comments