Return to RXN for Chemistry Documentation

RXN for Chemistry - Predict retrosynthesis

This guide is a tutorial to learn how to predict a retrosynthesis using RXN for Chemistry.

+ Expand All

- Close All

Start Tutorial

An RXN for Chemistry account is required to complete this tutorial.
  • Number of steps in this tutorial: 28

  • Estimated time to complete this tutorial: 15 minutes


Step 1: Log into RXN for Chemistry
See our RXN for Chemistry Quick Start for directions on logging into the RXN for Chemistry system.
Step 2: From the RXN for Chemistry Home screen, click "Predict retrosynthetic routes"

Screenshot of: Click "Predict retrosynthetic routes"

Step 3: Select "Add to pre-existing project" or "Add to a brand new project"

If selecting Add to pre-existing project then use the dropdown labeled "Filter" to select the project.

Screenshot of: Click the "Filter…​" field.

Step 4: Select "Target molecule" to specify a molecule
If you have a file containing SMILES strings you may use the "From file" option to upload that file.

Screenshot of: Click here.

Step 5: Click "Predict retrosynthetic routes"

Screenshot of: Click "Predict retrosynthetic routes"

Step 6: In the visual editor, click "Smiles string editor" at the top of the page

Screenshot of: Click "Smiles string editor"

Step 7: Copy and paste a SMILES string into the text box.

On this page, you will paste a SMILES string representing the "target" molecule for the retrosynthesis. Use the one shown below.

The following SMILES string is for a molecule named paliperidone palmitate (brand name Invega Sustenna).

CCCCCCCCCCCCCCCC(=O)OC1CCCN2C1=NC(=C(C2=O)CCN3CCC(CC3)C4=NOC5=C4C=CC(=C5)F)C

This compound is the active ingredient in medication used to treat schizophrenia and other psychotic disorders.

Screenshot of: Click this field.

Step 8: Click "Validate string smiles" to validate the structure in the SMILES string

Screenshot of: Click "Validate string smiles"

Step 9: Click "Load in visual editor" to view a visual model of the molecule

Screenshot of: Click "Load in visual editor"

Step 10: After viewing the molecule in the visual editor, click "Predict retrosynthetic route"

Screenshot of: Click "Predict retrosynthetic route"

Step 11: Select "Automatic mode" in the dialog box

There is no difference in the outcome of the analysis between Automatic mode and Interactive mode, but they each provide a different way of interacting with the results.

Automatic mode: All of the most likely reactions and their reagents are predicted and presented in a list, allowing you to navigate and view each of the predicted retrosynthetic routes.

Interactive mode: The interactive mode allows you to select step-by-step the reagents and reaction steps that you would like to use.

Screenshot of: Click here.

Step 12: Under "Choose AI model category" select RXN

After selecting the RXN model, select the first option under "Choose AI model version".

AIZ and RXN are both machine learning models that have been trained on slightly different data sets. Both of these models are powerful tools for retrosynthetic analysis.

After selecting the AI model version, you may move the slide control labeled "Speed/Quality Tuning" to set a balance between how rapidly results are completed versus how exhaustive the search is.

Screenshot of: Click here.

Step 13: Click the "Show Advanced Options" check box

Adjust the MSSR slide controller to a value between 2 and 15.

MSSR is the Minimum Sum of Squared Running Digital Sum. Lower values indicate that you want only those results that more closely fit the underlying machine learning models. Higher values indicate that you want to include analyses that have a higher discrepancy between the data and machine learning models.

Screenshot of: Click "Show Advanced Options"

Step 14: Click "Predict Retrosynthetic Route" to begin the analysis

Screenshot of: Click "Predict Retrosynthetic Route"

Step 15: Click "Back to Retrosynthetic Routes"

This will take you back to a page containing a listing of any previously computed retrosynthesis analyses.

The computation of all of the retrosynthesis reactions may take some time (up to 10 minutes) depending on several factors, including (but not limited to) molecular complexity, MSSR setting, and selection of machine learning model.

Screenshot of: Click "Back to Retrosynthetic Routes"

Step 16: Start from the RXN for Chemistry Home view
The retrosynthesis computations may take several minutes. You can log off from RXN for Chemistry anytime and return to the Home page later to view the results.

Navigate to the RXN for Chemistry Home page at https://rxn.app.accelerate.science/rxn/home.

If you are already logged into RXN for Chemisty click on the "Home" link in the left-hand sidebar.

Step 17: From Home page, click "Projects"

Screenshot of: Click "Projects"

Step 18: Click on the Project containing the retrosynthesis analysis.

Screenshot of: Click "Ray Lopez"

Step 19: Select "AI Predictions" from the top menu, then "Retrosynthetic routes"

Screenshot of: Click "Retrosynthetic routes"

Step 20: From the Retrosynthetic routes listing select the analysis you wish to view.

Screenshot of: Click "demo project 1_20230922_18:29:46.815"

Step 21: Note the Confidence score at the top center of the page, then click on the zoom buttons for zooming in and out
What is confidence?

Predictions derived from machine learning models will often come with a computed confidence level. This number represents the machine’s estimate of the probability that the model’s prediction is correct, and that the predicted reaction will proceed as shown.

Since it is a probability score the value of the confidence is usually given as a number between 0 and 1. Confidence scores closest to 1 are considered the best. However, the relative utility of confidence scores depends very heavily on the use case and nature of the training machine learning model. A confidence score of 0.71 might be considered unacceptable in one use case, but acceptable in another.

Screenshot of: Click here.

Step 22: Click on the icon shown below to view Alternative Reactions (if any) and then close Alternative Reactions sidebar
Alternative Reactions are those reactions that may occur with lower probabiliity than the main reactions shown.

Screenshot of: Click here.

Step 23: Click on the dropdown list in the upper left of the screen

This option will allow you to view the different reaction sequences identified by RXN for Chemistry.

Screenshot of: Click here.

Step 24: Click on the TMAP icon

This will take you to a visual TMAP showing your retrosynthesis results in the context of other chemical structures and properties.

What is a TMAP?

TMAP (TreeMAP) is a set of algorithms to help visualize high-dimensional data sets containing chemical structures. It visualizes such data while preserving both global and local features with a sufficient level of detail to allow for human inspection and interpretation.

For more information on TMAPs, see the following:

Screenshot of: Click here.

Step 25: After reading the TMAP guidance, click "Continue with TMAP"

Screenshot of: Click "Continue with TMAP"

Step 26: Click on the +/- zoom icons to zoom in and out of TMAP
The legend on the right hand side of the TMAP shows you the colors used to represent your reactions most closely related neighbors in the TMAP.

Screenshot of: Click this icon.

Step 27: Click on the "X" in the upper right of the screen to close the TMAP

After you have finished exploring the TMAP, close it to return to the reactions view.

Screenshot of: Click here.

Step 28: Tutorial completed

This completes the Predict Retrosynthesis tutorial for RXN for Chemistry.

In this tutorial, we:

  • Executed a retrosynthesis analysis

  • Viewed the results in a Project

  • Viewed the details of the predicted retrosynthesis reactions

  • Viewed the TreeMAP of retrosynthesis reactions




IBM®, the IBM® logo, and ibm.com® are trademarks or registered trademarks of IBM® Corporation, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM® or other companies. A current list of IBM® trademarks is available here: Copyright and Trademark Information.