This repository supports the following manuscript.
Jiale Shi, Nathan J. Rebello, Dylan Walsh, Weizhong Zou, Michael E. Deagen, Bruno Salomao Leao, Debra J. Audus, Bradley D. Olsen, "Quantifying Pairwise Similarity for Complex Polymers", Macromolecules, 2023. https://doi.org/10.1021/acs.macromol.3c00761
In this work, we propose a method to quantitatively calculate a pairwise chemical similarity score for polymers. The earth mover’s distance (EMD) is used to calculate the similarity of the repeat units and of the end groups, while the graph edit distance (GED) is used to calculate the similarity of the topology. These three values are then combined to yield an overall pairwise chemical similarity score.
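As a rough illustration of the EMD idea, the sketch below treats the repeat units of each polymer as a discrete distribution (mole fractions summing to 1) and finds the cheapest way to transport one distribution onto the other. It is a minimal pure-Python sketch, not the code used in the notebooks: the function name `earth_movers_distance`, the example compositions, and the ground distance of 0.6 between the two repeat units are all hypothetical, and converting the result to a similarity as `1 - EMD` is an assumption that only makes sense when ground distances lie in [0, 1].

```python
def earth_movers_distance(p, q, dist, eps=1e-12):
    """EMD between discrete distributions p and q (each summing to 1).

    dist[i][j] is the ground distance between item i of p and item j of q.
    Solved as a min-cost flow with successive shortest paths (Bellman-Ford),
    which is only practical for small inputs.
    """
    m, n = len(p), len(q)
    source, sink = m + n, m + n + 1
    n_nodes = m + n + 2
    edges = []                      # [target, residual capacity, cost]
    adj = [[] for _ in range(n_nodes)]

    def add_edge(u, v, cap, cost):
        # Forward edge at an even index, its reverse at the next odd index,
        # so edge k and edge k ^ 1 always form a pair.
        adj[u].append(len(edges)); edges.append([v, cap, cost])
        adj[v].append(len(edges)); edges.append([u, 0.0, -cost])

    for i in range(m):
        add_edge(source, i, p[i], 0.0)
    for j in range(n):
        add_edge(m + j, sink, q[j], 0.0)
    for i in range(m):
        for j in range(n):
            add_edge(i, m + j, float("inf"), dist[i][j])

    total_cost = 0.0
    while True:
        # Cheapest path from source to sink in the residual graph.
        best = [float("inf")] * n_nodes
        via = [-1] * n_nodes
        best[source] = 0.0
        for _ in range(n_nodes):
            changed = False
            for u in range(n_nodes):
                if best[u] == float("inf"):
                    continue
                for k in adj[u]:
                    v, cap, cost = edges[k]
                    if cap > eps and best[u] + cost < best[v] - 1e-15:
                        best[v] = best[u] + cost
                        via[v] = k
                        changed = True
            if not changed:
                break
        if best[sink] == float("inf"):
            break                   # all mass has been transported

        # Find the bottleneck along the path, then push that much flow.
        push, v = float("inf"), sink
        while v != source:
            k = via[v]
            push = min(push, edges[k][1])
            v = edges[k ^ 1][0]
        v = sink
        while v != source:
            k = via[v]
            edges[k][1] -= push
            edges[k ^ 1][1] += push
            v = edges[k ^ 1][0]
        total_cost += push * best[sink]
    return total_cost


# Hypothetical example: two copolymers built from the same two repeat units
# but with different compositions. 0.6 is an illustrative ground distance.
polymer_a = [0.5, 0.5]
polymer_b = [0.25, 0.75]
ground_distance = [[0.0, 0.6],
                   [0.6, 0.0]]

emd = earth_movers_distance(polymer_a, polymer_b, ground_distance)
similarity = 1.0 - emd              # assumed conversion from distance to similarity
print(emd, similarity)
```

For realistic inputs, a dedicated optimal-transport or linear-programming solver would be a more natural choice than this hand-written flow routine.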
The repository is intended for the following use cases:
- Illustrate key ideas from the Method section of the manuscript, including the earth mover's distance and the graph edit distance
- Allow for full reproducibility of the data in the manuscript
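To give a feel for the graph edit distance mentioned above, here is a minimal sketch that computes an exact unit-cost GED between two very small, unlabeled, undirected graphs by trying every node mapping. It is an illustration under stated assumptions rather than the implementation used in the notebooks: the function name `graph_edit_distance_bruteforce` and the toy linear, star, and ring graphs are hypothetical, every node or edge insertion or deletion is assumed to cost 1, and the factorial search only works for a handful of nodes.

```python
from itertools import permutations


def graph_edit_distance_bruteforce(n1, edges1, n2, edges2):
    """Exact unit-cost GED between two small unlabeled, undirected graphs.

    Graph 1 has nodes 0..n1-1 and graph 2 has nodes 0..n2-1. Both node sets
    are padded to the same size; an edge that ends on a padding node counts
    as an insertion or deletion.
    """
    n = max(n1, n2)
    target_edges = {frozenset(e) for e in edges2}
    node_cost = abs(n1 - n2)        # nodes that must be inserted or deleted
    best = None
    for perm in permutations(range(n)):
        # Relabel graph 1 through this mapping and count mismatched edges.
        mapped = {frozenset((perm[u], perm[v])) for u, v in edges1}
        cost = node_cost + len(mapped ^ target_edges)
        if best is None or cost < best:
            best = cost
    return best


# Hypothetical topology graphs with four nodes each.
linear = [(0, 1), (1, 2), (2, 3)]
star = [(0, 1), (0, 2), (0, 3)]
ring = [(0, 1), (1, 2), (2, 3), (3, 0)]

print(graph_edit_distance_bruteforce(4, linear, 4, star))  # move one edge: delete + insert
print(graph_edit_distance_bruteforce(4, linear, 4, ring))  # close the chain with one edge
```

Larger or labeled graphs need a proper GED solver; the brute-force search is only meant to make the definition concrete.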
To run one or more notebooks in Google Colab, first click the relevant link below.
- Polymer_Similarity_Method_Section_PolymerA_PolymerB
- Polymer_Similarity_Case1_Varying_Repeat_Units
- Polymer_Similarity_Case2_Varying_Topologies
- Polymer_Similarity_Case3_Varying_Both_Repeat_Units_and_Topologies
- Polymer_Similarity_Case4_Graft_Copolymers
- Polymer_Similarity_Case5_Segmented_Polymers
- SI-I: Earth Mover’s Distance vs. Simple Sum or Average Method
- SI-II: Modifying Weights for Final Ranking Order of Overall Similarity Score
- SI-III: Overall Pairwise Similarity Scores with Arithmetic Means
  - Overall Pairwise Similarity Scores with Arithmetic Means Case 1
  - Overall Pairwise Similarity Scores with Arithmetic Means Case 2
  - Overall Pairwise Similarity Scores with Arithmetic Means Case 3
  - Overall Pairwise Similarity Scores with Arithmetic Means Case 4
  - Overall Pairwise Similarity Scores with Arithmetic Means Case 5
- SI-IV: Similarity Calculation for Random Copolymer Polymer A with Different Composition Ratio of Repeat Units and Diblock Copolymer Polymer B with Different Block Length Ratio
- SI-V: Similarity Calculation for Graft Polymers with Different Lengths of Side Chains
- SI-VI: Similarity Calculation for Segmented Polymers with Different Fractions and Different Lengths of Macromonomers
- SI-VII: Similarity Calculation including Tacticity
Then click the Colab badge in the notebook to open it in Google Colab, where you can run it as usual. All required libraries and functions are included in the Colab notebook.
Jiale Shi, PhD
Postdoctoral Associate
Department of Chemical Engineering
Massachusetts Institute of Technology (MIT)
Email: [email protected]
GitHub ID: shijiale0609
@article{shi2023quantifying,
author = {Shi, Jiale and Rebello, Nathan J. and Walsh, Dylan and Zou, Weizhong and Deagen, Michael E. and Leao, Bruno Salomao and Audus, Debra J. and Olsen, Bradley D.},
title = {Quantifying Pairwise Similarity for Complex Polymers},
journal = {Macromolecules},
year = {2023},
doi = {10.1021/acs.macromol.3c00761},
URL = {https://doi.org/10.1021/acs.macromol.3c00761},
eprint = {https://doi.org/10.1021/acs.macromol.3c00761}
}