Abstract
Owing to their highly tunable structures, metal–organic frameworks (MOFs) are considered suitable candidates for a range of applications, including adsorption, separation, sensing and catalysis. However, MOFs must be stable in water vapour to be considered industrially viable. It is currently challenging to predict water stability in MOFs; experiments involve time-intensive MOF synthesis, while modelling techniques do not reliably capture the water stability behaviour. Here, we build a machine learning-based model to accurately and instantly classify MOFs as stable or unstable depending on the target application, or the amount of water exposed. The model is trained using an empirically measured dataset of water stabilities for over 200 MOFs, and uses a comprehensive set of chemical features capturing information about their constituent metal node, organic ligand and metal–ligand molar ratios. In addition to screening stable MOF candidates for future experiments, the trained models were used to extract a number of simple water stability trends in MOFs. This approach is general and can also be used to screen MOFs for other design criteria.
Data availability
The MOF water-stability data (illustrated in Fig. 2) used to train the models were obtained from ref. 13. The water-stability data used for validation (recent 10 MOFs) and screening (88 new MOFs) were obtained from the literature as cited in the Article. These datasets, including MOF features, are deposited at https://doi.org/10.5281/zenodo.4014333. Source data are provided with this paper.
Code availability
The machine learning training and prediction codes underlying this work are freely available for general use under GNU General Public Licence v3.0 and are deposited at https://doi.org/10.5281/zenodo.4014333.
References
Yoon, J. W. et al. Selective nitrogen capture by porous hybrid materials containing accessible transition metal ion sites. Nat. Mater. 16, 526–531 (2017).
Adil, K. et al. Gas/vapour separation using ultra-microporous metal–organic frameworks: insights into the structure/separation relationship. Chem. Soc. Rev. 46, 3402–3430 (2017).
Mason, J. A., Veenstra, M. & Long, J. R. Evaluating metal–organic frameworks for natural gas storage. Chem. Sci. 5, 32–51 (2014).
Furukawa, H., Cordova, K. E., O’Keeffe, M. & Yaghi, O. M. The chemistry and applications of metal–organic frameworks. Science 341, 1230444 (2013).
Dusselier, M. & Davis, M. E. Small-pore zeolites: synthesis and catalysis. Chem. Rev. 118, 5265–5329 (2018).
Yang, D. & Gates, B. C. Catalysis by metal–organic frameworks: perspective and suggestions for future research. ACS Catal. 9, 1779–1798 (2019).
Furukawa, H. et al. Ultrahigh porosity in metal–organic frameworks. Science 329, 424–428 (2010).
Li, H., Eddaoudi, M., O’Keeffe, M. & Yaghi, O. M. Design and synthesis of an exceptionally stable and highly porous metal–organic framework. Nature 402, 276–279 (1999).
Cohen, S. M. Postsynthetic methods for the functionalization of metal–organic frameworks. Chem. Rev. 112, 970–1000 (2011).
Zhang, Y.-B. et al. Introduction of functionality, selection of topology and enhancement of gas adsorption in multivariate metal–organic framework-177. J. Am. Chem. Soc. 137, 2641–2650 (2015).
Kaye, S. S., Dailly, A., Yaghi, O. M. & Long, J. R. Impact of preparation and handling on the hydrogen storage properties of Zn4O(1,4-benzenedicarboxylate)3 (MOF-5). J. Am. Chem. Soc. 129, 14176–14177 (2007).
Ma, D., Li, Y. & Li, Z. Tuning the moisture stability of metal–organic frameworks by incorporating hydrophobic functional groups at different positions of ligands. Chem. Commun. 47, 7377–7379 (2011).
Burtch, N. C., Jasuja, H. & Walton, K. S. Water stability and adsorption in metal–organic frameworks. Chem. Rev. 114, 10575–10612 (2014).
Schoenecker, P. M., Carson, C. G., Jasuja, H., Flemming, C. J. & Walton, K. S. Effect of water adsorption on retention of structure and surface area of metal–organic frameworks. Ind. Eng. Chem. Res. 51, 6513–6519 (2012).
Bosch, M., Zhang, M. & Zhou, H.-C. Increasing the stability of metal-organic frameworks. Adv. Chem. 2014, 182327 (2014).
Rieth, A. J., Wright, A. M. & Dincă, M. Kinetic stability of metal–organic frameworks for corrosive and coordinating gas capture. Nat. Rev. Mater. 4, 708–725 (2019).
ul Qadir, N., Said, S. A. & Bahaidarah, H. M. Structural stability of metal–organic frameworks in aqueous media – controlling factors and methods to improve hydrostability and hydrothermal cyclic stability. Micropor. Mesopor. Mater. 201, 61–90 (2015).
Plessius, R. et al. Highly selective water adsorption in a lanthanum metal–organic framework. Chem. Eur. J. 20, 7922–7925 (2014).
Qin, L. et al. A water-stable metal–organic framework of a zwitterionic carboxylate with dysprosium: a sensing platform for Ebolavirus RNA sequences. Chem. Commun. 52, 132–135 (2016).
Liu, T.-F. et al. Topology-guided design and syntheses of highly stable mesoporous porphyrinic zirconium metal–organic frameworks with high surface area. J. Am. Chem. Soc. 137, 413–419 (2014).
Zhang, J.-P., Zhu, A.-X., Lin, R.-B., Qi, X.-L. & Chen, X.-M. Pore surface tailored SOD-type metal–organic zeolites. Adv. Mater. 23, 1268–1271 (2011).
Nijem, N. et al. Water cluster confinement and methane adsorption in the hydrophobic cavities of a fluorinated metal–organic framework. J. Am. Chem. Soc. 135, 12615–12626 (2013).
Yang, C. et al. Fluorous metal–organic frameworks with superior adsorption and hydrophobic properties toward oil spill cleanup and hydrocarbon storage. J. Am. Chem. Soc. 133, 18094–18097 (2011).
Shih, Y.-H. et al. A simple approach to enhance the water stability of a metal–organic framework. Chem. Eur. J. 23, 42–46 (2017).
Taylor, J. M., Vaidhyanathan, R., Iremonger, S. S. & Shimizu, G. K. Enhancing water stability of metal–organic frameworks via phosphonate monoester linkers. J. Am. Chem. Soc. 134, 14338–14340 (2012).
Canivet, J., Fateeva, A., Guo, Y., Coasne, B. & Farrusseng, D. Water adsorption in MOFs: fundamentals and applications. Chem. Soc. Rev. 43, 5594–5617 (2014).
OpenSMILES; http://opensmiles.org
Kim, C., Chandrasekaran, A., Huan, T. D., Das, D. & Ramprasad, R. Polymer genome: a data-powered polymer informatics platform for property predictions. J. Phys. Chem. C 122, 17575–17585 (2018).
Mannodi-Kanakkithodi, A. et al. Scoping the polymer genome: a roadmap for rational polymer dielectrics design and beyond. Mater. Today 21, 785–796 (2018).
Huan, T. D., Mannodi-Kanakkithodi, A. & Ramprasad, R. Accelerated materials property predictions and design using motif-based fingerprints. Phys. Rev. B 92, 014106 (2015).
Nantasenamat, C., Isarankura-Na-Ayudhya, C. & Prachayasittikul, V. Advances in computational methods to predict the biological activity of compounds. Expert Opin. Drug Discov. 5, 633–654 (2010).
RDKit Open Source Toolkit for Cheminformatics; http://www.rdkit.org/ (accessed 3 September 2019).
Jha, A., Chandrasekaran, A., Kim, C. & Ramprasad, R. Impact of dataset uncertainties on machine learning model predictions: the example of polymer glass transition temperatures. Model. Simul. Mater. Sci. Eng. (2018); https://doi.org/10.1088/1361-651X/aaf8ca
Shannon, R. D. Revised effective ionic radii and systematic studies of interatomic distances in halides and chalcogenides. Acta Crystallogr. A 32, 751–767 (1976).
Haynes, W. M. CRC Handbook of Chemistry and Physics (CRC Press, 2014).
Pauling, L. The nature of the chemical bond. IV. The energy of single bonds and the relative electronegativity of atoms. J. Am. Chem. Soc. 54, 3570–3582 (1932).
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Xie, L., Liu, D., Huang, H., Yang, Q. & Zhong, C. Efficient capture of nitrobenzene from waste water using metal–organic frameworks. Chem. Eng. J. 246, 142–149 (2014).
Wang, D., Zhang, L., Li, G., Huo, Q. & Liu, Y. Luminescent MOF material based on cadmium(II) and mixed ligands: application for sensing volatile organic solvent molecules. RSC Adv. 5, 18087–18091 (2015).
Liao, P.-Q. et al. Drastic enhancement of catalytic activity via post-oxidation of a porous Mn(II) triazolate framework. Chem. Eur. J. 20, 11303–11307 (2014).
Jing, F. et al. MIL-68(Fe) as an efficient visible-light-driven photocatalyst for the treatment of a simulated waste-water contain Cr(VI) and malachite green. Appl. Catal. B Environ. 206, 9–15 (2017).
Cadiau, A. et al. Design of hydrophilic metal organic framework water adsorbents for heat reallocation. Adv. Mater. 27, 4775–4780 (2015).
Bazaga-Garcia, M. et al. Tuning proton conductivity in alkali metal phosphonocarboxylates by cation size-induced and water-facilitated proton transfer pathways. Chem. Mater. 27, 424–435 (2015).
Gutov, O. V. et al. Water-stable zirconium-based metal–organic framework material with high-surface area and gas-storage capacities. Chem. Eur. J. 20, 12389–12393 (2014).
Duan, J., Jin, W. & Krishna, R. Natural gas purification using a porous coordination polymer with water and chemical stability. Inorg. Chem. 54, 4279–4284 (2015).
Nguyen, K. T., Blum, L. C., Van Deursen, R. & Reymond, J.-L. Classification of organic molecules by molecular quantum numbers. ChemMedChem 4, 1803–1805 (2009).
Lin, R.-B. et al. Molecular sieving of ethylene from ethane using a rigid metal–organic framework. Nat. Mater. 17, 1128–1133 (2018).
Sun, Y. & Han, H. A novel 3D Ag(I) cationic metal–organic framework based on 1,2,4,5-tetra(4-pyridyl) benzene with selective adsorption of CO2 over CH4, H2O over C2H5OH, and trapping Cr2O7^2−. J. Mol. Struct. 1194, 73–77 (2019).
Acknowledgements
This work was supported as part of the Center for Understanding and Control of Acid Gas-Induced Evolution of Materials for Energy (UNCAGE-ME), an Energy Frontier Research Center funded by the US Department of Energy, Office of Science, Basic Energy Sciences under award no. DE-SC0012577. C.C. gratefully acknowledges a fellowship from the Achievement Rewards for College Scientists (ARCS) Foundation. R.B. acknowledges insightful discussions with D.S. Sholl.
Author information
Contributions
R.B. and R.R. initiated this research project. R.B. developed and analysed the ML models. C.C. and T.G.E. contributed to data collection. All co-authors contributed to the model analysis, discussions and writing of the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Statistics on water stability in MOFs.
Distribution of MOFs into 4 categories of water stability based on the constituting metal node.
Extended Data Fig. 2 Performance comparison of ML algorithms for 2-class model.
Performance comparison of SVM, RF and GB methods for the 2-class model (‘S’, stable and ‘U’, unstable MOFs) using the RFE-based reduced feature set. The left panel shows the overall class-weighted accuracies, while the right two panels show the per-class test scores, that is, F1, area under the ROC curve (AUC), precision (P) and recall (R), for the RF and SVM models. The RF model outperforms the others on all counts and was selected as the 2-class model in this work.
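The per-class scores named in this caption can be illustrated with a minimal sketch. It assumes that "class-weighted accuracy" means the mean of the per-class recalls (often called balanced accuracy); the caption does not define the term, so this reading is an assumption, and the labels below are toy values rather than data from the paper.

```python
def per_class_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for every label in y_true."""
    scores = {}
    for label in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores


def class_weighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally (assumed definition)."""
    scores = per_class_scores(y_true, y_pred)
    return sum(recall for _, recall, _ in scores.values()) / len(scores)


# Toy labels (hypothetical): "S" = stable, "U" = unstable
y_true = ["S", "S", "S", "S", "U", "U", "U", "U", "U", "U"]
y_pred = ["S", "S", "S", "U", "U", "U", "U", "U", "S", "S"]

scores = per_class_scores(y_true, y_pred)
balanced = class_weighted_accuracy(y_true, y_pred)
for label, (p, r, f1) in scores.items():
    print(f"{label}: P={p:.2f} R={r:.2f} F1={f1:.2f}")
print(f"class-weighted accuracy: {balanced:.3f}")
```

Averaging recalls rather than counting correct predictions overall keeps a majority class from dominating the score, which matters when stable and unstable MOFs are unevenly represented.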
Extended Data Fig. 3 Performance comparison of ML algorithms for 3-class model.
Performance comparison of SVM, RF and GB methods for the 3-class model (‘S’, stable, ‘HK’, high kinetic stability, and ‘U’, unstable MOFs) using the RFE-based reduced feature set. The left panel shows the overall class-weighted accuracies, while the right two panels show the per-class F1 and recall scores, respectively, for the RF and SVM models. The RF model performs poorly for the underrepresented stable (S) class, although it was trained to maximize the class-weighted accuracy. Similar results were found for the GB algorithm. Thus, SVM, which had the best performance across all classes, was selected as the 3-class model in this work.
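One common way to keep an underrepresented class from being ignored during training is inverse-frequency class weighting. The sketch below computes such weights as n_samples / (n_classes × class count); whether the authors used this exact scheme is not stated here, so it is shown only as an assumption, with a hypothetical class distribution.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical 3-class distribution with a rare stable ("S") class
labels = ["S"] * 5 + ["HK"] * 30 + ["U"] * 25
weights = balanced_class_weights(labels)
print(weights)
```

With these toy counts the rare S class receives a weight several times larger than the other two, so each misclassified S sample costs more during training.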
Extended Data Fig. 4 Important MOF water stability descriptors.
Relative feature importance as extracted from the random forest (RF) 2-class model. For RF, feature importance is based on the mean decrease in impurity (MDI), as explained in G. Louppe, Understanding Random Forests: From Theory to Practice, PhD thesis, Univ. of Liège, 2014. The first letter of each descriptor, M or L, denotes a metal- or ligand-associated feature, respectively (see main article for details). Features with relatively high importance were used to derive the chemical trends of water stability in MOFs discussed in the main article.
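The idea behind MDI can be sketched for a single toy decision tree. Each split contributes the weighted drop in Gini impurity it achieves, and a feature's importance is the sum of the contributions from the splits that use it, normalized over all features (a forest then averages this over its trees). The feature names `M_feature` and `L_feature` and the node contents are hypothetical, and Gini impurity is assumed as the split criterion.

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def impurity_decrease(parent, left, right, n_total):
    """Impurity drop of one split, weighted by the fraction of samples reaching it."""
    n = len(parent)
    children = (len(left) / n) * gini(left) + (len(right) / n) * gini(right)
    return (n / n_total) * (gini(parent) - children)


# Toy tree: the root splits on a metal feature, its left child on a ligand feature
root = ["S"] * 5 + ["U"] * 5
root_left = ["S"] * 4 + ["U"]
root_right = ["S"] + ["U"] * 4

raw = {
    "M_feature": impurity_decrease(root, root_left, root_right, len(root)),
    "L_feature": impurity_decrease(root_left, ["S"] * 4, ["U"], len(root)),
}
total = sum(raw.values())
importance = {name: value / total for name, value in raw.items()}
print(importance)
```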
Extended Data Fig. 5 Correlation between MOF water stability and its descriptors.
A subset of post-RFE features was analysed to see whether linear correlations between MOF water stability, for the case with two classes (S+HK and U+LK), and the feature values could be used to derive chemical trends. The figure suggests that the presence of certain chemical motifs, especially those containing N or ketone groups and 5-membered rings, tends to enhance water stability in MOFs. Each marker in the figure represents a MOF from the Burtch data set. See Supplementary Information for details on the different descriptors.
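A linear correlation of this kind can be sketched as a Pearson coefficient between a binary stability label and a feature value. The encoding (1 for S+HK, 0 for U+LK) and the toy feature, a count of N atoms per ligand, are assumptions made for illustration and are not taken from the paper's dataset.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (spread_x * spread_y)


# Hypothetical feature: number of N atoms in the ligand of each MOF
n_atoms = [0, 0, 1, 2, 2, 3]
# Assumed encoding: 1 = stable (S+HK), 0 = unstable (U+LK)
stable = [0, 0, 0, 1, 1, 1]

r = pearson(n_atoms, stable)
print(f"r = {r:.3f}")
```

A coefficient near +1 or -1 would point to a simple monotonic trend for that descriptor, while a value near 0 says only that no linear relationship is visible.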
Supplementary information
Supplementary Information
Supplementary Tables 1 and 2 discussing the reduced feature set and model predictions on 88 new MOFs, respectively.
Source data
Source Data Fig. 3
Statistical source data.
Source Data Fig. 5
Statistical source data.
Source Data Extended Data Fig. 1
Statistical source data.
Source Data Extended Data Fig. 2
Statistical source data.
Source Data Extended Data Fig. 3
Statistical source data.
Source Data Extended Data Fig. 4
Statistical source data.
Source Data Extended Data Fig. 5
Statistical source data.
About this article
Cite this article
Batra, R., Chen, C., Evans, T.G. et al. Prediction of water stability of metal–organic frameworks using machine learning. Nat Mach Intell 2, 704–710 (2020). https://doi.org/10.1038/s42256-020-00249-z
DOI: https://doi.org/10.1038/s42256-020-00249-z