The construction industry faces mounting pressure to reduce its carbon footprint, with cement production alone contributing over 6% of global greenhouse gas emissions. In recent work, machine learning models screened over 14,000 materials from scientific literature and 1 million rock samples, identifying a diverse set of secondary and natural materials that could partially replace global cement production.
Cement production produces greenhouse gas emissions, driven by the energy-intensive production of clinker and limestone calcination1. The conventional approach to reducing these emissions involves replacing clinker with supplementary cementitious materials like coal fly ash and blast furnace slag. However, the availability of these traditional substitutes has declined by 37% over two decades as coal plants close and steel recycling increases2. This supply constraint, combined with rising demand for sustainable construction, necessitates identifying new materials that can undergo similar cement-like reactions.
Machine learning offers a systematic approach to screen large databases of materials and has demonstrated exciting potential in other applications3. Recently, a team led by Soroush Mahjoubi and Elsa Olivetti at Massachusetts Institute of Technology have reported a comprehensive framework that combines natural language processing with predictive modelling to explore cement alternatives (https://doi.org/10.1038/s43246-025-00820-4)4. The team developed an innovative approach, mining over 5.7 million scientific papers to extract chemical compositions of more than 14,000 materials. They then employed fine-tuned large language models to classify these materials into 19 categories. A neural network based on this database was trained to predict three critical reactivity metrics: heat release, Ca(OH)2 consumption, and bound water content, achieving R2 values exceeding 0.85.
This approach reveals unexpected diversity among reactive materials. Construction and demolition wastes, including recycled ceramics and concrete, exhibit heat releases up to 450âJ/g, comparable to traditional pozzolansâthe broad class of siliceous and aluminous cement-forming materials. Municipal solid waste incineration ash and various biomass ashes (rice husk, sugarcane bagasse, wood) also demonstrate significant pozzolanic behaviour. Among mine tailings, copper and zinc varieties show particularly promising reactivity profiles. Excitingly, this report indicates these secondary materials could collectively replace 68% of global cement production (Fig. 1). However, not all regions have access to industrial byproducts, making the discovery of reactive natural materials particularly significant. By applying their model to predict reactivity for a global geochemical database of over 1 million rock samples, the team identified 25 rock types with significant reactivity when mechanically activated. Ignimbrite and silicic tuff show the highest reactive-to-total sample ratios (~25%), while more abundant rocks like rhyolite and andesite, though displaying lower ratios, offer greater global availability. These reactive rocks are concentrated in tectonically active regions, including the Andes, the Great Rift Valley, and the Pacific Ring of Fire, providing regional alternatives where industrial byproducts are scarce.
a Heat release versus Ca(OH)2 consumption for different material types. b Density distribution of the 18 material classes in reactivity space. Grey lines mark critical thresholds: materials releasing >120âJ/g are considered reactive (versus inert), while those consuming >50âg/100âg Ca(OH)2 exhibit pozzolanic behaviour. Colour intensity indicates sample frequency. Reproduced from Communications Materials under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (2025)4.
The technical achievement of this study required overcoming significant challenges in data scarcity and variability. âMuch of the existing data on cement substitutes is scattered, inconsistent, and incomplete, especially regarding key physical properties like amorphous contentâ, notes Dr Mahjoubi. To overcome this, the team developed a specialised neural network with multiple prediction pathways that can intelligently fill in missing data gaps while maintaining accuracy, an essential feature when dealing with the incomplete records common in materials research. This sophisticated approach enabled these researchers to capture the complex chemical and physical interactions governing cement reactivity while managing the inherent uncertainty in the available data.
Implementing these alternatives at scale could reduce global greenhouse gas emissions by 3%, equivalent to removing 260 million vehicles from roads. Many identified materials require only mechanical activation through grinding, avoiding the energy-intensive thermal processing needed for other cement substitutes. The geographic distribution of natural precursors also addresses regional disparities in access to sustainable construction materials. âExperimental validation of some of the most promising candidate materials is a critical next stepâ, emphasises Dr Mahjoubi. Importantly, the machine learning framework, able to rapidly screen materials based on chemical and physical properties, provides a valuable foundation for expanding the circular economy in construction. âFuture work will also aim to integrate knowledge about cement hydration kinetics to further improve the interpretability and accuracy of predictionsâ, comments Dr Mahjoubi. As infrastructure demands continue to grow globally, data-driven approaches offer a promising pathway to maintain material performance while significantly reducing the environmental impact of concrete production.
References
Monteiro, P. J. M. et al. Towards sustainable concrete. Nat. Mater. https://doi.org/10.1038/nmat4930 (2017).
Shah, I. M. et al. Cement substitution with secondary materials can reduce annual global CO2 emissions by up to 1.3 gigatons. Nat. Commun. https://doi.org/10.1038/s41467-022-33289-7 (2022).
Merchant, A. et al. Scaling deep learning for materials discovery. Nature https://doi.org/10.1038/s41586-023-06735-9 (2023).
Mahjoubi, S. et al. Data-driven material screening of secondary and natural cementitious precursors. Commun. Mater. https://doi.org/10.1038/s43246-025-00820-4 (2025).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Evans, J.D. Towards net zero by data-driven discovery of sustainable cement alternatives. Commun Chem 8, 213 (2025). https://doi.org/10.1038/s42004-025-01608-w
Published:
DOI: https://doi.org/10.1038/s42004-025-01608-w
