{"id":208781,"date":"2025-03-15T15:16:51","date_gmt":"2025-03-15T20:16:51","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2025\/03\/active-phase-discovery-in-heterogeneous-catalysis-via-topology-guided-sampling-and-machine-learning"},"modified":"2025-03-15T15:16:51","modified_gmt":"2025-03-15T20:16:51","slug":"active-phase-discovery-in-heterogeneous-catalysis-via-topology-guided-sampling-and-machine-learning","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2025\/03\/active-phase-discovery-in-heterogeneous-catalysis-via-topology-guided-sampling-and-machine-learning","title":{"rendered":"Active phase discovery in heterogeneous catalysis via topology-guided sampling and machine learning"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/active-phase-discovery-in-heterogeneous-catalysis-via-topology-guided-sampling-and-machine-learning.jpg\"><\/a><\/p>\n<p>Global optimization-based approaches such as basin hopping<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wexler, R. B., Qiu, T. & Rappe, A. M. Automatic prediction of surface phase diagrams using ab initio grand canonical monte carlo. J. Phys. Chem. C 123, 2321&ndash;2328 (2019).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR28\" id=\"ref-link-section-d312143174e600\">28<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wales, D. J. & Doye, J. P. K. Global optimization by basin-hopping and the lowest energy structures of lennard-jones clusters containing up to 110 atoms. J. Phys. Chem. A 101, 5111&ndash;5116 (1997).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR29\" id=\"ref-link-section-d312143174e600_1\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Panosetti, C., Krautgasser, K., Palagin, D., Reuter, K. & Maurer, R. J. Global materials structure search with chemically motivated coordinates. Nano Lett 15, 8044&ndash;8048 (2015).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR30\" id=\"ref-link-section-d312143174e600_2\">30<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 31\" title=\"Obersteiner, V., Scherbela, M., Hormann, L., Wegner, D. & Hofmann, O. T. Structure prediction for surface-induced phases of organic monolayers: Overcoming the combinatorial bottleneck. Nano Lett 17, 4453&ndash;4460 (2017).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR31\" id=\"ref-link-section-d312143174e603\">31<\/a><\/sup>, evolutionary algorithms<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Bauer, M. N., Probert, M. I. J. & Panosetti, C. Systematic comparison of genetic algorithm and basin hopping approaches to the global optimization of Si(111) surface reconstructions. J. Phys. Chem. A 126, 3043&ndash;3056 (2022).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR32\" id=\"ref-link-section-d312143174e607\">32<\/a><\/sup> and random structure search<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 33\" title=\"Schusteritsch, G. & Pickard, C. J. Predicting interface structures: From SrTiO3 to graphene. Phys. Rev. B 90, 035424 (2014).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR33\" id=\"ref-link-section-d312143174e611\">33<\/a><\/sup> offer principled approaches to comprehensively navigating the ambiguity of active phase. However, these methods usually rely on skillful parameter adjustments and predefined conditions, and face challenges in exploring the entire configuration space and dealing with amorphous structures. The graph theory-based algorithms<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Deshpande, S., Maxson, T. & Greeley, J. Graph theory approach to determine configurations of multidentate and high coverage adsorbates for heterogeneous catalysis. Npj Comput. Mater 6, 79 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR34\" id=\"ref-link-section-d312143174e615\">34<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Ghanekar, P. G., Deshpande, S. & Greeley, J. Adsorbate chemical environment-based machine learning framework for heterogeneous catalysis. Nat. Commun. 13, 5788 (2022).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR35\" id=\"ref-link-section-d312143174e615_1\">35<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Han, S., Lysgaard, S., Vegge, T. & Hansen, H. A. Rapid mapping of alloy surface phase diagrams via bayesian evolutionary multitasking. Npj Comput. Mater 9,139 (2023).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR36\" id=\"ref-link-section-d312143174e615_2\">36<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Boes, J. R., Mamun, O., Winther, K. & Bligaard, T. Graph theory approach to high-throughput surface adsorption structure generation. J. Phys. Chem. A 123, 2281&ndash;2285 (2019).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR37\" id=\"ref-link-section-d312143174e618\">37<\/a><\/sup>, which can enumerate configurations for a specific adsorbate coverage on the surface with graph isomorphism algorithms, even on an asymmetric one. Nevertheless, these methods can only study the adsorbate coverage effect on the surface because the graph representation is insensitive to three-dimensional information, making it unable to consider subsurface and bulk structure sampling. Other geometric-based methods<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 38\" title=\"Du, X. et al. Machine-learning-accelerated simulations to enable automatic surface reconstruction. Nat. Comput. Sci. 3, 1034&ndash;1044 (2023).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR38\" id=\"ref-link-section-d312143174e622\">38<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Ong, S. P. et al. Python materials genomics (pymatgen): A robust, open-source python library for materials analysis. Comput. Mater. Sci 68314&ndash;319 (2013).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR39\" id=\"ref-link-section-d312143174e625\">39<\/a><\/sup> also have been developed for determining surface adsorption sites but still face difficulties when dealing with non-uniform materials or embedding sites in subsurface.<\/p>\n<p>Topology, independent of metrics or coordinates, presents a novel approach that could potentially offer a comprehensive traversal of structural complexity. Persistent homology, an emerging technique in the field of topological data analysis, bridges the topology and real geometry by capturing geometric structures over various spatial scales through filtration and persistence<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Edelsbrunner, Letscher & Zomorodian Topological persistence and simplification. Discrete Comput. Geomet. 28511&ndash;533, 0179&ndash;5376 (2002).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR40\" id=\"ref-link-section-d312143174e632\">40<\/a><\/sup>. Through embedding geometric information into topological invariants, which are the properties of topological spaces that remain unchanged under specific continuous deformations, it allows the monitoring of the \u201cbirth,\u201d \u201cdeath,\u201d and \u201cpersistence\u201d of isolated components, loops, and cavities across all geometric scales using topological measurements. Topological persistence is usually represented by persistent barcodes, where different horizontal line segments or bars denote homology generators<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Ghrist, R. Barcodes: The persistent topology of data. Bulletin American Mathematical Society 45, 61&ndash;75 (2008). 0273&ndash;0979.\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR41\" id=\"ref-link-section-d312143174e636\">41<\/a><\/sup>. Persistent homology has been successfully employed to the feature representation for machine learning<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Raphael Reinauer, M. C. Nicolas Berkouk Persformer: A transformer architecture for topological machine learning. ArXiv preprint. arXiv:2112.15210 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR42\" id=\"ref-link-section-d312143174e640\">42<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Zia, A. et al. Topological deep learning: A review of an emerging paradigm. Artif. Intell. Rev. 57, 77 (2024).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR43\" id=\"ref-link-section-d312143174e643\">43<\/a><\/sup>, molecular science<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Townsend, J., Micucci, C. P., Hymel, J. H., Maroulas, V. & Vogiatzis, K. D. Representation of molecular structures with persistent homology for machine learning applications in chemistry. Nat. Commun. 11, 3230 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR44\" id=\"ref-link-section-d312143174e647\">44<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Steinberg, L., Russo, J. & Frey, J. A new topological descriptor for water network structure. J. Cheminform 11, 48 (2019).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR45\" id=\"ref-link-section-d312143174e650\">45<\/a><\/sup>, materials science<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, D. et al. Path topology in molecular and materials sciences. J. Phys. Chem. Lett. 14954&ndash;964 (2023).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR46\" id=\"ref-link-section-d312143174e654\">46<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, X. et al. Topology-based machine learning strategy for cluster structure prediction. J. Phys. Chem. Lett. 11, 4392&ndash;4401 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR47\" id=\"ref-link-section-d312143174e654_1\">47<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Jiang, Y. et al. Topological representations of crystalline compounds for the machine-learning prediction of materials properties. NPJ Comput Mater 7, 28 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR48\" id=\"ref-link-section-d312143174e654_2\">48<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Zheng, S., Ding, H., Li, S., Chen, D. & Pan, F. Application of topology-based structure features for machine learning in materials science. Chinese J. Struc. Chem. 42, 100120 (2023).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR49\" id=\"ref-link-section-d312143174e654_3\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Hiraoka, Y. et al. Hierarchical structures of amorphous solids characterized by persistent homology. Proc. Natl. Acad. Sci. Usa. 113, 7035&ndash;7040 (2016).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR50\" id=\"ref-link-section-d312143174e654_4\">50<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Sorensen, S. S., Biscio, C. A. N., Bauchy, M., Fajstrup, L. & Smedskjaer, M. M. Revealing hidden medium-range order in amorphous materials using topological data analysis. Sci. Adv. 6, eabc2320 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR51\" id=\"ref-link-section-d312143174e654_5\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Minamitani, E., Shiga, T., Kashiwagi, M. & Obayashi, I. Topological descriptor of thermal conductivity in amorphous si. J. Chem. Phys. 156, 244502 (2022).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR52\" id=\"ref-link-section-d312143174e654_6\">52<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Lee, Y. et al. Quantifying similarity of pore-geometry in nanoporous materials. Nat. Commun. 8, 15396 (2017).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR53\" id=\"ref-link-section-d312143174e654_7\">53<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Krishnapriyan, A. S., Montoya, J., Haranczyk, M., Hummelshoj, J. & Morozov, D. Machine learning with persistent homology and chemical word embeddings improves prediction accuracy and interpretability in metal-organic frameworks. Sci. Rep. 11, 8888 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR54\" id=\"ref-link-section-d312143174e654_8\">54<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Minamitani, E., Obayashi, I., Shimizu, K. & Watanabe, S. Persistent homology-based descriptor for machine-learning potential of amorphous structures. J. Chem. Phys. 159, 084101 (2023).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR55\" id=\"ref-link-section-d312143174e657\">55<\/a><\/sup>, and computational biology<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Chen, D., Liu, J. & Wei, G.-W. Multiscale topology-enabled structure-to-sequence transformer for protein&ndash;ligand interaction predictions. Nat. Mach. Intell. 6799&ndash;810 (2024).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR56\" id=\"ref-link-section-d312143174e662\">56<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Arango, A. S., Park, H. & Tajkhorshid, E. Topological learning approach to characterizing biological membranes. J. Chem. Inf. Model. 64, 5242&ndash;5252 (2024).\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-57824-4#ref-CR57\" id=\"ref-link-section-d312143174e665\">57<\/a><\/sup>. The successful application motivates us to explore its potential as a sampling algorithm due to its capability of characterizing material structures multidimensionally.<\/p>\n<p>In this work, we introduce a topology-based automatic active phase exploration framework, enabling the thorough configuration sampling and efficient computation via MLFF. The core of this framework is a sampling algorithm (PH-SA) in which the persistent homology analysis is leveraged to detect the possible adsorption\/embedding sites in space via a bottom-up approach. The PH-SA enables the exploration of interactions between surface, subsurface and even bulk phases with active species, without being limited by morphology and thus can be applied to periodical and amorphous structures. MLFF are then trained through transfer learning to enable rapid structural optimization of sampled configurations. Based on the energetic information, Pourbaix diagram is constructed to describe the response of active phase to external environmental conditions. We validated the effectiveness of the framework with two examples: the formation of Pd hydrides with slab models and the oxidation of Pt clusters in electrochemical conditions. The structure evolution process of these two systems was elucidated by screening 50,000 and 100,000 possible configurations, respectively. The predicted phase diagrams with varying external potentials and their intricate roles in shaping the mechanisms of CO<sub>2<\/sub> electroreduction and oxygen reduction reaction were discussed, demonstrating close alignment with experimental observations. Our algorithm can be easily applied to other heterogeneous catalytic structures of interest and pave the way for the realization of automatic active phase analysis under realistic conditions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Global optimization-based approaches such as basin hopping28,29,30,31, evolutionary algorithms32 and random structure search33 offer principled approaches to comprehensively navigating the ambiguity of active phase. However, these methods usually rely on skillful parameter adjustments and predefined conditions, and face challenges in exploring the entire configuration space and dealing with amorphous structures. The graph theory-based algorithms34,35,36,37, which [\u2026]<\/p>\n","protected":false},"author":661,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19,41,6,8],"tags":[],"class_list":["post-208781","post","type-post","status-publish","format-standard","hentry","category-chemistry","category-information-science","category-robotics-ai","category-space"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/208781","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/661"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=208781"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/208781\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=208781"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=208781"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=208781"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}