Application of Molecular Descriptors and End-to-End Deep Learning in MOFs Design

Ying He, Fangchang Tan, Xiliang Yan

Prog Chem ›› 2025, Vol. 37 ›› Issue (8) : 1177-1187.

PDF(3368 KB)
Home Journals Progress in Chemistry
Progress in Chemistry

Abbreviation (ISO4): Prog Chem      Editor in chief: Jincai ZHAO

About  /  Aim & scope  /  Editorial board  /  Indexed  /  Contact  / 
PDF(3368 KB)
Prog Chem ›› 2025, Vol. 37 ›› Issue (8) : 1177-1187. DOI: 10.7536/PC241104
Review

Application of Molecular Descriptors and End-to-End Deep Learning in MOFs Design

Author information +
History +

Abstract

Metal-organic frameworks (MOFs) exhibit great promise in diverse applications such as gas storage,catalysis,and sensing due to their distinctive structures and physicochemical properties. However,traditional experimental approaches face challenges in quickly and efficiently designing MOFs withthe desired characteristics. In recent years,artificial intelligence (AI) techniques,particularly traditional machine learning and deep learning,have been extensively applied in materials science,yielding numerous noteworthy results. An essential requirement for successful modeling with these techniques is the ability to extract the structural features of MOFs and transform them into computer-readable formats. Therefore,we present a comprehensive review of two feature extraction approaches based on molecular descriptors and end-to-end deep learning. We summarize the fundamental concepts and principles of both methods,emphasizing their specific applications and recent advancements in MOFs design. Finally,we discuss the challenges and future directions for improving the comprehensiveness,interpretability,and reproducibility of structural feature extraction. This review aims to provide valuable insights and theoretical guidance for AI-driven MOFs design.

Contents

1 Introduction

2 Traditional machine learning and end-to-end deep learning

2.1 Basic concepts and historical development of artificial intelligence

2.2 Key steps in traditional machine learning and end-to-end deep Learning

2.3 Differences between traditional machine learning and end-to-end deep Learning

2.4 Overview of the MOF databases

3 Feature extraction based on molecular descriptors

3.1 Structural descriptors

3.2 Chemical characteristics

3.3 Thermodynamic properties

3.4 Feature selection and dimensionality reduction techniques

3.5 Effective strategies for handling missing features and noisy data

4 Application of end-to-end deep learning model to MOFs design

4.1 Convolutional neural networks

4.2 Recurrent neural networks

4.3 Graph neural networks

4.4 Generative adversarial networks

5 Conclusion and outlook

Key words

artificial intelligence / computer-aided material design / QSPR / molecular descriptors / feature extraction

Cite this article

Download Citations
Ying He , Fangchang Tan , Xiliang Yan. Application of Molecular Descriptors and End-to-End Deep Learning in MOFs Design[J]. Progress in Chemistry. 2025, 37(8): 1177-1187 https://doi.org/10.7536/PC241104

References

[1]
Furukawa H, Cordova K E, O’Keeffe M, Yaghi O M. Science, 2013, 341(6149): 1230444.
[2]
Yang Q Y, Liu D H, Zhong C L, Li J R. Chem. Rev., 2013, 113(10): 8261.
[3]
Li J R, Kuppler R J, Zhou H C. Chem. Soc. Rev., 2009, 38(5): 1477.
[4]
Knebel A, Caro J. Nat. Nanotechnol., 2022, 17(9): 911.
[5]
Belmabkhout Y, Bhatt P M, Adil K, Pillai R S, Cadiau A, Shkurenko A, Maurin G, Liu G P, Koros W J, Eddaoudi M. Nat. Energy, 2018, 3(12): 1059.
[6]
Yarahmadi H, Salamah S K, Kheimi M. Sci. Rep., 2023, 13: 19136.
[7]
Agirrezabal-Telleria I, Luz I, Ortuño M A, Oregui-Bengoechea M, Gandarias I, López N, Lail M A, Soukri M. Nat. Commun., 2019, 10: 2076.
[8]
Wang S, Fu Y, Wang T, Liu W S, Wang J, Zhao P, Ma H P, Chen Y, Cheng P, Zhang Z J. Nat. Commun., 2023, 14: 7261.
[9]
Zhang J, Liu L S, Zheng C F, Li W, Wang C R, Wang T S. Nat. Commun., 2023, 14: 4922.
[10]
Deng K R, Hou Z Y, Li X J, Li C X, Zhang Y X, Deng X R, Cheng Z Y, Lin J. Sci. Rep., 2015, 5: 7851.
[11]
Hu L G, Wu W H, Hu M, Jiang L, Lin D H, Wu J, Yang K. Nat. Commun., 2024, 15: 3204.
[12]
Rosen A S, Fung V, Huck P, O’Donnell C T, Horton M K, Truhlar D G, Persson K A, Notestein J M, Snurr R Q. NPJ Comput. Mater., 2022, 8: 112.
[13]
Erickson B J. Radiol. Clin. N Am, 2021, 59(6): 933.
[14]
Rosenberg I, Sicard G, David E O. Entropy, 2018, 20(5): 390.
[15]
Salih A, Boscolo Galazzo I, Gkontra P, Lee A M, Lekadir K, Raisi-Estabragh Z, Petersen S E. Circ. Cardiovasc. Imag., 2023, 16(4): e014519.
[16]
Turing A M. Mind, 1950, 59(236): 433.
[17]
Mccarthy J, Minsky M, Rochester N, Shannon C E. AI Magazine, 2006, 27.
[18]
Krizhevsky A, Sutskever I, Hinton G JaINIPS. Commun. ACM, 2012, 25: 2.
[19]
Lu C X, Wan X L, Ma X H, Guan X J, Zhu A C. J. Chem. Inf. Model., 2022, 62(14): 3281.
[20]
He Y, Liu F, Min W C, Liu G H, Wu Y B, Wang Y, Yan X L, Yan B. ACS Appl. Mater. Interfaces, 2024, 16(48): 66367.
[21]
LeCun Y, Bengio Y, Hinton G. Nature, 2015, 521(7553): 436.
[22]
Zhang Y, Zhou B H, Cai X R, Guo W Y, Ding X K, Yuan X J. Inf. Sci., 2021, 551: 67.
[23]
Wang Z H, Chen J, Hoi S C H. IEEE Trans. Pattern Anal. Mach. Intell., 2021, 43(10): 3365.
[24]
Choi R. Y, Coyner A, Kalpathy-Cramer J, Chiang M, Campbell. J. P. Transl. Vis. Sci. Technol., 2020, 9: 14.
[25]
Talaei Khoei T, Ould Slimane H, Kaabouch N. Neural Comput. Appl., 2023, 35(31): 23103.
[26]
Wilmer C E, Leaf M, Lee C Y, Farha O K, Hauser B G, Hupp J T, Snurr R Q. Nat. Chem., 2012, 4(2): 83.
[27]
Chung Y G, Camp J, Haranczyk M, Sikora B J, Bury W, Krungleviciute V, Yildirim T, Farha O K, Sholl D S, Snurr R Q. Chem. Mater., 2014, 26(21): 6185.
[28]
Bobbitt N S, Shi K H, Bucior B J, Chen H Y, Tracy-Amoroso N, Li Z, Sun Y, Merlin J H, Siepmann J I, Siderius D W, Snurr R Q. J. Chem. Eng. Data, 2023, 68(2): 483.
[29]
Li A, Perez R B, Wiggin S, Ward S C, Wood P A, Fairen-Jimenez D. Matter, 2021, 4(4): 1105.
[30]
Yaghi O M, Li H L. J. Am. Chem. Soc., 1995, 117(41): 10401.
[31]
Düren T, Sarkisov L, Yaghi O M, Snurr R Q. Langmuir, 2004, 20(7): 2683.
[32]
Wilmer C E, Farha O K, Bae Y S, Hupp J T, Snurr R Q. Energy Environ. Sci., 2012, 5(12): 9849.
[33]
Groom C R, Bruno I J, Lightfoot M P, Ward S C. Acta Crystallogr. Sect. B Struct. Sci. Cryst. Eng. Mater., 2016, 72(2): 171.
[34]
Chung Y G, Haldoupis E, Bucior B J, Haranczyk M, Lee S, Zhang H D, Vogiatzis K D, Milisavljevic M, Ling S L, Camp J S, Slater B, Siepmann J I, Sholl D S, Snurr R Q. J. Chem. Eng. Data, 2019, 64(12): 5985.
[35]
Rosen A S, Iyer S M, Ray D, Yao Z P, Aspuru-Guzik A, Gagliardi L, Notestein J M, Snurr R Q. Matter, 2021, 4(5): 1578.
[36]
Wang J Q, Liu J P, Wang H S, Zhou M S, Ke G L, Zhang L F, Wu J Z, Gao Z F, Lu D N. Nat. Commun., 2024, 15: 1904.
[37]
Batra R, Chen C, Evans T G, Walton K S, Ramprasad R. Nat. Mach. Intell., 2020, 2(11): 704.
[38]
Mashhadimoslem H, Ali Abdol M, Karimi P, Zanganeh K, Shafeen A, Elkamel A, Kamkar M. ACS Nano, 2024, 18(35): 23842.
[39]
Cao Z L, Magar R, Wang Y Y, Barati Farimani A. J. Am. Chem. Soc., 2023, 145(5): 2958.
[40]
He Y P, Cubuk E D, Allendorf M D, Reed E J. J. Phys. Chem. Lett., 2018, 9(16): 4562.
[41]
Faber F, Lindmaa A, von Lilienfeld O A, Armiento R. Int. J. Quantum Chem., 2015, 115(16): 1094.
[42]
Pham T L, Nguyen N D, Nguyen V D, Kino H, Miyake T, Dam H C. J. Chem. Phys., 2018, 148(20): 204106.
[43]
Bai X F, Li Y, Xie Y B, Chen Q C, Zhang X, Li J R. Green Energy Environ., 2025, 10(1): 132.
[44]
Bucior B J, Bobbitt N S, Islamoglu T, Goswami S, Gopalan A, Yildirim T, Farha O K, Bagheri N, Snurr R Q. Mol. Syst. Des. Eng., 2019, 4(1): 162.
[45]
Chandrashekar G, Sahin F. Comput. Electr. Eng., 2014, 40(1): 16.
[46]
Odhiambo Omuya E, Onyango Okeyo G, Waema Kimwele M. Expert Syst. Appl., 2021, 174: 114765.
[47]
Goodall R E A, Lee A A. Nat. Commun., 2020, 11: 6280.
[48]
Zhao M M, Peng H P, Li L X, Ren Y Q. Sensors, 2024, 24(5): 1522.
[49]
Choudhary K, DeCost B. NPJ Comput. Mater., 2021, 7: 185.
[50]
Sanchez-Cesteros O, Rincon M, Bachiller M, Valladares-Rodriguez S. Sensors, 2023, 23(17): 7582.
[51]
Sarikas A P, Gkagkas K, Froudakis G E. Sci. Rep., 2024, 14: 2242.
[52]
Hung T H, Xu Z X, Kang D Y, Lin L C. J. Phys. Chem. C, 2022, 126(5): 2813.
[53]
Arús-Pous J, Johansson S V, Prykhodko O, Bjerrum E J, Tyrchan C, Reymond J L, Chen H M, Engkvist O. J. Cheminf., 2019, 11: 71.
[54]
Zhang X Y, Zhang K X, Lee Y J. ACS Appl. Mater. Interfaces, 2020, 12(1): 734.
[55]
Ju W, Fang Z, Gu Y Y, Liu Z Q, Long Q Q, Qiao Z Y, Qin Y F, Shen J H, Sun F, Xiao Z P, Yang J W, Yuan J Y, Zhao Y S, Wang Y F, Luo X, Zhang M. Neural Netw., 2024, 173: 106207.
[56]
Xie T, Grossman J C. Phys. Rev. Lett., 2018, 120(14): 145301.
[57]
Lu X Y, Xie Z Z, Wu X J, Li M M, Cai W Q. Chem. Eng. Sci., 2022, 259: 117813.
[58]
Sanchez-Lengeling B, Aspuru-Guzik A. Science, 2018, 361(6400): 360.
[59]
Gómez-Bombarelli R, Wei J N, Duvenaud D, Hernández-Lobato J M, Sánchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel T D, Adams R P, Aspuru-Guzik A. ACS Cent. Sci., 2018, 4(2): 268.
[60]
Yao Z P, Sánchez-Lengeling B, Bobbitt N S, Bucior B J, Kumar S G H, Collins S P, Burns T, Woo T K, Farha O K, Snurr R Q, Aspuru-Guzik A. Nat. Mach. Intell., 2021, 3(1): 76.
[61]
Park J, Lee Y, Kim J. Nat. Commun., 2025, 16(1): 34.
[62]
Nandy A, Terrones G, Arunachalam N, Duan C R, Kastner D W, Kulik H J. Sci. Data, 2022, 9: 74.
[63]
Burner J, Luo J, White A, Mirmiran A, Kwon O, Boyd P G, Maley S, Gibaldi M, Simrod S, Ogden V, Woo T K. Chem. Mater., 2023, 35(3): 900.
[64]
Masegosa A R, Cabañas R, Langseth H, Nielsen T D, Salmerón A. Entropy, 2021, 23(1): 117.
[65]
Selvaraju R R, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Int. J. Comput. Vis., 2020, 128(2): 336.
[66]
Yeh C, Chen Y D, Wu A Y, Chen C, Viégas F, Wattenberg M. IEEE Trans. Visual. Comput. Graphics, 2024, 30(1): 262.
[67]
Hussain I, Jany R, Boyer R, Azad A, Alyami S A, Park S J, Hasan M M, Hossain M A. Sensors, 2023, 23(17): 7452.
[68]
Rajpoot R, Gour M, Jain S, Semwal V B. Sci. Rep., 2024, 14: 24985.
[69]
Linardatos P, Papastefanopoulos V, Kotsiantis S. Entropy, 2021, 23(1): 18.
[70]
Samek W, Montavon G, Vedaldi A, Hansen L K, Müller K. Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer Cham, 2019. 5.
[71]
Kang Y, Park H, Smit B, Kim J. Nat. Mach. Intell., 2023, 5(3): 309.
[72]
Zhang W, Wang Q G, Kong X T, Xiong J C, Ni S K, Cao D H, Niu B Y, Chen M G, Li Y M, Zhang R Z, Wang Y T, Zhang L H, Li X T, Xiong Z P, Shi Q, Huang Z M, Fu Z Y, Zheng M Y. Chem. Sci., 2024, 15(27): 10600.
[73]
Bai X F, Xie Y B, Zhang X, Han H G, Li J R. J. Chem. Inf. Model., 2024, 64(13): 4958.
[74]
Kang Y, Lee W, Bae T, Han S, Jang H, Kim J. J. Am. Chem. Soc., 2025, 147(5): 3943.
[75]
Wiśniewska P, Haponiuk J, Saeb M R, Rabiee N, Bencherif S A. Chem. Eng. J., 2023, 471: 144400.
[76]
Tang M Y, Guan Q, Fang Y L, Wu X, Zhang J J, Xie H, Yu X, Ou R W. Sep. Purif. Technol., 2024, 342: 127059.
[77]
Ettlinger R, Lächelt U, Gref R, Horcajada P, Lammers T, Serre C, Couvreur P, Morris R E, Wuttke S. Chem. Soc. Rev., 2022, 51(2): 464.
[78]
He Y, Liu G, Li C, Yan X. Rev. Environ. Contam. Toxicol., 2022, 260(1): 21.

Funding

the National Natural Science Foundation of China(22476056)
the National Natural Science Foundation of China(22106025)
PDF(3368 KB)

Accesses

Citation

Detail

Sections
Recommended

/