A Nonlinear Multi‐Objective Prediction Strategy for Small‐Sample Datasets in Homogeneous Catalysis.

Saved in:
Bibliographic Details
Title: A Nonlinear Multi‐Objective Prediction Strategy for Small‐Sample Datasets in Homogeneous Catalysis.
Authors: Liu, Yining1 (AUTHOR), Wang, Shen1,2 (AUTHOR), Li, Yang1 (AUTHOR) chyangli@dlut.edu.cn, Bao, Ming1 (AUTHOR) mingbao@dlut.edu.cn
Source: Journal of Computational Chemistry. 5/30/2026, Vol. 47 Issue 14, p1-19. 19p.
Subjects: Homogeneous catalysis, Multi-objective optimization, Chemical models, Catalysts, Machine learning, Nonlinear estimation
Abstract: The development of homogeneous catalytic reactions is hindered by the resource‐intensive nature of traditional experimental and computational methods, particularly when dealing with small, sparse datasets and complex multi‐objective optimization. While machine learning (ML) offers a promising alternative, prevailing methods rely on large‐scale datasets and often fail to address the challenges of small‐sample datasets, high‐dimensional descriptor spaces, and strong nonlinear relationships inherent in catalytic processes. To address this issue, we present a nonlinear multi‐objective ML workflow, PSO_CRP, for predicting reaction categories (e.g., conversion or yield) and quantitative outcomes (e.g., enantioselectivity or site selectivity), as well as the interpretability analysis. Experimental results on 4 small‐sample datasets demonstrate that our models, relying only on simple RDKit‐derived molecular parameters rather than costly DFT calculations, achieve consistently higher predictive accuracy than at least five common ML models. Model‐based permutation feature importance (PFI) and partial dependence plot (PDP) analyses identified the key molecular descriptors governing reaction outcomes. They quantified their contributions, producing results aligned with previous studies while providing added mechanistic insight to enhance model interpretability and guide rational catalyst design. As a high‐precision, low‐cost, and interpretable framework, this PSO‐based workflow offers valuable insights for forward prediction in homogeneous catalytic systems. [ABSTRACT FROM AUTHOR]
Copyright of Journal of Computational Chemistry is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Description
Abstract:The development of homogeneous catalytic reactions is hindered by the resource‐intensive nature of traditional experimental and computational methods, particularly when dealing with small, sparse datasets and complex multi‐objective optimization. While machine learning (ML) offers a promising alternative, prevailing methods rely on large‐scale datasets and often fail to address the challenges of small‐sample datasets, high‐dimensional descriptor spaces, and strong nonlinear relationships inherent in catalytic processes. To address this issue, we present a nonlinear multi‐objective ML workflow, PSO_CRP, for predicting reaction categories (e.g., conversion or yield) and quantitative outcomes (e.g., enantioselectivity or site selectivity), as well as the interpretability analysis. Experimental results on 4 small‐sample datasets demonstrate that our models, relying only on simple RDKit‐derived molecular parameters rather than costly DFT calculations, achieve consistently higher predictive accuracy than at least five common ML models. Model‐based permutation feature importance (PFI) and partial dependence plot (PDP) analyses identified the key molecular descriptors governing reaction outcomes. They quantified their contributions, producing results aligned with previous studies while providing added mechanistic insight to enhance model interpretability and guide rational catalyst design. As a high‐precision, low‐cost, and interpretable framework, this PSO‐based workflow offers valuable insights for forward prediction in homogeneous catalytic systems. [ABSTRACT FROM AUTHOR]
ISSN:01928651
DOI:10.1002/jcc.70407