Abstract
We propose a constrained linear data-feature-mapping model as an interpretable mathematical model for image classification using a convolutional neural network (CNN). From this viewpoint, we establish detailed connections between the traditional iterative schemes for linear systems and the architectures of the basic blocks of ResNet- and MgNet-type models. Using these connections, we present some modified ResNet models that, compared with the original models, have fewer parameters but can produce more accurate results, thereby demonstrating the validity of this constrained learning data-feature-mapping assumption. Based on this assumption, we further propose a general data-feature iterative scheme to demonstrate the rationality of MgNet. We also provide a systematic numerical study on MgNet to show its success and advantages in image classification problems, particularly in comparison with established networks.
Original language | English (US) |
---|---|
Pages (from-to) | 384-392 |
Number of pages | 9 |
Journal | Neural networks : the official journal of the International Neural Network Society |
Volume | 162 |
DOIs | |
State | Published - Mar 20 2023 |
Bibliographical note
KAUST Repository Item: Exported on 2023-03-27Acknowledgements: This work was partially supported by the Center for Computational Mathematics and Applications (CCMA) at The Pennsylvania State University, the Verne M. William Professorship Fund from The Pennsylvania State University, and the National Science Foundation, United States (Grant No. DMS-1819157 and DMS-2111387). In addition, the first and second authors are partially supported by the KAUST Baseline Research Fund, the third author is supported by Hetao Shenzhen-Hong Kong Science and Technology Innovation Cooperation Zone Project (No. HZOSWS-KCCYB-2022046), and the fourth author is supported in part by Beijing Natural Science Foundation Project (No. Z200002). The authors thank Huang Huang for his help with partial numerical experiments.
ASJC Scopus subject areas
- Artificial Intelligence
- Cognitive Neuroscience