Unified Regularity Measures for Sample-wise Learning and Generalization

Bibliographic Details
Title: Unified Regularity Measures for Sample-wise Learning and Generalization
Authors: Zhang, Chi, Ma, Xiaoning, Liu, Yu, Wang, Le, Su, Yuanqi, Liu, Yuehu
Publication Year: 2021
Collection: Computer Science
Subject Terms: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
More Details: Fundamental machine learning theory shows that different samples contribute unequally both in learning and testing processes. Contemporary studies on DNN imply that such sample difference is rooted on the distribution of intrinsic pattern information, namely sample regularity. Motivated by the recent discovery on network memorization and generalization, we proposed a pair of sample regularity measures for both processes with a formulation-consistent representation. Specifically, cumulative binary training/generalizing loss (CBTL/CBGL), the cumulative number of correct classiffcations of the training/testing sample within training stage, is proposed to quantize the stability in memorization-generalization process; while forgetting/mal-generalizing events, i.e., the mis-classification of previously learned or generalized sample, are utilized to represent the uncertainty of sample regularity with respect to optimization dynamics. Experiments validated the effectiveness and robustness of the proposed approaches for mini-batch SGD optimization. Further applications on training/testing sample selection show the proposed measures sharing the unified computing procedure could benefit for both tasks.
Comment: 20 pages, 13 figures, 3 tables
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2108.03913
Accession Number: edsarx.2108.03913
Database: arXiv
More Details
Description not available.