Skip to content

Latest commit

 

History

History
57 lines (57 loc) · 2.32 KB

2021-07-01-akbari21a.md

File metadata and controls

57 lines (57 loc) · 2.32 KB
title abstract layout series publisher issn id month tex_title firstpage lastpage page order cycles bibtex_author author date address container-title volume genre issued pdf extras
How Does Loss Function Affect Generalization Performance of Deep Learning? Application to Human Age Estimation
Good generalization performance across a wide variety of domains caused by many external and internal factors is the fundamental goal of any machine learning algorithm. This paper theoretically proves that the choice of loss function matters for improving the generalization performance of deep learning-based systems. By deriving the generalization error bound for deep neural models trained by stochastic gradient descent, we pinpoint the characteristics of the loss function that is linked to the generalization error and can therefore be used for guiding the loss function selection process. In summary, our main statement in this paper is: choose a stable loss function, generalize better. Focusing on human age estimation from the face which is a challenging topic in computer vision, we then propose a novel loss function for this learning problem. We theoretically prove that the proposed loss function achieves stronger stability, and consequently a tighter generalization error bound, compared to the other common loss functions for this problem. We have supported our findings theoretically, and demonstrated the merits of the guidance process experimentally, achieving significant improvements.
inproceedings
Proceedings of Machine Learning Research
PMLR
2640-3498
akbari21a
0
How Does Loss Function Affect Generalization Performance of Deep Learning? Application to Human Age Estimation
141
151
141-151
141
false
Akbari, Ali and Awais, Muhammad and Bashar, Manijeh and Kittler, Josef
given family
Ali
Akbari
given family
Muhammad
Awais
given family
Manijeh
Bashar
given family
Josef
Kittler
2021-07-01
Proceedings of the 38th International Conference on Machine Learning
139
inproceedings
date-parts
2021
7
1