Meta-Learning Loss Functions for Deep Neural Networks
Humans can often quickly and efficiently solve complex new learning tasks given only a small set of examples. In contrast, modern artificially intelligent systems often require thousands or millions of observations to solve even the most basic tasks. Meta-learning aims to resolve this issue by leveraging past experiences from similar learning tasks to embed the appropriate inductive biases into the learning system. Historically, methods for meta-learning components of the learning system, such as the optimizer and the parameter initialization, have led to significant performance gains. This thesis explores meta-learning through an often-overlooked component of the learning system: the loss function. The loss function is a vital component of a learning system, as it represents the primary learning objective; success is determined and quantified by the system's ability to optimize that objective.

In this thesis, we develop methods for meta-learning the loss functions of deep neural networks. In particular, we first introduce a method for meta-learning symbolic model-agnostic loss functions, called Evolved Model-Agnostic Loss (EvoMAL). This method consolidates recent advancements in loss function learning and enables the discovery of interpretable loss functions on commodity hardware. Through empirical and theoretical analysis, we uncover patterns in the learned loss functions, which subsequently inspire the development of Sparse Label Smoothing Regularization (SparseLSR), a significantly faster and more memory-efficient way to perform label smoothing regularization. Second, we challenge the conventional notion that a loss function must be a static function by developing Adaptive Loss Function Learning (AdaLFL), a method for meta-learning adaptive loss functions. Lastly, we develop Neural Procedural Bias Meta-Learning (NPBML), a task-adaptive few-shot learning method that simultaneously meta-learns the parameter initialization, optimizer, and loss function.
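
For reference, the following is a minimal PyTorch sketch of standard label smoothing regularization (Szegedy et al., 2016), the technique that SparseLSR accelerates. The function name and implementation here are purely illustrative assumptions for exposition; this is the conventional dense formulation, not the thesis's SparseLSR method.

    import torch
    import torch.nn.functional as F

    def label_smoothing_cross_entropy(logits, targets, alpha=0.1):
        # Smooth each one-hot target toward the uniform distribution:
        # y_smooth = (1 - alpha) * y_onehot + alpha / K  (Szegedy et al., 2016).
        num_classes = logits.size(-1)
        log_probs = F.log_softmax(logits, dim=-1)
        # Materializing the dense smoothed targets costs O(K) memory per
        # example -- the kind of overhead a sparse formulation can avoid.
        smooth = torch.full_like(log_probs, alpha / num_classes)
        smooth.scatter_(-1, targets.unsqueeze(-1), 1.0 - alpha + alpha / num_classes)
        # Cross-entropy between the smoothed targets and the predictions.
        return -(smooth * log_probs).sum(dim=-1).mean()

    # Example usage on a random batch of 4 examples over 10 classes.
    logits = torch.randn(4, 10)
    targets = torch.randint(0, 10, (4,))
    loss = label_smoothing_cross_entropy(logits, targets)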