000146047 001__ 146047
000146047 005__ 20241122150934.0
000146047 037__ $$aTAZ-TFM-2023-214
000146047 041__ $$aeng
000146047 1001_ $$aPlou Izquierdo, Carlos
000146047 24200 $$aAnalysis of linearization methods for Bayesian Deep Learning.
000146047 24500 $$aAnalysis of linearization methods for Bayesian Deep Learning.
000146047 260__ $$aZaragoza$$bUniversidad de Zaragoza$$c2023
000146047 506__ $$aby-nc-sa$$bCreative Commons$$c3.0$$uhttp://creativecommons.org/licenses/by-nc-sa/3.0/
000146047 520__ $$aDeep learning has revolutionized the field of machine learning, outperforming traditional methods in a wide range of areas such as computer vision, robotics and natural language processing. However, deep learning models are often treated as black boxes, providing high accuracy but little insight into how the model arrived at its predictions. This lack of transparency can be a major drawback in applications where the ability to understand and trust the model’s predictions is crucial. For instance, in tasks such as medical diagnosis or autonomous driving, achieving high accuracy is as important as obtaining accurate uncertainty estimates. In these scenarios, a well-calibrated uncertainty estimate can provide valuable information for decision-making and can prevent risky situations. Bayesian deep learning, and specifically Bayesian neural networks, addresses the challenge of estimating uncertainty by assuming that model parameters and predictions follow probability distributions instead of taking deterministic values. This enables the use of Bayesian inference, a statistical technique based on Bayes’ rule that updates the distributions of interest as new observations arrive. However, performing Bayesian inference in deep learning algorithms is difficult due to the complexity and dimensionality of the models and the large amounts of data they are trained on. This has led to the development of approximate Bayesian inference methods, such as deep ensembles and Monte Carlo dropout, which provide approximate solutions to the problem of uncertainty estimation in deep learning. Even so, these methods do not fully solve the overconfidence problem in deep learning models. This has motivated, in recent years, a renewed interest in a statistical technique called the Laplace approximation, introduced in 1992 by David MacKay. The focus of this Master’s thesis is to study the opportunities of the Laplace approximation in Bayesian deep learning. We conduct a thorough investigation and analysis of this powerful technique, which allows us to tackle the overconfidence problem. Additionally, we evaluate its performance on benchmark datasets by comparing it with the most commonly used uncertainty estimation methods, such as deep ensembles and Monte Carlo dropout. We demonstrate the effectiveness of the Laplace approximation in providing accurate and reliable uncertainty estimates for deep learning models. Specifically, our experiments show that the Laplace approximation outperforms the other techniques in estimating uncertainty in out-of-distribution regions. As an example of a real application, this work explores the use of the Laplace approximation in the field of reinforcement learning (RL). While the Laplace approximation has been widely used in other fields, its potential benefits in RL have yet to be fully explored. Exploration is a crucial problem in RL: the agent must explore the environment to maximize its knowledge and, consequently, its ability to solve different tasks. Nevertheless, efficient exploration in high-dimensional environments remains an unsolved challenge. A promising approach is to use the degree of novelty as a reward signal. To this end, we propose integrating the Laplace approximation into a model-based RL algorithm that performs active exploration by using the estimated uncertainty as a novelty measure. Our experiments show that this approach outperforms the baselines.
000146047 521__ $$aMáster Universitario en Robótica, Gráficos y Visión por Computador
000146047 540__ $$aRights governed by a Creative Commons license
000146047 700__ $$aMartínez Cantín, Rubén$$edir.
000146047 700__ $$aMurillo Arnal, Ana Cristina$$edir.
000146047 7102_ $$aUniversidad de Zaragoza$$bInformática e Ingeniería de Sistemas$$cIngeniería de Sistemas y Automática
000146047 8560_ $$f756111@unizar.es
000146047 8564_ $$s6094622$$uhttps://zaguan.unizar.es/record/146047/files/TAZ-TFM-2023-214.pdf$$yMemoria (eng)
000146047 909CO $$ooai:zaguan.unizar.es:146047$$pdriver$$ptrabajos-fin-master
000146047 951__ $$adeposita:2024-11-22
000146047 980__ $$aTAZ$$bTFM$$cEINA
000146047 999__ $$a20230604142747.CREATION_DATE