Resumen: This paper proposes a novel investment strategy based on deep reinforcement learning (DRL) for long-term portfolio allocation in the presence of transaction costs and risk aversion. We design an advanced portfolio policy framework to model the price dynamic patterns using convolutional neural networks (CNN), capture group-wise asset dependence using WaveNet, and solve the optimal asset allocation problem using DRL. These methods are embedded within a multi-period Bellman equation framework. An additional appealing feature of our investment strategy is its ability to optimize dynamically over a large set of potentially correlated risky assets. The performance of this portfolio is tested empirically over different holding periods, risk aversion levels, transaction cost rates, and financial indices. The results demonstrate the effectiveness and superiority of the proposed long-term portfolio allocation strategy compared to several competitors based on machine learning methods and traditional optimization techniques. Idioma: Inglés DOI: 10.1016/j.iref.2025.103996 Año: 2025 Publicado en: INTERNATIONAL REVIEW OF ECONOMICS & FINANCE 98 (2025), 103996 [18 pp.] ISSN: 1059-0560 Financiación: info:eu-repo/grantAgreement/ES/AEI/PID2023-147798NB-I00 Tipo y forma: Artículo (Versión definitiva) Área (Departamento): Área Fund. Análisis Económico (Dpto. Análisis Económico)