PreciseCam: Precise Camera Control for Text-to-Image Generation

Bernal-Berdun, Edurne; Gadelha, Matheus; Hold-Geoffroy, Yannick; Gutierrez, Diego; Sun, Xin; Masia, Belen; Serrano, Ana

doi:10.1109/CVPR52734.2025.00260

PreciseCam: Precise Camera Control for Text-to-Image Generation

Bernal-Berdun, Edurne (Universidad de Zaragoza) ; Serrano, Ana (Universidad de Zaragoza) ; Masia, Belen (Universidad de Zaragoza) ; Gadelha, Matheus ; Hold-Geoffroy, Yannick ; Sun, Xin ; Gutierrez, Diego (Universidad de Zaragoza)

Resumen: Images as an artistic medium often rely on specific camera angles and lens distortions to convey ideas or emotions; however, such precise control is missing in current text-to-image models. We propose an efficient and general solution that allows precise control over the camera when generating both photographic and artistic images. Unlike prior methods that rely on predefined shots, we rely solely on four simple extrinsic and intrinsic camera parameters, removing the need for pre-existing geometry, reference 3D objects, and multi-view data. We also present a novel dataset with more than 57,000 images, along with their text prompts and ground-truth camera parameters. Our evaluation shows precise camera control in text-to-image generation, surpassing traditional prompt engineering approaches.
Idioma: Inglés
DOI: 10.1109/CVPR52734.2025.00260
Año: 2025
Publicado en: Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2025 (2025), 2724-2733
ISSN: 1063-6919
Financiación: info:eu-repo/grantAgreement/ES/DGA/T25-24
Financiación: info:eu-repo/grantAgreement/ES/MICIU/PID2022-141766OB-I00
Tipo y forma: Article (PostPrint)
Área (Departamento): Área Lenguajes y Sistemas Inf. (Dpto. Informát.Ingenie.Sistms.)
Fecha de embargo : 2026-08-13
Exportado de SIDERAL (2025-11-07-10:25:42)

Permalink:

Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
articulos > articulos-por-area > lenguajes_y_sistemas_informaticos

Retour à la recherche

Notice créée le 2025-11-07, modifiée le 2025-11-07

Postprint:
PDF

Évaluer ce document:

(Pas encore évalué)

Ajouter au panier personnel
Exporter vers BibTeX, MARC, MARCXML, DC, EndNote, NLM, RefWorks

Atlantis Institut des Sciences Fictives

PreciseCam: Precise Camera Control for Text-to-Image Generation