Research by Will Rowan: video foundation models, 3D face reconstruction, virtual production, and generative AI. Publications at ICLR and ECCV. PhD University of York.
My current research focuses on utilising and adapting video foundation models and world models for the creative industries. At PXLD, the University of York spinout I co-founded, I work on making these powerful systems practical for filmmakers: automating VFX post-production pipelines, real-time scene relighting, and compositing tools that fit into existing production workflows. The goal is to democratise Hollywood-grade tools for productions of any budget.
Previously, I was a Research Fellow at the University of York on a £1.8M Innovate UK project with dock10 (the UK’s largest television studio facility, based at MediaCityUK) and 2LE Media, developing 3D reconstruction techniques for real-time virtual production (2024–2025). I also secured a £20,000 grant from the AI SuperConnector startup accelerator at Imperial College London, and organised the workshop “Capturing Reality and Changing It” at the 2024 Locarno Film Festival.
PhD Research
My PhD thesis, “Towards an Intelligent Agent for the Human Face”, focused on building intelligent systems for the human face: methods that take any input (text, a photo, a partial 3D scan) and produce an accurate, editable 3D model. This included new approaches to 3D face reconstruction, text-driven face generation (Text2Face, the first method of its kind), and evaluation metrics for the field. I was supervised by Dr Patrik Huber, Prof Nick Pears, and Prof Andrew Keeling at the University of York.
Publications
2025
Neuralatex: The World’s First Machine Learning Library Written Entirely in LaTeX
We present Neuralatex, the first machine learning library written entirely in LaTeX. When compiled, it generates training data, trains neural networks, runs experiments, and produces figures, all within the LaTeX compilation process. Includes implementations of MLPs, CNNs, ResNets, Transformers, and MiniGPT.
@article{rowan2025neuralatex,title={Neuralatex: The World's First Machine Learning Library Written Entirely in LaTeX},author={Rowan, Will},journal={arXiv preprint arXiv:2503.24187},year={2025},}
2024
N Heads Are Better Than One: Exploring Theoretical Performance Bounds of 3D Face Reconstruction Methods
We explore upper bounds on 3D face reconstruction accuracy using multiple views and establish benchmarks for single-view methods.
@inproceedings{rowan2024nheads,title={N Heads Are Better Than One: Exploring Theoretical Performance Bounds of 3D Face Reconstruction Methods},author={Rowan, Will and Huber, Patrik and Pears, Nick and Keeling, Andrew},booktitle={ECCV Workshop on Foundation Models for 3D Humans},year={2024},}
How Many OptiFaces? A New Evaluation Metric for 3D Face Reconstruction
We propose OptiFaces, a new evaluation metric that addresses limitations of existing 3D face reconstruction benchmarks by measuring how many optimised face shapes are needed to represent a dataset.
@inproceedings{rowan2024optifaces,title={How Many OptiFaces? A New Evaluation Metric for 3D Face Reconstruction},author={Rowan, Will and Huber, Patrik and Pears, Nick and Keeling, Andrew},booktitle={International Conference on Learning Representations (ICLR)},year={2024},}
ECCV-W
San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows
Nicolo Di Domenico, Guido Borghi, Annalisa Franco, and 2 more authors
In ECCV Workshop on Artificial Intelligence for Digital Humanities, 2024
@inproceedings{sanvitale2024,title={San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows},author={Di Domenico, Nicolo and Borghi, Guido and Franco, Annalisa and Rowan, Will and others},booktitle={ECCV Workshop on Artificial Intelligence for Digital Humanities},year={2024},}
We present Text2Face, the first method for generating 3D face shape from text descriptions using a 3D Morphable Model.
@inproceedings{rowan2023text2face,title={Text2Face: 3D Morphable Faces from Text},author={Rowan, Will and Huber, Patrik and Pears, Nick and Keeling, Andrew},booktitle={International Conference on Learning Representations (ICLR)},doi={10.48550/arXiv.2303.02688},year={2023},}
2022
The Effectiveness of Temporal Dependency in Deepfake Video Detection
@article{rowan2022deepfake,title={The Effectiveness of Temporal Dependency in Deepfake Video Detection},author={Rowan, Will and Pears, Nick},journal={arXiv preprint arXiv:2205.06684},year={2022}}
Awards & Recognition
Best Presentation, International Computer Vision Summer School (ICVSS), Sicily, 2025
Best Poster, Foundation Models for 3D Humans Workshop, European Conference on Computer Vision (ECCV), Milan, 2024
Best Presentation Runner-Up, Reproducibility in Computer Vision Workshop, CVPR, 2023
EPSRC Doctoral Scholarship, 2020–2024
Highest MSc grade on record, Advanced Computer Science, University of York (84%, Distinction)
Best Dissertation in Cohort (1 of 150), BEng Computer Science, University of York, 2019
IBM Entrance Scholarship (3 of 150), University of York, 2016