Project descriptions

The listed projects represent the projects that are available for the 2025 cohort. In year 1, students will undertake a group rotation project in 3 of the listed projects, before selecting a single project as their PhD topic.


What do wild insects do? Leveraging AI to describe wild insect behaviour and to uncover their functions for fitness

Primary supervisor: Dr. Jelle Boonekamp

Project summary

The study of behaviour is key to our understanding of the evolutionary ecology of natural animal populations. Whilst there is a vast literature on the behavioural ecology of wild vertebrates, the behaviour of insects in nature remains elusive. Our project (wildcrickets.org) has amassed 18 years of 24/7 video recordings of the entire adult lives of a wild population of crickets in a meadow in Spain. Every individual is tagged, and 140 networked video cameras have tracked their movements around the meadow, their matings, fights, escape from predators and every other facet of their lives. For 10 years we also have direct measures of reproductive success through DNA parentage analyses. The advent of machine learning and AI finally gives us the opportunity to leverage this extraordinary resource beyond the simple metrics we have extracted by watching video manually. Our aim is to develop an automated video analysis pipeline that will allow us (and those collecting similar data in the future) to critically further our understanding of the behavioural ecology of wild insects: Do individuals have a chronotype? Is there evidence of senescence in individual activity? Do individuals dynamically respond to predation risk? Are there trade-offs between male signalling effort and other aspects of reproductive investment? What is the function of these behaviours in terms of fitness? The list is endless…

Text mining with large language models to enhance infectious disease ecology

Primary supervisor: Dr. Maxwell Farrell

Project summary

Understanding factors that influence host specificity, sites of infection, and disease severity is crucial for developing effective strategies to monitor, prevent, control, and treat cross-species infectious diseases. Large-scale databases of host-pathogen interactions underlie models that predict future emerging infectious disease threats. These databases are often built through manual curation of scientific literature, but as the number of academic papers grows exponentially, scientists need new approaches for extracting and curating data contained in scientific texts. Deep learning-based Large Language Models (LLMs) have revolutionized information extraction in the biomedical sciences, but are rarely used in ecology. This project will use Natural Language Processing (NLP), LLMs, and graph neural networks (GNNs) to build comprehensive knowledge graphs for cross-species infectious diseases, including host species, infected tissues, disease pathologies, pathogen traits, and diagnostic methods. These data will be used to identify the ecological and evolutionary drivers of disease severity, and build predictive models to identify future infectious disease risks. Further, by developing a generalised approach to generate knowledge graphs using LLMs, this will improve the efficiency of large-scale literature syntheses with broad applications across ecology, environmental science, and biomedicine.

Using animal movement data to understand and predict human-wildlife conflicts

Primary supervisor: Prof. Juan Morales

Project summary

Human-wildlife conflict (HWC) is a growing concern globally, often resulting in negative consequences for both human communities and wildlife populations. Understanding the spatial and temporal dynamics of animal movements can provide critical insights into the causes and potential mitigation strategies for HWC. This research proposes to utilize animal movement and behavioural data to analyse patterns that lead to conflicts, aiming to develop data-driven approaches to mitigate these issues. We will use existing movement data from different species known to be involved in conflicts and use movement analysis tools to come up with predictive models for areas and times of conflict. In particular, we will evaluate whether movement and space use history can make animals prone to cause conflict with humans. Furthermore, we will explore individual variability in movement strategies (possibly linked to personalities) and how they translate to increased or decreased chances of conflict. The research will also explore potential mitigation strategies, and evaluate the pros and cons of the use of barriers such as fences.


Collective animal movement and resource selection in changing environments

Primary supervisor: Dr. Mu Niu

Project summary

Advances in technologies such as GPS have revolutionized the tracking of wildlife, providing detailed data on how animals move and interact. It is now feasible to track multiple animals simultaneously, at high frequency and for long periods. This project explores the movement of animals both individually and in groups, focusing on how environmental factors and resource availability shape their behaviours. Animals in groups often move interdependently, influenced by interactions between individuals. However, traditional movement models primarily address individual animals and ignore these group dynamics. On the other hand, collective movement models are often parameterized with short-term data. This research will develop innovative statistical models to better understand how animals move collectively. Such movement necessarily involves each individual responding to their physical environment, as well as other group members, and a key aspect of this project is understanding how animals use space and resources within the group setting. Incorporating both these aspects of short-term movement decisions and long-term space use in a coherent mathematical model will illuminate how animals collectively adapt to their surroundings. It will use cutting-edge statistical and machine learning methods, such as diffusion models. The findings and methodology developed will provide valuable insights into animal behaviour and ecology, supporting conservation efforts and helping manage human impacts on wildlife.

Extreme value theory for predicting animal dispersal and movement in a changing climate

Primary supervisor: Dr. Jafet Belmont Osuna

Project summary

There is an imperative need to understand and predict how populations respond to multiple aspects of global change, such as habitat fragmentation and climate change. Extreme weather events, which are expected to increase in both frequency and intensity, can profoundly impact animal movement and spatial dynamics. Additionally, for many species, rare long-distance dispersal events play a crucial role in reaching suitable habitats for germination, establishment, and colonisation across fragmented or managed landscapes. Many plant species, for instance, rely on birds for dispersal—birds ingest fruits and later deposit seeds through defecation or regurgitation. Accurately predicting such processes requires models that capture both seed retention times within birds and bird movement patterns. This project aims to develop and apply cutting-edge statistical methods for analysing animal movement and dispersal data using Extreme-Value Theory (EVT) within a Bayesian framework. EVT, a well-established theoretical framework that has been widely used in environmental sciences for modelling extreme events, has seen limited application in ecology. We will leverage EVT to (1) understand how extreme weather events can affect animal movement, and (2) to make better predictions of dispersal processes. This work offers substantial potential for novel insights and methodological advancements. By integrating experimental research and state-of-the-art tracking technologies, the project will inform the development of hierarchical Bayesian models to explore patterns and drivers of animal movement and dispersal, with a particular focus on extreme behaviours and their ecological implications.

Leveraging large language models to provide insights into global plant biodiversity

Primary supervisor: Prof. Richard Reeve

Project summary

Plants are fundamental to the provision of ecosystem services, and we are wholly dependent on them for survival. Yet, globally, many plant species are under threat of extinction. We need a comprehensive plant trait dataset as input to the next generation of biodiversity-climate models. The lack of such a dataset means that existing approaches focus on limited "Plant Functional Types" and cannot estimate the impacts of climate and land use change on individual species or help inform decision making on mitigating biodiversity loss. The needed plant trait data, from niche preferences to growth rates, are locked in the text of the vast botanical literature of the Biodiversity Heritage Library and other texts available to the Natural History Museum. This studentship would use the recent advances in large language models (LLMs) and natural language processing (NLP) to extract this information. We have developed an ecosystem modelling tool (EcoSISTEM, Harris et al., 2023, https://github.com/EcoJulia/EcoSISTEM.jl) that captures survival, competition and reproduction among multiple plant species across a landscape. LLMs will enable extraction of traits data for integration into the EcoSISTEM infrastructure and enable the inclusion of multilingual records, expanding the system's geographic and historical range. By addressing these enormous data gaps, the student will then explore global spatial and temporal variability in functional and other trait-based diversity measures to produce a unique and comprehensive evaluation of whether predictors exist of diversity at a global scale. Ultimately, the project will boost EcoSISTEM's ability to simulate plant responses to climate change with greater accuracy. Please do contact us if you’d like to know more – Richard Reeve (Richard.Reeve@glasgow.ac.uk) in general and for further details on ecological modelling, Jake Lever (Jake.Lever@glasgow.ac.uk) on machine learning and Neil Brummitt (n.brummitt@nhm.ac.uk) on botany. Harris, C. L., et al. (2023). Dynamic virtual ecosystems as a tool for detecting large-scale responses of biodiversity to environmental and land-use change. https://arxiv.org/abs/1911.12257


Implications of mosquito community composition changes for vector control

Primary supervisor: Dr. Mafalda Viana

Project summary

Vector control remains one of the most effective tools against many vector-borne diseases worldwide. For its success, some species are significantly reduced or even locally eliminated. As a result, the balance of interspecific competition among species is expected to change, potentially leading to changes in the community composition. The problem for malaria control is that disease transmission is often maintained by a suite of mosquito species that may differ in their efficiency at transmitting disease and response to interventions. Thus changes in species relative abundance could have positive or negative effects on the overall vectorial capacity of the mosquito community. The overall objective of the project would be to understand how vector control impacts mosquito species community and its implications for vector-borne disease such as malaria. Specifically, through a combination of Bayesian spatial-temporal models fit to mosquito abundance records from the global database GBIF and meta-analysis the PhD will address: 1. Mosquito-mosquito interactions: How have mosquito communities changed? 2. Mosquito-disease: What are the implications of the community changes for disease transmission? 3. Mosquito-control: Are the changes in mosquito community associated with intervention? 4. Mosquito-control-environment: What is the role of environment for the impact of intervention might have on mosquito composition.

The impact of deep learning optimization and design choices for marine biodiversity monitoring

Primary supervisor: Dr. Tiffany Vlaar

Project summary

This project aims to increase the efficiency, accuracy, and reliability of annotation and classification of large marine datasets using deep learning. Timely and accurate analysis of these long-term datasets will aid marine biodiversity monitoring efforts. Design of more efficient strategies further aims to reduce the carbon footprint of training and fine-tuning large machine learning models. The project is expected to lead to various novel insights for the machine learning community such as on optimal pre-training choices for downstream robust performance, the optimal order of learning samples with varying complexity levels, navigating instances with label ground truth uncertainty, and re-evaluation of metric design. The PhD student will be supported in building international collaborations with researchers across different disciplines and in developing effective research communication skills.

Computationally efficient inference methods for integrated spatial capture-recapture and movement models

Primary supervisor: Dr. Zhang Wei

Project summary

Spatial capture-recapture (SCR) models are widely used in ecology to estimate population-level parameters, such as density and abundance. To make it conceptually and computationally easier to formulate and fit SCR models, one often ignores or simplifies the dynamic nature (i.e. movement) of activity centres of individuals over the period of study. Integrating movement into SCR models has been a promising research area for years, but progress so far has been limited. This PhD project aims to develop and validate SCR models that integrate more realistic movement processes, and computationally efficient methods and software packages to fit such models. First, this project aims to evaluate optimal survey designs for SCR models with movement, including detector placement, sampling intervals, and telemetry integration, to balance accuracy, effort, and cost for both closed and open populations. Second, it will develop SCR models that incorporate multi-behavioural movement (e.g., foraging, migrating) and long-term activity centre dynamics, while accounting for open-population processes such as recruitment and emigration. Third, it will develop computational tools such as tailored MCMC methods, sparse matrices, and a combination of numerical and approximate integration techniques to enable scalable inference for high-dimensional complex models. These innovations will be validated through thorough simulation studies and applied to case studies. The outputs of this project will benefit ecological practitioners, conservation management, and wildlife monitoring, providing useful tools and methodologies to improve population estimation, habitat connectivity analysis, and the design of efficient survey strategies for diverse species and ecosystems.