PreprintPDF Available

Categorical Longevity: Category Theory & Computational Biology Well-being

Authors:
  • DIYgenomics
Preprints and early-stage research may not have been peer reviewed yet.

Abstract

Categorical Longevity is introduced as a computational systems biology approach which applies category theory to unify and formalize the activity, metrics, and interventions of biological systems in addressing the accelerated multisystem decline characteristic of human aging. By providing a comprehensive and computation-ready framework, this method integrates diverse mathematical models and data representations, enabling a deeper understanding of biological dynamics across scales. While advancements such as hallmarks of aging , biomarkers, epigenetic clocks, and organ-specific aging metrics have proliferated, the absence of a standardized framework limits their joint analysis and practical application. Categorical Longevity addresses this gap, offering tools for multiscalar analysis, robust dynamical modeling, and the integration of computational technologies such as blockchain to facilitate scalable, secure population-level deployment. Category Theory and Longevity A modern scientific trend is formalization, meaning the extensive use of mathematical models, algorithmic analysis, and computational instantiation. This is seen in deep learning as a foundational technology used in many fields, with the concept technology connoting active processes involving compute, algorithms, and data. Category theory (Eilen-berg and MacLane 1945) provides a universal language for formalizing the modeling of physical, chemical, and biological systems in a computation-amenable way (Spivak 2014). In biology, for example, categorical genomics attempts to organize vast data volumes into a coherent whole. One project identifies Dist (distance) as a category for genes, a structure unifying the descriptive gene-related sub-categories of open Petri nets (gene regulatory networks), preorder (gene orders), operad (gene trees), and olog (gene ontology) (Wu 2023). Another project specifies a genetic algebra with the categories Set and Semimodules to study genotype, phenotype , and haplotype groups (Tuyeras 2018). Longevity entails potentially curing the diseases of aging. Aging is the primary driver of many leading causes of death,
Categorical Longevity: Category Theory & Computational Biology Well-being
Melanie Swan1, Takashi Kido2, Renato P. dos Santos3
1University College London
2Teikyo University
3academicum.ai
melanie@DIYgenomics.org, kido.takashi@gmail.com, info@academicum.ai
Abstract
Categorical Longevity is introduced as a computational sys-
tems biology approach which applies category theory to unify
and formalize the activity, metrics, and interventions of bio-
logical systems in addressing the accelerated multisystem de-
cline characteristic of human aging. By providing a compre-
hensive and computation-ready framework, this method inte-
grates diverse mathematical models and data representations,
enabling a deeper understanding of biological dynamics
across scales. While advancements such as hallmarks of ag-
ing, biomarkers, epigenetic clocks, and organ-specific aging
metrics have proliferated, the absence of a standardized
framework limits their joint analysis and practical applica-
tion. Categorical Longevity addresses this gap, offering tools
for multiscalar analysis, robust dynamical modeling, and the
integration of computational technologies such as blockchain
to facilitate scalable, secure population-level deployment.
Category Theory and Longevity
A modern scientific trend is formalization, meaning the ex-
tensive use of mathematical models, algorithmic analysis,
and computational instantiation. This is seen in deep learn-
ing as a foundational technology used in many fields, with
the concept technology connoting active processes involv-
ing compute, algorithms, and data. Category theory (Eilen-
berg and MacLane 1945) provides a universal language for
formalizing the modeling of physical, chemical, and biolog-
ical systems in a computation-amenable way (Spivak 2014).
In biology, for example, categorical genomics attempts to
organize vast data volumes into a coherent whole. One pro-
ject identifies Dist (distance) as a category for genes, a struc-
ture unifying the descriptive gene-related sub-categories of
open Petri nets (gene regulatory networks), preorder (gene
orders), operad (gene trees), and olog (gene ontology) (Wu
2023). Another project specifies a genetic algebra with the
categories Set and Semimodules to study genotype, pheno-
type, and haplotype groups (Tuyeras 2018).
Longevity entails potentially curing the diseases of aging.
Aging is the primary driver of many leading causes of death,
Copyright © 2025, Association for the Advancement of Artificial Intelli-
gence (www.aaai.org). All rights reserved.
including type 2 diabetes, neurodegenerative disease, and
cardiovascular disease, as organisms experience accelerated
decline (entropy) with aging (Cipriano et al. 2024). In recent
years, quantization has been a primary trend in longevity.
This has been through the identification and validation of
biomarkers (Lyu et al. 2024), hallmarks (Lopez-Otin et al.
2024), data collection methods (Mengelkoch et al. 2024),
and intervention (Guarente et al. 2024). The implication of
measurement is having quantitative metrics for targeted in-
tervention. However, these quantitative measures have not
been assembled into an overarching formal structure.
Hence, Categorical Longevity is introduced as a research
program to consolidate the quantitative systems biology of
aging into a standardized approach. Implementation could
be by composing a category Longev (in the vein of Dynam,
a dynamical open system category (Baez and Pollard,
2020)), with four sub-categories for hallmarks, biomarkers,
data, and interventions. Each category has category objects
(e.g. the twelve hallmarks of aging), morphisms (mappings)
between them (e.g. DNA mutation and damage repair rates),
and functors (equations relating categories).
Categorical Longevity may be deployed at three levels.
First is in the use of category theory in mathematical models
used to describe underlying biology directly (e.g. Petri nets
representing epigenetic change (Bardini et al. 2016)). Sec-
ond is in category-theoretic approaches to the mathematical
analysis of biological data (e.g. topological data analysis of
cancer tumors with persistent homology (Stolz et al. 2024)).
Third is via category-theoretic approaches used in AI ma-
chine learning applications in biology such as drug design
(e.g. monad algebras in a 2-category of parametric maps in
categorical deep learning (Gavranovic et al. 2024)).
Categorical Longevity Toolkit
Category theory operates at the level of mathematical struc-
ture shared across objects, abstracting away from the detail
of content. The broader argument of Categorical Longevity
is the use of category theory as a toolkit for diagnosing the
kinds of formal problems at hand and analyzing them in an
overarching framework that connects to other problems. A
unified formal framework could be indispensable in ad-
dressing the complex systems biology of aging. Some of the
kinds of formal problems are dynamics, multiscalarity, con-
currency in distributed systems, and optimization.
Dynamics: Rate Laws
A broad trend in biological study is dynamics. A challenge
is that traditional models (e.g. Lotka-Volterra predator-prey)
have only basic dynamics. Hence, more sophisticated mod-
els are needed such as entropy-based measures that build to
an “effective temperature” of aging (Denisov et al. 2024).
Petri nets are a categorical method used to model biochem-
ical processes in which tokens accumulate at network nodes
to trigger an event. More robust dynamics could help (e.g.
to model DNA damage signaling (Gutowska et al. 2022)).
Multiscalarity: Duality
Multiscalar systems have different mathematics at different
scale tiers, as captured in the integrated analysis of hybrid
PET/MRI brain scans (Munn et al. 2024). The category-the-
oretic concept of duality allows one scale to be viewed from
another. For example, the Heisenberg and Schrödinger pic-
tures of quantum mechanics are brought into a unified pic-
ture with AQFT-FQFT (algebraic and functorial QFT)
(Schreiber 2008), applying to neural field theories of aging.
Research Agenda
Categorical Longevity research projects could include:
Algebraic topology analysis of genome TADs (topologi-
cally associated domains) in metabolic pathways
Enzyme kinetics (Michaelis-Menten) applied to Petri net
model of DNA mutation and damage repair rates
Knot theory hypergraph model of senescent cell clearance
2-Segal space simplicial sets for immune system analysis
AQFT-FQFT field theory of aging per amplitude metric
Feynman category fibration-foliation duality analysis of
accumulated damage of aging and inflammation profiles
Conclusion
Categorical Longevity is introduced as a research program
for the interconnected formal analysis of the systems biol-
ogy of aging. Category theory is growing in use in the com-
putational infrastructure (e.g. in smart network technologies
deep learning (endofunctor algebras), quantum computing
(categorical quantum circuits), and blockchain (categorical
cryptoeconomics)), readily extending to computational biol-
ogy. Blockchain is implicated for the implementation of
Categorical Longevity, practically due to its secure compu-
tational substrate, and theoretically due to its tokenized na-
ture facilitating the energy-entropy bookkeeping of longev-
ity solutions with causal histories in digital health twins.
Overall, Categorical Longevity provides a comprehensive
formal approach for global well-being that addresses both
the worldwide demographic shift in aging populations as
well as the on-boarding of new computing platforms such as
BCIs (brain-computer interfaces).
References
Baez, J. C. and Pollard, B. S. 2017. A Compositional Framework
for Reaction Networks. Rev Math Phy 29(9):1750028.
Bardini, R.; Benso, A.; Di Carlo, S.; et al. 2016. Using Nets-
Within-Nets for Modeling Differentiating Cells in the Epigenetic
Landscape. In: 4th International Conference on Bioinformatics and
Biomedical Engineering (IWBBIO), Granada, Spain, April 20-22,
2016. Pp. 315-321.
Cipriano, A.; Moqri, M.; Maybury-Lewis, S. Y.; et al. 2024. Mech-
anisms, pathways and strategies for rejuvenation through epige-
netic reprogramming. Nat Aging 4(1):14-26.
Denisov, K. A.; Gruber, J. and Fedichev, P. O. 2024. Discovery of
Thermodynamic Control Variables that Independently Regulate
Healthspan and Maximum Lifespan. bioRxiv:2024.12.01.626230.
Eilenberg, S. and MacLane, S. 1945. General Theory of Natural
Equivalences. Trans AMS 58(2):231–94.
Gavranovic, B.; Lessard, P.; Dudzik, A.; et al. 2024. Position: Cat-
egorical Deep Learning is an Algebraic Theory of All Architec-
tures. arXiv:2402.15332v2.
Guarente, L.; Sinclair, D. A. and Kroemer, G. 2024. Human trials
exploring anti-aging medicines. Cell Metabolism 36(2):P354-376.
Gutowska, K.; Kogut, D.; Kardynska, M.; et al. 2022. Petri nets
and ODEs as complementary methods for comprehensive analysis
on an example of the ATM–p53–NF-κB signaling pathways. Sci
Rep 12(1135).
Lopez-Otin, C.; Blasco, M. A.; Partridge, L.; et al. 2023. Hallmarks
of aging: An expanding universe. Cell 186(2):243-278.
Lyu, Y. X.; Fu, Q.; Wilczok, D.; et al. 2024. Longevity biotechnol-
ogy: bridging AI, biomarkers, geroscience and clinical applications
for healthy longevity. Aging 16(20):12955-12976.
Mengelkoch, S.; Gassen, J.; Lev-Ari, S.; et al. 2024. Multi-omics
in stress and health research: study designs that will drive the field
forward. Stress 27(1).
Munn, B. R.; Muller, E. J.; Favre-Bulle, I.; et al. 2024. Multiscale
organization of neuronal activity unifies scale-dependent theories
of brain function. Cell 187(25):7303-7313.e15.
Schreiber, U. 2008. AQFT from n-functorial QFT.
arXiv:0806.1079v2.
Spivak, D. I. 2014. Category Theory for the Sciences. Cambridge
MA: The MIT Press.
Stolz, B. J., Dhesi, J., Bull, J. A.; et al. 2024. Relational persistent
homology for multispecies data with application to the tumor mi-
croenvironment. Bulletin Math Biol 86(11):128.
Tuyeras, R. 2018. Category Theory for Genetics.
arXiv:1708.05255.
Wu, Y. 2023. A Category of Genes. arXiv:2311.08546v1.
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
The recent unprecedented progress in ageing research and drug discovery brings together fundamental research and clinical applications to advance the goal of promoting healthy longevity in the human population. We, from the gathering at the Aging Research and Drug Discovery Meeting in 2023, summarised the latest developments in healthspan biotechnology, with a particular emphasis on artificial intelligence (AI), biomarkers and clocks, geroscience, and clinical trials and interventions for healthy longevity. Moreover, we provide an overview of academic research and the biotech industry focused on targeting ageing as the root of age-related diseases to combat multimorbidity and extend healthspan. We propose that the integration of generative AI, cutting-edge biological technology, and longevity medicine is essential for extending the productive and healthy human lifespan.
Article
Full-text available
Topological data analysis (TDA) is an active field of mathematics for quantifying shape in complex data. Standard methods in TDA such as persistent homology (PH) are typically focused on the analysis of data consisting of a single entity (e.g., cells or molecular species). However, state-of-the-art data collection techniques now generate exquisitely detailed multispecies data, prompting a need for methods that can examine and quantify the relations among them. Such heterogeneous data types arise in many contexts, ranging from biomedical imaging, geospatial analysis, to species ecology. Here, we propose two methods for encoding spatial relations among different data types that are based on Dowker complexes and Witness complexes. We apply the methods to synthetic multispecies data of a tumor microenvironment and analyze topological features that capture relations between different cell types, e.g., blood vessels, macrophages, tumor cells, and necrotic cells. We demonstrate that relational topological features can extract biological insight, including the dominant immune cell phenotype (an important predictor of patient prognosis) and the parameter regimes of a data-generating model. The methods provide a quantitative perspective on the relational analysis of multispecies spatial data, overcome the limits of traditional PH, and are readily computable.
Article
Full-text available
Despite decades of stress research, there still exist substantial gaps in our understanding of how social, environmental, and biological factors interact and combine with developmental stressor exposures, cognitive appraisals of stressors, and psychosocial coping processes to shape individuals’ stress reactivity, health, and disease risk. Relatively new biological profiling approaches, called multi-omics, are helping address these issues by enabling researchers to quantify thousands of molecules from a single blood or tissue sample, thus providing a panoramic snapshot of the molecular processes occurring in an organism from a systems perspective. In this review, we summarize two types of research designs for which multi-omics approaches are best suited, and describe how these approaches can help advance our understanding of stress processes and the development, prevention, and treatment of stress-related pathologies. We first discuss incorporating multi-omics approaches into theory-rich, intensive longitudinal study designs to characterize, in high-resolution, the transition to stress-related multisystem dysfunction and disease throughout development. Next, we discuss how multi-omics approaches should be incorporated into intervention research to better understand the transition from stress-related dysfunction back to health, which can help inform novel precision medicine approaches to managing stress and fostering biopsychosocial resilience. Throughout, we provide concrete recommendations for types of studies that will help advance stress research, and translate multi-omics data into better health and health care.
Article
Full-text available
Over the past decade, there has been a dramatic increase in efforts to ameliorate aging and the diseases it causes, with transient expression of nuclear reprogramming factors recently emerging as an intriguing approach. Expression of these factors, either systemically or in a tissue-specific manner, has been shown to combat age-related deterioration in mouse and human model systems at the cellular, tissue and organismal level. Here we discuss the current state of epigenetic rejuvenation strategies via partial reprogramming in both mouse and human models. For each classical reprogramming factor, we provide a brief description of its contribution to reprogramming and discuss additional factors or chemical strategies. We discuss what is known regarding chromatin remodeling and the molecular dynamics underlying rejuvenation, and, finally, we consider strategies to improve the practical uses of epigenetic reprogramming to treat aging and age-related diseases, focusing on the open questions and remaining challenges in this emerging field.
Article
Full-text available
Intracellular processes are cascades of biochemical reactions, triggered in response to various types of stimuli. Mathematical models describing their dynamics have become increasingly popular in recent years, as tools supporting experimental work in analysis of pathways and regulatory networks. Not only do they provide insights into general properties of these systems, but also help in specific tasks, such as search for drug molecular targets or treatment protocols. Different tools and methods are used to model complex biological systems. In this work, we focus on ordinary differential equations (ODEs) and Petri nets. We consider specific methods of analysis of such models, i.e., sensitivity analysis (SA) and significance analysis. So far, they have been applied separately, with different goals. In this paper, we show that they can complement each other, combining the sensitivity of ODE models and the significance analysis of Petri nets. The former is used to find parameters, whose change results in the greatest quantitative and qualitative changes in the model response, while the latter is a structural analysis and allows indicating the most important subprocesses in terms of information flow in Petri net. Ultimately, both methods facilitate finding the essential processes in a given signaling pathway or regulatory network and may be used to support medical therapy development. In the paper, the use of dual modeling is illustrated with an example of ATM/p53/NF- κ\kappa κ B pathway. Each method was applied to analyze this system, resulting in finding different subsets of important processes that might be prospective targets for changing this system behavior. While some of the processes were indicated in each of the approaches, others were found by one method only and would be missed if only that method was applied. This leads to the conclusion about the complementarity of the methods under investigation. The dual modeling approach of comprehensive structural and parametric analysis yields results that would not be possible if these two modeling approaches were applied separately. The combined approach, proposed in this paper, facilitates finding not only key processes, with which significant parameters are associated, but also significant modules, corresponding to subsystems of regulatory networks. The results provide broader insight into therapy targets in diseases in which the natural control of intracellular processes is disturbed, leading to the development of more effective therapies in medicine.
Article
Full-text available
Reaction networks, or equivalently Petri nets, are a general framework for describing processes in which entities of various kinds interact and turn into other entities. In chemistry, where the reactions are assigned "rate constants", any reaction network gives rise to a nonlinear dynamical system called its "rate equation". Here we generalize these ideas to "open" reaction networks, which allow entities to flow in and out at certain designated inputs and outputs. We treat open reaction networks as morphisms in a category. Composing two such morphisms connects the outputs of the first to the inputs of the second. We construct a functor sending any open reaction network to its corresponding "open dynamical system". This provides a compositional framework for studying the dynamics of reaction networks. We then turn to statics: that is, steady state solutions of open dynamical systems. We construct a "black-boxing" functor that sends any open dynamical system to the relation that it imposes between input and output variables in steady states. This extends our earlier work on black-boxing for Markov processes.
Conference Paper
Full-text available
In this work the authors propose the use of a high-level Petri net formalism for modeling developmental processes at the cell level, taking explicitly into account the role of epigenetic regulation.
Article
Aging is driven by hallmarks fulfilling the following three premises: (1) their age-associated manifestation, (2) the acceleration of aging by experimentally accentuating them, and (3) the opportunity to decelerate, stop, or reverse aging by therapeutic interventions on them. We propose the following twelve hallmarks of aging: genomic instability, telomere attrition, epigenetic alterations, loss of proteostasis, disabled macroautophagy, deregulated nutrient-sensing, mitochondrial dysfunction, cellular senescence, stem cell exhaustion, altered intercellular communication, chronic inflammation, and dysbiosis. These hallmarks are interconnected among each other, as well as to the recently proposed hallmarks of health, which include organizational features of spatial compartmentalization, maintenance of homeostasis, and adequate responses to stress.