Volume 122, Issue 3 p. 677-697
ARTICLE

Oleaginous Yeast Biology Elucidated With Comparative Transcriptomics

Sarah J. Weintraub

Sarah J. Weintraub

Department of Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Search for more papers by this author
Zekun Li

Zekun Li

Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Search for more papers by this author
Carter L. Nakagawa

Carter L. Nakagawa

Department of Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Search for more papers by this author
Joseph H. Collins

Joseph H. Collins

Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Search for more papers by this author
Eric M. Young

Corresponding Author

Eric M. Young

Department of Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Correspondence: Eric M. Young ([email protected])

Search for more papers by this author
First published: 10 December 2024

ABSTRACT

Extremophilic yeasts have favorable metabolic and tolerance traits for biomanufacturing- like lipid biosynthesis, flavinogenesis, and halotolerance – yet the connection between these favorable phenotypes and strain genotype is not well understood. To this end, this study compares the phenotypes and gene expression patterns of biotechnologically relevant yeasts Yarrowia lipolytica, Debaryomyces hansenii, and Debaryomyces subglobosus grown under nitrogen starvation, iron starvation, and salt stress. To analyze the large data set across species and conditions, two approaches were used: a “network-first” approach where a generalized metabolic network serves as a scaffold for mapping genes and a “cluster-first” approach where unsupervised machine learning co-expression analysis clusters genes. Both approaches provide insight into strain behavior. The network-first approach corroborates that Yarrowia upregulates lipid biosynthesis during nitrogen starvation and provides new evidence that riboflavin overproduction in Debaryomyces yeasts is overflow metabolism that is routed to flavin cofactor production under salt stress. The cluster-first approach does not rely on annotation; therefore, the coexpression analysis can identify known and novel genes involved in stress responses, mainly transcription factors and transporters. Therefore, this work links the genotype to the phenotype of biotechnologically relevant yeasts and demonstrates the utility of complementary computational approaches to gain insight from transcriptomics data across species and conditions.

Data Availability Statement

RNA-Seq reads are available on PRJNA1028627 NCBI BioProject, and the Joint Genome Institute. Genome sequences can be found on NCBI with the following accession IDs: Y. lipolytica PO1f - PRJNA1047134, D. subglobosus dep8 - PRJNA1047133. All data is available on GitHub (https://github.com/emyounglab/rnaseq-onboarding).

The data that support the findings of this study are openly available in NCBI at https://submit.ncbi.nlm.nih.gov/.