- Review
- Open access
- Published:
From beer to breadboards: yeast as a force for biological innovation
Genome Biology volume 25, Article number: 10 (2024)
Abstract
The history of yeast Saccharomyces cerevisiae, aka brewer’s or baker’s yeast, is intertwined with our own. Initially domesticated 8,000 years ago to provide sustenance to our ancestors, for the past 150 years, yeast has served as a model research subject and a platform for technology. In this review, we highlight many ways in which yeast has served to catalyze the fields of functional genomics, genome editing, gene–environment interaction investigation, proteomics, and bioinformatics—emphasizing how yeast has served as a catalyst for innovation. Several possible futures for this model organism in synthetic biology, drug personalization, and multi-omics research are also presented.
A short history of yeast from the perspective of genomics
Tracing the journey of the budding yeast, Saccharomyces cerevisiae (S. cerevisiae) reveals a rich history of interactions with our ancestors and us. Domesticated for millennia for wine and breadmaking, yeast was introduced as an experimental organism in the 1930s by Herschel Roman and colleagues [1]. Genetic studies, pioneered by Øjvind Winge and Carl Lindegren in the late 1940s [2], helped set the stage for the broad adoption of this model system. In the ensuing century, S. cerevisiae became a workhorse in genetics, molecular biology, and biotechnology; more recently, it facilitated the establishment of genomics as a discipline. For example, S. cerevisiae has become a model for studying metabolism, morphology, cell division, secretion, and other fundamental cellular functions. Its experimental advantages are manifold—in particular, it is a unicellular organism that, unlike metazoans, can be cultured on defined media, allowing the researcher to control all environmental factors.
The introduction of genetically stable, homothallic strains (i.e., lacking a functional HO endonuclease and unable to switch mating type), genetically marked haploid and diploid cells, and the ability to control mating and meiosis opened up the field of classical yeast genetics. Both mitotic and meiotic approaches were developed to map yeast genes (reviewed in reference [3]). After the first genetic map was published in 1949 [4], molecular techniques and recombinant DNA—introduced in the 1950s and 1960s—were quickly adapted to yeast research. Later, a watershed experiment in 1977 was the demonstration of functional complementation of a yeast mutant with a leucine biosynthetic gene from Escherichia coli [5].
Combining both genetic and molecular approaches to study yeast cells accelerated the development of reverse genetics (i.e., proceeding from gene to phenotype) and led to the characterization of hundreds of yeast genes. In parallel, yeast biochemists gathered a wealth of biochemical information on metabolic pathways, characterizing enzymes involved in metabolic processes as well as the underlying regulatory circuits. Cytological studies have contributed to our understanding of mitosis and meiosis, cytoskeletal structure and function, and organelle biology. Yeast studies have also contributed foundational insights on nucleic acid metabolism and genome structure, DNA repair, cell cycle regulation, gene expression, and the response to diverse stresses. Indeed, several Nobel Prizes have recognized these contributions to our understanding of the cell cycle, secretion, and autophagy (see Fig. 1 and reference [2] for details).
Because its core biological processes are functionally conserved, yeast research has direct, translational implications for human health [6]. Once sequence data became available on a large scale in the early 1990s, it became obvious that, despite being separated by nearly a billion years of evolutionary distance, most fundamental biological structures and functions are conserved between yeast and mammals. Indeed, many homologous genes can complement (i.e., functionally substitute) for each other (for instance, see reference [11]). By the 1990s, interest in the yeast genome was ascendant; as part of the Human Genome Project (HGP), the smaller genomes of yeast and worm served as pilot tests of HGP experimental and computational logistics (see reference [12] by Lander et al.). When S. cerevisiae became the first completely sequenced eukaryotic genome in 1996, the abundance of information collected in this project (performed by a network of yeast labs led by Andre Goffeau and colleagues) became a crucial reference against which human, animal, plant, and microbial genes were compared [13]. Despite the S. cerevisiae genome being one of the best-characterized and extensively studied model systems, it is somewhat surprising that several hundred open reading frames (ORFs) remain uncharacterized (Table 1). The yeast genome sequence revealed that between a third to a half of yeast genes are related to human genes by homology. The fact that these homologs have persisted (with modest alterations) suggests that they support important basic cellular functions (reviewed in reference [7]).
At the turn of the last century, comparative studies of a small number of closely related yeast genomes helped build the framework for comparative genomics [16]. The introduction of massively parallel sequencing brought hundreds (and eventually thousands) of diverse S. cerevisiae genomes for comparison [17,18,19,20,21]. In 2009, Gianni Liti and colleagues accelerated the nascent field of yeast population genetics by sequencing over 70 yeast isolates. They found that phenotypic variation was correlated with global genome-wide phylogenetic relationships. This study also revealed that human influences facilitated cross-breeding and the emergence of new variations [22]. Schacherer et al. conducted a nucleotide-level survey of genomic variation in 63 S. cerevisiae strains sampled from diverse ecological niches. They identified 1.89 million single-nucleotide polymorphisms and 3985 larger deletions. The study provided insights into the population structure of S. cerevisiae, supporting multiple domestication events, and also shed light on the origins of pathogenic strains [18]. In a study led by John H. McCusker, 93 genomes of S. cerevisiae strains from various geographic and environmental origins were sequenced and annotated as part of the “100-genomes” resource [23]. These studies set the stage for ever larger whole-genome surveys of Saccharomyces. For example, Peter et al. reported whole-genome sequencing and phenotyping of 1011 S. cerevisiae isolates, providing broad evolutionary insight into how genomic variants shape the species-wide phenotypic landscape [24], including evidence that S. cerevisiae spread worldwide from a single out-of-China event.
These comparative genomic projects, combined with the large-scale analysis of genetic regulatory elements and chromatin structure studies, provided the data that fueled a comprehensive annotation of the yeast genome, which continues to this day. Genome annotation describes the process of identifying the functional elements and characteristics of genes within a genome. In practice, gene and genome annotation involves several overlapping activities, including the following: (1) computational gene prediction to identify ORFs and noncoding regions, (2) functional annotation using both forward and reverse genetics, (3) identification of gene–gene interactions and gene–chemical interactions, (4) regulatory element identification, and (5) comparative genomics. These studies, which collectively assess over a thousand diverse species, provide a comprehensive view of genome evolution, including SNPs, structural changes, and large-scale differences in ploidy (i.e., changes in chromosome number). Most recently, long-read sequencing has been added to the toolkit of comparative genomics—enabling complete or nearly complete telomere-to-telomere genome assemblies [14].
The yeast sequencing project was contemporaneous with the establishment of the GO Consortium, which began as a joint project of the SGD [25], FlyBase [26], and the Mouse Genome Database [27]. The founders of the GO consortium envisioned gene annotation as a tool that would unify biology; their prediction that “there is likely to be a single limited universe of genes and proteins, many of which are conserved in most or all living cells,” has motivated a generation of computational biologists. For practical purposes, the consortium defined three categories of GO: biological process, i.e., the biological objective that the gene product executes; molecular function, i.e., the biochemical activity (or potential activity) of a gene product; and cellular component, i.e., where in the cell that a gene product is localized and active.
Gene sequencing, comparative genomics, and gene annotation are symbiotic because all three activities help define genome function; improvements to these methods drive better annotations. For example, in the early days of the yeast genome sequencing project, annotation suffered from false positives as well as missed genes. The lack of other sequences for comparison also stymied annotation of conserved, noncoding sequences. New technologies such as ChIP-seq, nucleosome mapping, and proximity techniques were crucial for each genome revision. Indeed, while the yeast genome is arguably quite stable at the molecular level, it has undergone continuous revision, including changes in absolute gene number [28], with much of the reduction (nearly 10%) of the original ~ 6200 ORFs arising from comparative genomics. On the other hand, newly defined genes have been dominated by small ORFs that were not originally included because they did not pass the 100 amino acid threshold for being annotated as ORFs [29]. While the sequence of the yeast reference genome is arguably complete, the annotation of its gene complement will be continually revised as new technologies and insights are introduced.
Yeast functional genomics
Early efforts in functional genomics
Once the first phase of the yeast sequencing project was completed in 1996, the challenging task of assigning functions remained. Even before the yeast sequencing project was finished, several laboratories had constructed large-scale yeast mutant collections. For example, transposon tagging was used to generate 11,000 mutants in 2000 genes to track gene expression, protein localization, and disruption phenotypes [30]. The data from screens of 8000 strains performed in 20 different growth conditions were made widely available. This study highlighted the importance of making screening data publicly available [31] and helped lay the foundation for future genome-wide approaches to identify functionally related genes (for details, see reference [32]). These studies provided crucial early insights, including the observation that 20% of the genes are essential and further, which essentiality is condition-dependent. Analysis of the so-called nonessential genes argued against the idea that duplicated genes are redundant; indeed, experimental results showed that every gene, when deleted, exhibited a measurable fitness phenotype [33]. Such transposon-based screens have caveats—because insertions are not targeted, it is difficult to unambiguously distinguish between effects due to a gene disruption or a neighboring sequence feature. In addition, transposons have target sequence biases [34]. Nonetheless, these early studies underscored the need for a complete, systematic deletion collection that would encompass all essential and nonessential genes and simplify mutant interpretation by using complete, start-to-stop deletions.
The S. cerevisiae deletion project
The yeast S. cerevisiae deletion project (aka the yeast knockout or YKO collection) involved an international consortium of 16 laboratories (many of whom participated in the genome sequencing project) that, over the course of 3 years, deleted and distributed a systematic set of yeast deletion strains [35,36,37]. The history of this project and the essential roles of Ron Davis at Stanford and Mark Johnston at Washington University in St. Louis is reviewed in [7]. Each gene was precisely deleted—from the start-to-stop codon (non-inclusive)—and replaced (using mitotic recombination) with the KanMX deletion “cassette” (Fig. 2a) [38]. The KanMX gene inserted into the deletion locus in each mutant is flanked by two strain-specific 20-nucleotide sequences that serve as molecular barcodes to uniquely identify each deletion mutant. For the majority of mutants, the cassette was introduced into a diploid strain to produce the heterozygous deletion strain, which was sporulated to generate the MATa and MATα haploid deletion strains, followed by mating of the two haploids to generate the homozygous deletion strain [35]. Mutants that could not be constructed in diploids were made directly in haploids. In total, four sets of deletions were produced, all genes as heterozygous diploids, homozygous diploids, and both a and α haploids. A snapshot of the key pages of the original yeast deletion project website has been restored at “http://chemogenomics.pharmacy.ubc.ca/GGCN_Lab/SGDP/”—hosted by our lab at the University of British Columbia.
The unique sequence tags (i.e., barcodes) linked to each gene deletion allow the strains to be analyzed in parallel in competitive fitness assays. In these pooled experiments, a mixed culture containing every deletion mutant is grown, samples are collected at several times during growth, and the molecular barcode tags are amplified from the genomic DNA by PCR using common primers that flank the unique barcodes. The abundance of each deletion strain is then determined by quantifying the molecular barcodes by next-generation sequencing. The greater the degree that a gene is required for growth, the more rapidly that strain (and its corresponding sequence tags) diminish in the culture. Thus, all genes required for growth can be identified and ranked in order of their relative contribution to fitness in a single experiment [36, 37]. Figure 2 illustrates the workflow of these fitness assays.
The first progress report of the YKO consortium appeared in Science in 1999 [37] when a third of the deletion strains had been constructed. Major findings included the following: (1) of 2026 ORFs deleted, 17% were essential, (2) only half of all ORFs were previously known, and (3) in a competitive fitness assay across ~ 60 generations (performed in either minimal or rich medium using a pool comprised of 558 homozygous deletions), a fitness defect was revealed for 40% of the strains. The second consortium report, published in Nature in 2002 [36], announced the completion of the YKO collection and reported a tally of 18.7% essential genes out of 5916 ORFs. This landmark paper included full genome functional profiling of the homozygous deletion collection in five environmental stress conditions and in the antifungal drug Nystatin. Notable findings included a slow-growth phenotype for 15% of the strains in rich media as well as phenotypes for all mutants in well-characterized stress conditions, including high salinity (1-M NaCl) and high osmolarity (1.5-M sorbitol).
The 2002 study also established that there is no correlation between the genes necessary for survival in a specific condition and those genes whose transcription is increased after exposure to that condition (for example, high salinity). [36]. This surprising lack of correlation between fitness and gene expression changes has subsequently been supported by numerous studies, including a report aptly titled “Transcriptional response of S. cerevisiae to DNA-damaging agents does not identify the genes that protect against these agents” [39]. Since this time, the YKO has been used to study the nuances of the stress response. For example, it has been suggested that the immediate mRNA stress response may be more important for surviving the next encounter with stress. A hallmark of the stress is now known to include a response at the protein level, which occurs much more rapidly than the transcriptional response and can include posttranscriptional events that provide “just in time production of genes” such as rapid reprogramming of translation through a variety of mechanisms—uORFs [40], stress granules [41], and active blocking of the exit of ribosomal subunits through the nuclear pore [42] to name a few.
Derivative libraries and methodologies inspired by the YKO project
The completion of the YKO collection inspired the development and application of derivative strain collections (Table 2) and new genome-wide technologies. For example, genome-wide yeast libraries that reference the original YKO seminal publications have themselves been cited > 6000 times, including the yeast tandem affinity purification (TAP-tagged) collection [8], the GFP collection [43], and the glutathione-S-transferase (GST-tagged) ORF collection [44]. Other highly cited papers inspired by the YKO include novel methods for mutant construction in other organisms (for example, the Arabidopsis thaliana [45] and E. coli mutant collections [46]). Applications that leveraged the YKO project include genome-scale protein-complex analysis by mass spectrometry [47,48,49], protein microarrays [50], whole-genome analysis of synthetic genetic arrays (SGA) [9, 51, 52], and large-scale gene expression studies [32, 39, 53]. Although a detailed description of these libraries and technologies is beyond the scope of this review, we highlight several studies that have contributed to the understanding of yeast gene function and cellular processes.
Methods to generate synthetic genetic double mutants include the aforementioned synthetic genetic array (SGA) [61] and diploid-based synthetic lethality analysis on microarrays (dSLAM) [62]. The first SGA genetic interaction study described the systematic construction of all pairwise double mutants of dozens of haploid deletion strains [51, 61]. Subsequent, and ever-larger genetic interaction maps iterated this approach to produce a nearly complete dataset [52, 63, 64], including essential genes as ts (temperature-sensitive) or DAmP (decreased abundance by mRNA perturbation) hypomorphic alleles [65].
The TAP-tagged library [8] allowed the expression levels of all proteins in the cell to be quantified, while the GFP library [43] provided localization data on the proteome; together, these two libraries allowed an integrated view of the localization and abundance of nearly all proteins in the cell. The TAP-tagged library became instrumental in the first mass spectrometry-based genome-scale isolation of protein complexes. Subsequent improvements on these two collections have refined these datasets. For example, the SWAp-Tag (SWAT) was introduced by Weill et al. as a flexible library that facilitates the rapid construction of an endless number of variants [66] to characterize the yeast proteome for protein abundance, localization, topology, and interactions [67].
Genome-wide phenotypic screens
The YKO collection has been used in thousands of genome-wide phenotypic assays and has provided insights into biological function, the response to stress, and the mechanism of drug action. Many genome-wide phenotypic screens have been independently repeated, with DNA metabolism and repair screens being prominent examples. The reader is referred to comprehensive phenotypic screening studies for details, e.g. [68,69,70,71]; here, we provide an overview of yeast phenotypic screens and highlight select examples.
In 2005, a screen of the heterozygous deletion collection revealed that 3% of ~ 5900 genes are haploinsufficient, manifesting a fitness defect in rich media [72]. At the time, there were two prevailing hypotheses of haploinsufficiency. The “balance hypothesis” posited that haploinsufficiency is due to a disruption in the stoichiometry of protein complex members [73], making the testable prediction that the haploinsufficient phenotype will be the same as the overexpression phenotype, because both scenarios disrupt the balance of protein subunits. Under this scenario, haploinsufficiency should be maintained regardless of the growth conditions because the stoichiometry of a protein complex would still be unbalanced. Alternatively, the “insufficient amounts hypothesis” postulated that haploinsufficiency results from reduced levels of protein product, rendering the cell less fit [72]. Deutchbauer observed that haploinsufficiency in minimal media was associated with a repression of gene expression, in contrast to predictions of the balance hypothesis—suggesting the importance of absolute transcript levels and indicating that specific gene products play a crucial role in growth limitation only in rich media [72]. Further, overexpression of 13 haploinsufficient genes did not cause a growth defect, and growth in minimal media (which slows the rate of cell division) alleviates most haploinsufficiency, as does any treatment that slows growth (e.g., high pH or growth inhibitors). Taken together, this work suggests that most cases of haploinsufficiency in yeast are caused by insufficient amounts of protein, with some exceptions like cytoskeletal genes (ACT1, TUB1, and SPC97) that maintain haploinsufficiency in both YPD and minimal media; for those genes, the balance hypothesis best explains their haploinsufficiency [72].
The majority of haploinsufficient genes have human homologs (107 of the 184), and all complexes that are haploinsufficient in yeast are present in humans. Of the 3% haploid-insufficient strains, over half were functionally related or functionally enriched for ribosomal function. The importance of ribosomal haploinsufficiency in eukaryotes is illustrated in Drosophila by the minute mutants that have several developmental abnormalities (for review, see [74]). In mammals, pathogenic effects of ribosome haploinsufficiency include Diamond-Blackfan anemia and 5 q- syndrome (a hematological disorder) [75]; also, haploinsufficiency of RPL5 (a ribosomal 60S subunit) in human breast cancer cells accelerates tumor progression in a mouse model [76]. As the relevance of haploinsufficiency to human diseases and cancers becomes better characterized [77], the list of yeast haploinsufficient genes may serve as a valuable reference for understanding the role of their human orthologs in disease.
Mitochondrial respiration screens
Many of the earliest studies on yeast, spearheaded by European brewers (e.g., the Carlsberg laboratories), focused on their ability to grow either in a fermentative manner or via respiration. Assessing the requirement of a gene for mitochondrial respiration is straightforward in yeast—the inability to grow on an obligate respiratory carbon source strongly implicates that the deleted gene product is required for this process. In 2002, Steinmetz et al. performed a systematic screen with varying carbon sources on the nonessential diploid deletion set and identified 466 genes whose deletion impaired mitochondrial respiration, including 265 that were novel [78]. Three independent colony-based, genome-wide studies also screened the deletion collection for genes required for respiratory growth [79,80,81]. As opposed to liquid growth assays that typically measure fitness by light scattering, colony-based yeast studies measure fitness based on colony size. In one of the most recent of these studies, Merz and Westermann included a welcome comparison of these results, revealing an overlap of 176 genes between all three colony-based studies, each representing approximately half of the genes identified in each individual screen [81]. The discrepancy between the number of respiratory-deficient mutants identified between studies could come from several sources, but regardless of the cause, this observation highlights a limitation of the YKO, namely, it is limited to a single genetic background. Newer tools (discussed below) should expand the deletion approach to other strains—indeed, several small-scale efforts have shown the utility of this approach in the context of wine strains and those that undergo pseudohyphal growth [82, 83].
Caveats on the yeast deletion collection
Despite its merits, the YKO has several key limitations that limit its utility. These include the fact that the YKO represents a single genetic background, which contains several well-characterized polymorphisms that compromise sporulation, mitochondrial function, and other less obvious phenotypes [84]. While there are other yeast deletion mutant collections, most of them are hybrids in which the deletion cassettes were derived from the original YKO (for example, [83] and [85]). Additionally, improper maintenance of large strain collections (i.e., wrong colony on plate or polyclonal colonies) can lead to the wrong mutant being tested. Another confounding factor is that individual mutant strains may have acquired mutations (e.g., second-site suppressors, aneuploidy, and diploidization). Accordingly, it is recommended that results from YKO experiments (either individually or in pools) should be independently constructed and validated. The compact nature of the yeast genome often complicates the study of individual genes. In other words, deletion of one gene can occasionally disrupt the promoter, terminator, or coding sequences of nearby genes on the opposite strand of genomic DNA. Interestingly, very closely spaced deletion mutant pairs can occasionally be used to confirm neighboring gene function as described below for SIR2. Despite these caveats, the broad uptake of the YKO as an experimental platform is clear—in 2023, Anastasia Baryshnikova’s group analyzed ~ 14,500 yeast knockout screens and clustered these datasets into what they dubbed the Yeast Phenome, illustrating the continued usefulness of the YKO [86].
Protein–protein interactions
Two-hybrid studies
One of the first methods to study protein–protein interactions (PPIs) at scale was the two-hybrid system. In the original version of the assay, two query proteins are constructed, with the “bait” protein fused to the DNA-binding domain of the Gal4 protein and the “prey” protein fused to the activation domain of Gal4. If these two proteins physically interact, Gal4 activity is restored and can be measured via activation of a reporter gene. An alternative assay, called “the interaction trap,” was introduced by Golemis et al. in 2008 [87]. By generating genome-wide “orfeome” collections to be used as either bait or prey libraries, these assays can be carried out on a genome-wide scale in an array format (reviewed in reference [88]). In two early extensive two-hybrid studies, all possible combinations of ~ 6000 proteins in yeast were interrogated. One study identified 841 interactions [89], and the other identified 691 [90]. These two reports made progress towards a comprehensive protein–protein interaction map, yet their datasets shared only 141 genes in common, 40 of which were known interactions. Technical differences, such as different reporter plasmids, may explain the lack of agreement, and additional limitations of the two-hybrid system should be considered. For example, when fused to the Gal4 binding or activation domain, many proteins may fail to fold properly, contributing to a false-positive rate of approximately 25% per unique interaction for yeast [91]. With the introduction of NGS, 2-hybrid screens have been adapted to massively parallel formats; nonetheless, careful validation of any hits is still required. Newer yeast protein–protein interaction technologies include those from the David Baker’s lab at the University of Washington [92], companies (e.g., A-Alpha Bio), as well as PROPER-seq, a technique which infers PPIs based on assessing the transcriptome [93].
Protein complexes identified used mass spectrometry
Comprehensive protein–protein interaction maps, powered by developments of tagged proteomes and the development of unbiased mass spectrometry methodologies, have increased the breadth and depth of our understanding of the yeast interactome. The TAP (tandem affinity purification) method is used to purify TAP-tagged proteins and their associated proteins. The TAP tag comprises a calmodulin-binding peptide (CPB), a protein A moiety, and a tobacco etch virus (TEV) protease cleavage to facilitate the isolation process (see reference [8]). Two early large-scale mass spectrometry studies took advantage of the TAP tag fusion collection to identify all protein complexes in the yeast genome. Gavin et al. identified 491 complexes comprising 23% of the yeast proteome (257 of these complexes were novel). Many of the proteins identified are “modular” with some always appearing together, while others present in more than one complex [49]. Krogan et al. identified 547 complexes, comprising 47% of the yeast proteome, with 2702 proteins in total [48]. It is difficult to compare the two datasets because they used different methods, and indeed, only six complexes were identical between the two sets. Hart et al. [94] integrated the two datasets along with a third dataset (from Ho et al., [47]) and found a consensus of 1689 proteins representing 390 protein complexes. Of the 132 with 4 or more subunits, 69% are highly enriched for specific GO component annotations suggesting that the complexes are highly accurate. Essential genes are enriched in complexes and based on the high proportion of complexes that are already annotated, and the relative dearth of uncharacterized genes in the high confidence data suggests that these studies may have largely saturated the fraction of the yeast “complexome” that is accessible in these conditions using these methods of isolation [95, 96].
Chemogenomics: identifying drug targets
Chemogenomic profiling is a method designed to study the genome-wide response to small molecules. The ability to identify drug targets in vivo in an unbiased manner without prior knowledge has made yeast instrumental in such mechanism-of-action studies. Traditional chemogenomic approaches to determine the mechanism of action (MoA) of drugs include isolation of drug-resistant mutants followed by genetic mapping. While this mutational approach can identify the drug target, it is difficult to scale. Alternatively, one can clone a drug target by complementation [97]. In one early cloning-by-complementation study, a strain mutated for the gene encoding a drug target (HMG-CoA reductase) was transformed with a genomic DNA clone bank to identify drug-resistant colonies able to grow on solid media containing lovastatin [98]. This method inspired multicopy suppression profiling (MSP), where a library of clones is introduced, in parallel, into a pool of mutants, and once resistant strains are identified, the complementing plasmid-borne gene is sequenced to reveal candidate drug targets. Traditional MSP screens involve plating techniques and characterization of individual clones by sequencing [99]. They are prone to false negatives, for example, if the wrong time point is assayed or the wrong drug concentration is used, and the results can be dominated by a gene product unrelated to the drug target. There are now several well-characterized overexpression libraries that can be used for high-throughput studies [57, 100], including the molecular-barcoded yeast ORF (MoBY-ORF) collection built by Ho et al. in 2009 [59]. This collection is barcoded, and because each CEN-based plasmid carries a single ORF flanked by its native upstream and downstream genomic sequences, the copy number is low and predictable, minimizing overexpression toxicity [101]. Indeed, high-level overexpression can disrupt cellular homeostasis, and several groups have exploited this phenotype to find inhibitors that alleviate the fitness defect caused by overexpression of toxic proteins [102,103,104].
Haploinsufficiency profiling–homozygous profiling (HIP − HOP) is a gene-dose assay that relies on an increase in drug sensitivity to identify drug targets. The HIP assay relies on the drug-induced haploinsufficiency phenotype, which is based on the observation that reducing the copy number of a drug target from two copies to one copy in diploid yeast results in increased sensitivity to a compound that inhibits the gene product of the heterozygous locus [105]. HIP uses essential heterozygous deletion strains in competitive fitness assays combined with quantitative analysis of the molecular barcodes to identify relative strain abundance—strains most sensitive to the drug provide a ranked list of the most likely drug target candidates [10, 105,106,107]. HIP has the advantage of simultaneously identifying both the inhibitory compound and its candidate target(s) without prior knowledge of either. In some cases, a 50% decrease in gene dosage is not sufficient to identify the drug target. In these cases, complementary approaches that use DAmP (decreased abundance by mRNA perturbation) alleles can be used [65]. The DAmP collection is a set of hypomorphic alleles that carry a disruption in the 3′-untranslated region in the essential genes, which destabilizes the corresponding RNA transcript and results in a ~ 5–50-fold decrease in mRNA levels [65].
The HOP assay is analogous to HIP, except that the homozygous deletion collection is used. It complements the HIP assay by providing a ranked list of genes (by virtue of their deletion strain sensitivity) that buffer the target pathway, including those that comprise pathway components as well as genes involved in multidrug resistance (e.g., drug transport, detoxification, and metabolism). When combined, HIP–HOP chemogenomic profiles give a comprehensive view of drug mechanism along with primary and secondary targets, identifying all genes required for drug exposure–response in a single assay. HIP − HOP has been successfully used to identify the target of known and novel compounds [10, 106,107,108,109]. An illustrative example includes Sir2, a histone deacetylase, as the target of tenovin, a small-molecule p53 activator [110]. This study screened the heterozygous diploid deletion with a derivative of tenovin to show that sir2 deletion strains manifested tenovin-induced haploinsufficiency. While none of the silent information regulator genes was represented in the YKO (because they are unable to mate), the deletion of an adjacent dubious ORF (YDL041W) removed the first 300 nucleotides of SIR2, abolishing its function and establishing Sir2 as a potential tenovin target.
Combining the results of several thousand such HIP − HOP screens revealed that the cellular response to small molecules is limited and can be described by 45 “signature” chemical-genetic interaction profiles that are detectable in other large-scale genomic datasets, suggesting that they represent fundamental, conserved small-molecule response systems present across eukaryotic cells [10]. Figure 3 shows an overview of a HIP − HOP growth assay result, representing chemical–genetic interactions of a small molecule, erodoxin, with a complex network of yeast genes highly enriched for “post-translational protein targeting to membrane” and “endoplasmic reticulum membrane” genes.
Other genome-wide chemogenomic strategies (e.g., SGA and gene–expression profiling) rely on “guilt by association” to identify the target of a drug from a compendium of reference profiles (e.g. genetic interactions or gene expression) [111,112,113,114]. Since drugs with similar mechanisms have similar chemical–genetic profiles, the drug target can be inferred by global analysis of chemical–genetic profiles to uncover reference compounds with established MoA [113].
Recent advances in chemogenomics
With advancements in computational and statistical techniques, combined with the increasing ease of genetic engineering, functional genomic studies in yeast have flourished. Machine learning approaches have been successfully used in yeast functional genomics, including cell growth prediction, pathway engineering, and chemogenomics [115,116,117]. Recently, genome-wide CRISPR screens (including CRIPSRi) have been applied to yeast, and to date (see below), the results are in agreement with more traditional molecular techniques.
The Charlie Boone/Brenda Andrews laboratories have extensively surveyed gene–gene interactions across the genome. In 2020, they extended their pioneering digenic interaction studies to interrogate trigenic interactions to identify those genes that have maintained their functional overlap versus those that have evolved novel functions [118]. A large number (~ 550,000 double and ~ 260,000 triple mutants) were screened; ~ 4700 negative digenic interactions and ~ 2500 negative trigenic interactions were identified. Statistical analysis suggested that two-thirds of paralogs have functionally diverged during the course of evolution, while one-third are functionally redundant [118].
Recently, researchers have used genome-scale CRISPR screens in yeast to both improve the technology and to apply it in novel ways. Smith et al. performed a genome-wide CRISPR interference screen to investigate the effectiveness of gRNA for transcriptional repression [119]. Using an inducible, plasmid-based CRISPRi system (with 20 gRNAs directed to 20 genes whose expression should influence sensitivity to specific small molecules) along with 18 small molecules, the effects of gRNAs on CRIPSRi-induced fitness defects were studied, and generalizable characteristics associated with gRNA efficacy were assessed. The chemical–genetic interactions identified by this strategy were precisely consistent with previously described interactions. More recently, Momen-Roknabadi et al. validated the CRISPR approach by introducing a genome-scale, inducible CRISPRi library, which they applied to uncover haploinsufficient genes and enzymatic and regulatory genes involved in adenine and arginine biosynthesis [120].
For yeast CRISPR applications, targeting genes and their regulatory elements is straightforward, but modifying SNPs is more difficult, owing to the high degree of sequence similarity between the guide and the donor (which can result in loss of the variant through cell death or mutation by NHEJ) and uncertainty about the availability of PAMs near the SNP. To address these limitations, Lars Steinmetz’s lab published a CRISPR–Cas9-based method called MAGESTIC (multiplexed accurate genome editing with short, trackable, integrated cellular barcodes) for variant analysis [121]. Using MAGESTIC, they carried out a saturation mutagenesis experiment on the essential gene SEC14 and determined which amino acids are crucial for chemical inhibition of lipid signaling. They showed that the editing efficacy can be improved five-fold when the donor DNA is recruited to the site of breaks using LexA–Fkh1p fusion protein [121]. Most genome-wide genotype–phenotype screens had been restricted to a single mode of alteration—deletion, repression, or overexpression. Taking advantage of the trifunctional CRISPR system (aka CRISPR-AID [122]), Huimin Zhao’s lab developed multifunctional genome-wide CRISPR (MAGIC) to regulate the expression level of each gene to prespecified levels. They constructed three genome-scale gRNA-expressing plasmid libraries for upregulating, downregulating, and deleting genes, representing new options for yeast functional libraries [123].
In 2021, Alford et al. introduced a reverse genetic method called ReporterSeq to define pathways involved in the yeast stress response. ReporterSeq identifies genes that regulate stress-induced transcription factors in a time-resolved manner in different environments by pairing the enumeration of RNA-encoded barcodes to pathway-specific outputs that are enumerated by DNA sequencing. Employing ReporterSeq in 15 stress environments, they discovered novel, stress-specific, time-specific, and constitutive regulators and suggest that this method could be applied with any encodable genetic perturbation (e.g., RNAi, CRISPR knockouts) [124].
The utility of the genetically encoded barcodes of the YKO has recently been extended by several groups that combined pooled approaches to genetic interaction mapping with the ability to generate “barcode fusions” between two distinct cells. In brief, these approaches rely on the ability to isolate interacting cells, either by (1) mating, (2) by transforming cells carrying one barcoded locus with a plasmid containing a second barcode, or (3) by encapsulating cells within oil-in-water emulsions. Cells or pairs of cells are then subjected to barcode fusion either by Cre-Lox recombination or fusion PCR, followed by massively parallel sequencing [125,126,127] as shown in Fig. 4. Barcode-fusion genetics and variations on this theme promise massive increase in both scope and scale.
Bioinformatics
The Yeast Genome Project established the field of functional genomics in the twenty-first century. Concurrently, digital computers and broad accessibility to the Internet made reconstructing genomes from sequence fragments a reality. Bioinformatics played a vital role in the interpretation of the information encoded in the yeast genome. The computational transformation of primary sequence data into biologically relevant information was first reported on a large scale by Frishman and team in 2001; they published the first systematic genome analysis pipeline based on their experience with yeast [128]. Soon after, databases such as Saccharomyces Genome Database (SGD) [25], Comprehensive Yeast Genome Database (CYGD) [129], YeastWeb [129], and others were built to store the raw sequences and bioinformatically refined data. Today, SGD has become a gold-standard resource for genetics and molecular biology of yeast S. cerevisiae. It also provides detailed information about genes and their biological functions, as well as resources and tools for exploring sequence data [130]. SGD has spawned dozens of organism-specific databases with similar aims. While the SGD has evolved into a premier model organism database, BioGRID (Biological General Repository for Interaction Datasets) [131], originally launched in 2006 to comprehensively curate all available biological interaction data generated in yeast, has expanded to encompass other organisms, including human cells. This open-access database resource is highly complementary to SGD, with manually curated protein and genetic interactions from multiple species. To date, BioGRID curators have read more than 197,000 publications, a number which should increase greatly with the broader adoption of large language models or LLMs (see reference [132] for a prescient review of this topic).
To accommodate the increasingly complex and diverse needs of researchers for searching and comparing data, SGD introduced YeastMine [133], a multifaceted search and retrieval environment that provides access to diverse data types. This tool is functionally integrated into Galaxy [134], an interactive system that combines the power of existing genome annotation databases with a web portal to enable researchers to search remote resources, combine data from independent queries, and visualize the results. Increasingly, these research tools are being made available from strain and plasmid repositories such as the well-established American Type Culture Collection (ATCC) and newer sources such as Addgene. Also, bioinformatics and data visualization tools like TheCellMap [135] and discipline-focused databases (such as those curated by in the Nucleic Acids Research annual database issue) add to the growing bioinformatics toolkit.
The open-source software project Cytoscape, which integrates biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework [136], has been instrumental in yeast bioinformatics, especially when used in conjunction with large databases of protein–protein, protein-DNA, and genetic interactions. Cytoscape possesses functionalities to query interaction networks, visually integrate them with expression profiles, phenotypes, and other molecular states, and it can link to databases of functional annotations. Another Cytoscape-based app is GeneMANIA, a web-based tool that helps predict the function of genes and gene sets using a very large set of functional association data [137]; this data includes protein and genetic interactions, pathways, co-expression, co-localization, and protein domain similarity. STRING is another well-known web tool providing orthology prediction, functional and physical network prediction, etc. A similar web tool with additional capabilities to use transcriptional regulation association and mutant phenotype association from yeast has been developed, called YAGM (Yeast Associated Genes Miner) [138].
Whole-genome sequencing, coupled with bioinformatics, has enabled fast and cost-effective mutation identification. Multiple web-based tools developed by the Fritz Roth lab, such as ChromoZoom [139] and FuncBase [140], have been developed. ChromoZoom is a genome browser that hosts tracks for yeast and human genomes, whereas FuncBase enables browsing of quantitative gene function assignments for yeast, mouse, and human genes. In addition, the Roth lab has been instrumental in developing MaveDB, a central repository allowing researchers to store and publish processed multiplex assays of variant effect (MAVE) datasets, such as deep mutational scans and massively parallel reporter assays [141]. This work has been done in collaboration with Douglas Fowler’s lab—who has contributed to developing genome engineering tools and combining cutting-edge genomic methods with computational analyses to measure the consequences of tens of thousands of DNA sequence alterations simultaneously.
To identify new yeast mutants, Mudi or Mutation discovery is a browser-accessible and easy-to-use bioinformatics tool that enables “one-click” identification of causative mutations from sequence data [142]. CRIMEtoYHU (CTY) is a similar web tool that helps geneticists evaluate the functional impact of cancer-associated missense variants. Since S. cerevisiae and humans share thousands of protein-coding genes, yeast humanization is useful for deciphering the functional consequences of human genetic variants found, for example, in cancer and providing information on the pathogenicity of missense variants (see below). CTY finds yeast homologous genes, identifies the corresponding variants, and simultaneously determines the transferability of human variants to their yeast counterparts by assigning a reliability score which may serve as a predictor for the validity of a functional assay. It analyzes and ranks newly identified mutations or mutations from the COSMIC database. Then, it provides information about the functional conservation between yeast and human and shows the mutation distribution in human genes [143].
Humanization of yeast
Functional genomics can be used to further many aims, such as providing a comprehensive description of the overall functioning of a single-cell organism at the systems level. Additionally, by virtue of the evolutionary conservation between yeast and human cells, we may better understand human physiology and pathophysiology. A complementary approach marries these two aims by directly engineering yeast to express human proteins of interest. Specifically, this work involves identifying the human orthologs of yeast genes for expression in yeast. By Eugene Koonin’s definition, “Orthologs are genes originating from a single ancestral gene in the last common ancestor of the compared genomes” [144]. The ortholog–function hypothesis states that orthologous genes have identical or similar functions in divergent species [145]. By exchanging orthologs from one species into another, many individual studies have directly investigated the conservation of function. When direct investigation of human biology is constrained by ethical and practical concerns, model organisms can be useful proxies. For example, S. cerevisiae has proven to be a vital tool for deciphering much of the biology that underpins human cell function and disease. In 1985, for instance, Kataoka et al. showed that yeast cells deleted for RAS genes cannot germinate, but that the expression of a chimeric mammalian/yeast RAS gene under control of a galactose-inducible promotor can complement this defect [146]. Humanized yeast cells can also be employed to investigate the function of human genes in response to drug treatment, including screening small molecules for their activity against human protein targets such as DNA damage checkpoint repair (DDCR) inhibitors [147].
Some of the most highly conserved genes in the human and yeast genomes encode proteins involved in cell machinery (e.g., DNA replication and repair) whose defects lead to various disorders and diseases. Such genes can display a range of genetic variation, which can be difficult to study in the original organism. Humanized yeast provides an in vivo platform for screening drugs or small molecules that inhibit human proteins [148]. For example, FEN1 (human ortholog of RAD27 in yeast) is a protein that functions in DNA replication and repair. The expression of Fen1 in many cancer types is very high, supporting the hyper-proliferation of cancer cells. Phil Hieter’s lab used this system to test two known human FEN1 inhibitors: PTPD (a N-hydroxyurea-based compound) and a derivative of NSC-13755, while both compounds inhibited growth of a humanized FEN1 yeast strain in the presence of MMS, only PTPD was a potent, specific inhibitor of hFEN1. In contrast. NSC-13755 was associated with general toxicity [149].
Since the 2010s, researchers have been engineering yeast with orthologous genes from plants, humans, and even prokaryotes to test their functional compatibility [150,151,152,153,154]. In a landmark study from Edward Marcotte’s laboratory, the “swappability” of 414 essential genes in yeast was tested by replacement with their human orthologs. Nearly half of the human orthologs (47%) could complement the yeast growth defect. The ability of many human genes to substitute for their yeast orthologs indicates the remarkable level of functional conservation in eukaryotic systems throughout billion-year evolutionary periods (Fig. 5). Conspicuously, this high degree of “swappability” was not highly correlated with sequence similarity; instead, genes involved in specific complexes or pathways behaved in the same way [151]. Fully humanized protein complexes may be restricted in their capacity to interact with their correct partners in the setting of a yeast cell, similar to particular humanized sites without the context of their human protein [6]. Indeed, the modular nature of replaceability suggests that this may be the case and hints that the inability to properly form the necessary interactions may be a driving force behind certain proteins being unable to replace their yeast counterparts [6].
An important humanization study was carried out in Jef Boeke’s lab where they coaxed yeast to survive solely with human core histones in their nucleosomes. They used a plasmid-shuffle strategy to replace yeast histones with human counterparts, resulting in a yeast cell with a humanized epigenome [155]. This work showed that the human core histones are able to function in S. cerevisiae (albeit with reduced fitness) without the accessory human genes to deposit them on to DNA. This work, while highlighting the power of yeast humanization, also underscores the challenges—when Truong and Boeke measured the growth rate of humanized yeast under the growth conditions of both yeast and human cells, they found that yeast bearing humanized nucleosomes required genomic suppressors to recover their growth rate [155]. In an interesting twist on complementing yeast mutants with a human ortholog, Sturley’s group has developed a yeast-human model of Niemann-Pick disease and showed that the yeast NPC1 gene can functionally complement cells derived from NPC patients [156].
Numerous complex interactions with various organelles and signaling pathways that are present in human cells are absent in yeast. In these instances, model, humanized strains can be used to examine the pathway in a “clean” background. For example, Boonekamp et al. recently reported the first successful humanization of skeletal muscle glycolysis in yeast, opening up the possibility of exploring human glycolysis in yeast [139]. When paired with evolutionary strategies, single gene and complete pathway transplantation can demonstrate the extraordinary conservation of glycolytic and moonlighting functions as well as context-dependent responses. For instance, the study showed that human hexokinases 1 and 2, but not 4, required alterations in their catalytic or allosteric regions in order to function in yeast, while hexokinase 3 was unable to complement its yeast ortholog. Human glycolytic enzyme turnover rates were preserved in both yeast and human cell cultures when compared to human tissues. The construction of metazoan models that are tailored to certain species, tissues, and diseases is made possible by this example of the transplanting of a complete critical pathway [139]. However, the characterization of the physiological and cellular impact of the transplantation is one of these techniques’ general limitations.
The future of yeast humanization is promising. Perhaps the most exciting avenue for humanized yeast is the potential for constructing “personalized” strains, expressing any given allele of a human gene or combinations of genes, to make personalized yeast avatars. Finally, we need not restrict ourselves to replacing only orthologous genes. While likely not a widespread phenomenon, complementation of yeast mutants by non-orthologous human genes may prove useful, especially in cases where orthologs do not complement or where orthologs cannot be identified.
Yeast as a synthetic biology innovator
Cameron et al. generally describe synthetic biology as the use of molecular biology tools and techniques to forward-engineer a desired function to produce a desired cellular behavior [157]. Although many microorganisms have been exploited using genome engineering to produce a specific cellular function, E. coli and S. cerevisiae are the preeminent test beds of synthetic biology, and they remain crucial drivers of the field. The history of synthetic biology is discussed in a detailed review by Cameron et al. [157], and here, we highlight the landmark events of the field, focusing on yeast contributions.
In the mid-2000s, a crucial milestone of metabolic engineering was published by Jay D. Keasling’s lab, in which they reported the biosynthesis of an antimalarial lactone, artemisinin, in yeast [158]. They also developed E. coli strains capable of producing any terpenoid compound for which a terpene synthase gene is available [159]. Such achievements paved the way for commercial and industrial applications of synthetic biology.
The pioneers of synthetic biology aimed for comprehensive control of cellular function, as envisioned at the SB1.0 conference. Venter and colleagues used new DNA assembly techniques to create a viable bacterial cell that was “rebooted” by a chemically synthesized genome. Subsequently, two teams leveraged CRISPR to minimize the number of chromosomes in haploid yeast cells from 16 to 1 or 2 (aka Sc2.0), and the results were published in the same issue of Nature in August 2018; Jef Boeke’s lab reported a synthetic yeast cell with only two chromosomes; however, fusing these two giant chromosomes was lethal to the cells [160]. In contrast, Shao et al. managed to engineer a functional yeast cell with a single chromosome [161]. Surprisingly, in both n = 1 and n = 2 strains, the expression of only a few genes was significantly different from wild type. Such efforts in synthetic biology will allow us to address very fundamental questions such as the following: (1) why almost all eukaryotes distribute their genome into multiple chromosomes, (2) if particular chromosome numbers can be of benefit for specific species, and (3) how chromosomal structures affect cell viability.
Seven years after the first laboratory-scale synthesis of artemisinin using yeast, Amyris, Inc. engineered an optimized artemisinin acid pathway in yeast, which led to the large-scale production of the drug [162]. As a result, hundreds of thousands of individuals in lower-income countries have access to antimalarial the drug at a low cost. Another event that has indeed revolutionized synthetic biology was the emergence of CRISPR-Cas technology, pioneered by Jennifer Doudna, Emmanuelle Charpentier, and others (for review, see reference [163]). In 2013, George Church’s team reported a CRISPR-based approach for site-specific mutagenesis and allelic replacement in yeast, which demonstrated that the introduction of targeted double-strand breaks significantly enhances the rates of homologous recombination. They reported a fivefold increase in recombination rates when using single-stranded oligonucleotide donors and a remarkable 130-fold increase when employing double-stranded oligonucleotide donors. This study laid the foundation for efficient site-specific mutagenesis and allelic replacement in yeast [164]. In an important proof-of-principle experiment, Christina Smolke’s lab reported the complete biosynthesis of opioids in engineered yeast cells using sugar as the starting material [165]. The resulting yeast cell factories were modified with more than 20 genes expressing enzymes from plants, mammals, bacteria, and yeast itself [165].
Because the S288c strain used in the Sc2.0 project lacks many of the genes that give industrial and environmental isolates their phenotypic variation, Kutyna et al. created a neo-chromosome that incorporates many different yeast pan-genomic components [166]. This “neo-existence” chromosome gives the Sc2.0 parental strain phenotypic plasticity, including an increase in the variety of usable carbon sources. The ability to adapt synthetic strains to a larger range of conditions may thus be made possible by the inclusion of this neo-chromosome within the Sc2.0 backbone. This process will be crucial to moving Sc2.0 from the lab into more practical industrial applications.
As we evolve from genomics as a “read-only” discipline (i.e., decoding genomes by sequencing) to a “read–write discipline” (combining sequencing with synthesis), yeast will remain a primary organism for the development of modular biofoundries for synthetic chemistry of diverse biomolecules, including human pharmaceuticals.
Looking ahead
In 2011, Botstein and Fink published a compelling perspective entitled “Yeast: An Experimental Organism for 21st Century Biology” [167]—an update of the paper published by the same authors 23 years earlier [168]. They initially posited that yeast, owing to a convergence of genetics and molecular biology, was poised to become the premier experimental organism for modern biology. These predictions were prescient, and indeed, yeast has exceeded expectations, particularly with respect to being the chief innovator in the interfacial disciplines of functional genomics and systems biology. From our perch in 2023, we suggest that these new fields will expand in scale, scope, and impact. With regard to single-cell genomics, analysis of yeast represents a powerful means to understand both genetic and epigenetic contributions to cell variation. Finally, the extraordinary advances in yeast bioengineering, including a complete recoding of the genome, promise to bridge the gap between yeast as a living cell and a semisynthetic biosensor.
As has been true for the past 150 years, the impact of yeast on scientific research is vast and not completely predictable. Below, we highlight some future prospects for the technological and experimental development of yeast in the near future, with examples of each.
Three areas to watch
As yeast enters its second century as a model organism, it is fair to ask if its best days are in the rear-view mirror. The list of genetic and molecular features that were once exclusive to yeast experimentalists has undergone a transformation with the advent of NGS and high-performance computing. These technologies, which make any genotype accessible, have fueled the expansion of increasingly sophisticated genome modifications and automated phenotyping. Nevertheless, the institutional knowledge accumulated for yeast and its ability to adapt to new experimental contexts suggest (at least to the authors) that yeast’s second century will be equally fruitful (Fig. 6). Below we highlight just a few of these nonexclusive areas for future research.
Yeast biosensors
One underexplored aspect of yeast biotechnology lies in the field of the so-called living biosensors. Most of the tools are available to design a next-generation living dosimeter (for example) in which the sensitivity or differential sensitivity of yeast strains to an environmental stress could be used as a sensitive, unicellular canary in a coal mine. By way of example, imagine a small badge containing yeast in a semisolid medium where one strain will fluoresce green if it encounters a UV-C light source, while the control strain emits a low level of red fluorescence. An LED-based fluorimeter detects the difference and displays it on a screen or as a holographic image in the subject’s field of view. Now imagine a multiplexed badge contain dozens or hundreds of threat-specific strains. By combining this technology with a simple means to activate the badge (e.g., hydrating a lyophilized, immobilized strain set), these bio-based sensors could be particularly well suited to resource-challenged environments and autonomous, field-based applications.
Indeed, the groundwork for such autonomous applications has been laid, in large part, by the efforts of researchers who have developed increasingly sophisticated electronics to support the growth and sample collection of yeast mutants on space-based missions from Space Lab to the International Space Station to our recent Deep Space Radiation Genomics experiment (DSRG) in which the yeast deletion collections were sent to and returned from lunar orbit in 2022 [169]. This experiment represents the first long-term cosmic radiation exposure of yeast (or any biological material) in over 70 years. In parallel with these space missions, the ground-based controls will help illuminate the adverse effects induced by the complex space environment.
Finally, the idea of genetic modification of yeast (either permanently using CRISPR or transiently via controlled RNA expression) can be useful to identify genes that when modified can offer radiation resistance for long-term missions to Mars and beyond. By using the methods developed to humanize yeast to introduce diverse extremophile genes into yeast, we can directly measure the effects of genes in such extreme environments. While yeast is not a particularly extremophilic organism, with its growth limited to modest ranges of salinity, temperature, etc., its genome offers an excellent platform to systematically test exogenous transgenes for the effects of genotype on extreme environments. Indeed, much of the benchmark or control data already exists for collections of yeast mutants exposed to diverse (drugs, salt, radiation, etc.). The ability to thrive in extreme environments, to test genes for food crops, as sentinels for the effects of environmental change (temp, humidity, flooding/drought cycles, pathogens) already exists, so we have a lot of the raw material to design such experiments.
Yeast avatars
The humanization of yeast has been employed on a gene-by-gene level to catalog a range of functional orthologs and complexes of orthologs, as well as on an allele-by-allele basis to understand the effects of both common and rare polymorphisms. The logical extension of this work will be to generate comprehensive yeast avatars for human individuals to model a range of diseases in specific genomic backgrounds. One can imagine that, by leveraging the advances in synthetic biology and genome editing, the development of a yeast avatar possessing millions of human variants in, for example, drug metabolism and cancer susceptibility genes that would accompany each of us to the pharmacy or doctor’s office.
Robotic scientists
If a bioengineer working when Botstein and Fink published their 2011 update had suggested that, a decade later, we would be analyzing the results of fully autonomous experiments, they would be right in anticipating a cool reception. But, in fact, Steve Oliver and colleagues had already proposed a robot scientist capable of performing yeast genomic studies [170]. Recent advances in simple-to-program, inexpensive lab automation, combined with advances in natural language processing and diverse machine learning algorithms, has us on the precipice of a research community that comprises both carbon-based and silicon-based principal investigators. A case can be made that the unbiased (occasionally derided as “hypothesis-free”) nature of genomics investigations is well-suited to the “ready-fire-aim” approach used to feed machine learning applications. A useful example can be found in the discipline of in-lab evolution [171,172,173,174]. The growth characteristics of yeast make it an ideal in-lab-evolution platform, but the requirements for human intervention to decide on what traits to select for and when to impose selection are arguably better left to an algorithm that can also evolve. By simply combining optical density measurements with on-demand liquid transfers and sample collection, these assays can be maintained indefinitely. The prospect of increasing the autonomy of such a robot scientist by equipping them with LLMs to inform an autonomous analysis seems close to becoming a reality.
Regardless of the precise direction that future yeast research takes, the remarkable adaptability of this model organism is poised to remain a catalyst for groundbreaking discoveries. The unique attributes of yeast make it an invaluable tool for scientists exploring the intricacies of biological processes, ensuring that it will continue to contribute significantly to both fundamental and practical advancements in research. Its versatility not only enriches our understanding of basic biological principles but also holds the promise of impacting diverse fields, from medicine to biotechnology. In essence, the enduring legacy of yeast as a model organism lies in its capacity to inspire discoveries with far-reaching implications across the spectrum of scientific inquiry.
References
Feldmann H. Yeast: molecular and cell biology. Second. Weinheim, Germany: Wiley-VCH Verlag GmbH & Co. KGaA; 2012. pp. 1-3. ISBN: 978-3-527-33252-6.
Duina AA, Miller ME, Keeney JB. Budding yeast for budding geneticists: a primer on the Saccharomyces cerevisiae model system. Genetics. 2014;197:33–48.
Mortimer RK, Schild D, Contopoulou CR, Kans JA. Genetic and physical maps of Saccharomyces cerevisiae. In: Guide to Yeast Genetics and Molecular Biology. Academic Press; 1991. p. 827–63.
Lindegren CC. The yeast cell, its genetics and cytology. St. Louis: Educational Publishers; 1949.
Hicks J, Fink GR. Identification of chromosomal location of yeast DNA from hybrid plasmid pYelueu10. Nature. 1977;269:265–7.
Laurent JM, Young JH, Kachroo AH, Marcotte EM. Efforts to make and apply humanized yeast. Brief Funct Genomics. 2016;15:155–63.
Giaever G, Nislow C. The yeast deletion collection: a decade of functional genomics. Genetics. 2014;197:451–65.
Ghaemmaghami S, Huh W-K, Bower K, Howson RW, Belle A, Dephoure N, et al. Global analysis of protein expression in yeast. Nature. 2003;425:737–41.
Costanzo M, Kuzmin E, van Leeuwen J, Mair B, Moffat J, Boone C, et al. Global genetic networks and the genotype-to-phenotype relationship. Cell. 2019;177:85–100.
Lee AY, St.Onge RP, Proctor MJ, Wallace IM, Nile AH, Spagnuolo PA, et al. Mapping the cellular response to small molecules using chemogenomic fitness signatures. Sci. 2014;344:208–11.
Zhou B, Gitschier J. hCTR1: A human gene for copper uptake identified by complementation in yeast. Proc Natl Acad Sci. 1997;94:7481–6.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921.
Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, et al. Life with 6000 genes. Science. 1996;274:546–67.
O’Donnell S, Yue J-X, Saada OA, Agier N, Caradec C, Cokelaer T, et al. Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae. Nat Genet. 2023. https://0-doi-org.brum.beds.ac.uk/10.1038/s41588-023-01459-y.
Laurent JM, Garge RK, Teufel AI, Wilke CO, Kachroo AH, Marcotte EM. Humanization of yeast genes with multiple human orthologs reveals functional divergence between paralogs. PLoS Biol. 2020;18:e3000627.
Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature. 2003;423:241–54.
Muller LAH, McCusker JH. Nature and distribution of large sequence polymorphisms in Saccharomyces cerevisiae. FEMS Yeast Res. 2011;11:587–94.
Schacherer J, Shapiro JA, Ruderfer DM, Kruglyak L. Comprehensive polymorphism survey elucidates population structure of Saccharomyces cerevisiae. Nature. 2009;458:342–5.
Muller LAH, Lucas JE, Georgianna DR, McCusker JH. Genome-wide association analysis of clinical vs. nonclinical origin provides insights into Saccharomyces cerevisiae pathogenesis. Mol Ecol. 2011;20:4085–97.
Wei W, McCusker JH, Hyman RW, Jones T, Ning Y, Cao Z, et al. Genome sequencing and comparative analysis of Saccharomyces cerevisiae strain YJM789. Proc Natl Acad Sci. 2007;104:12825–30.
Doniger SW, Kim HS, Swain D, Corcuera D, Williams M, Yang S-P, et al. A catalog of neutral and deleterious polymorphism in yeast. Plos Genet. 2008;4:1–15.
Liti G, Carter DM, Moses AM, Warringer J, Parts L, James SA, et al. Population genomics of domestic and wild yeasts. Nature. 2009;458:337–41.
Strope PK, Skelly DA, Kozmin SG, Mahadevan G, Stone EA, Magwene PM, et al. The 100-genomes strains, an S. cerevisiae resource that illuminates its natural phenotypic and genotypic variation and emergence as an opportunistic pathogen. Genome Res. 2015;25:762–74.
Peter J, De Chiara M, Friedrich A, Yue J-X, Pflieger D, Bergström A, et al. Genome evolution across 1,011 Saccharomyces cerevisiae isolates. Nature. 2018;556:339–44.
Cherry JM, Adler C, Ball C, Chervitz SA, Dwight SS, Hester ET, et al. SGD: Saccharomyces genome database. Nucleic Acids Res. 1998;26:73–9.
Gramates LS, Agapite J, Attrill H, Calvi BR, Crosby MA, Dos Santos G, et al. FlyBase: a guided tour of highlighted features. Genetics. 2022;220(4):iyac035.
Blake JA, Baldarelli R, Kadin JA, Richardson JE, Smith CL, Bult CJ. Mouse Genome Database (MGD): knowledgebase for mouse-human comparative biology. Nucleic Acids Res. 2021;49:D981–7.
Fisk DG, Ball CA, Dolinski K, Engel SR, Hong EL, Issel-Tarver L, et al. Saccharomyces cerevisiae S288C genome annotation: a working hypothesis. Yeast. 2006;23:857–65.
Kastenmayer JP, Ni L, Chu A, Kitchen LE, Au W-C, Yang H, et al. Functional genomics of genes with small open reading frames (sORFs) in S. cerevisiae. Genome Res. 2006;16:365–73.
Ross-Macdonald P, Coelho PS, Roemer T, Agarwal S, Kumar A, Jansen R, et al. Large-scale analysis of the yeast genome by transposon tagging and gene disruption. Nature. 1999;402:413–8.
Kumar A, Cheung K-H, Tosches N, Masiar P, Liu Y, Miller P, et al. The TRIPLES database: a community resource for yeast molecular biology. Nucleic Acids Res. 2002;30:73–5.
DeRisi JL, Iyer VR, Brown PO. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science. 1997;278:680–6.
Musso G, Costanzo M, Huangfu M, Smith AM, Paw J, San Luis B-J, et al. The extensive and condition-dependent nature of epistasis among whole-genome duplicates in yeast. Genome Res. 2008;18:1092–9.
Davies CJ, Hutchison CA 3rd. Insertion site specificity of the transposon Tn3. Nucleic Acids Res. 1995;23:507–14.
Chu AM, Davis RW. High-throughput creation of a whole-genome collection of yeast knockout strains. Methods Mol Biol. 2008;416:205–20.
Giaever G, Chu AM, Ni L, Connelly C, Riles L, Véronneau S, et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387–91.
Winzeler EA, Shoemaker DD, Astromoff A, Liang H, Anderson K, Andre B, et al. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Sci. 1999;285:901–6.
Smith AM, Heisler LE, Mellor J, Kaper F, Thompson MJ, Chee M, et al. Quantitative phenotyping via deep barcode sequencing. Genome Res. 2009;19:1836–42.
Birrell GW, Brown JA, Wu HI, Giaever G, Chu AM, Davis RW, et al. Transcriptional response of saccharomyces cerevisiae to DNA-damaging agents does not identify the genes that protect against these agents. Proc Natl Acad Sci U S A. 2002;99:8778–83.
Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–23.
Kedersha N, Anderson P. Regulation of translation by stress granules and processing bodies. Prog Mol Biol Transl Sci. 2009;90:155–85.
Altmann M, Linder P. Power of yeast for analysis of eukaryotic translation initiation. J Biol Chem. 2010;285:31907–12.
Huh W-K, Falvo JV, Gerke LC, Carroll AS, Howson RW, Weissman JS, et al. Global analysis of protein localization in budding yeast. Nature. 2003;425:686–91.
Gelperin DM, White MA, Wilkinson ML, Kon Y, Kung LA, Wise KJ, et al. Biochemical and genetic analysis of the yeast proteome with a movable ORF collection. Genes Dev. 2005;19:2816–26.
Alonso JM, Stepanova AN, Leisse TJ, Kim CJ, Chen H, Shinn P, et al. Genome-wide insertional mutagenesis of Arabidopsis thaliana. Science. 2003;301:653–7.
Baba T, Ara T, Hasegawa M, Takai Y, Okumura Y, Baba M, et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol. 2006;2006(2):0008.
Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams S-L, et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 2002;415:180–3.
Krogan NJ, Cagney G, Yu H, Zhong G, Guo X, Ignatchenko A, et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature. 2006;440:637–43.
Gavin A-C, Aloy P, Grandi P, Krause R, Boesche M, Marzioch M, et al. Proteome survey reveals modularity of the yeast cell machinery. Nature. 2006;440:631–6.
Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, Bertone P, et al. Global analysis of protein activities using proteome chips. Science. 2001;293:2101–5.
Tong AHY, Lesage G, Bader GD, Ding H, Xu H, Xin X, et al. Global mapping of the yeast genetic interaction network. Sci. 2004;303:808 LP – 813.
Costanzo M, VanderSluis B, Koch EN, Baryshnikova A, Pons C, Tan G, et al. A global genetic interaction network maps a wiring diagram of cellular function. Science. 2016;353:aaf1420. https://0-doi-org.brum.beds.ac.uk/10.1126/science.aaf1420.
Primig M, Williams RM, Winzeler EA, Tevzadze GG, Conway AR, Hwang SY, et al. The core meiotic transcriptome in budding yeasts. Nat Genet. 2000;26:415–23.
Mnaimneh S, Davierwala AP, Haynes J, Moffat J, Peng W-T, Zhang W, et al. Exploration of essential gene functions via titratable promoter alleles. Cell. 2004;118:31–44.
Breslow DK, Cameron DM, Collins SR, Schuldiner M, Stewart-Ornstein J, Newman HW, et al. A comprehensive strategy enabling high-resolution functional analysis of the yeast genome. Nat Methods. 2008;5:711–8.
Yan Z, Costanzo M, Heisler LE, Paw J, Kaper F, Andrews BJ, et al. Yeast barcoders: a chemogenomic application of a universal donor-strain collection carrying bar-code identifiers. Nat Methods. 2008;5:719–25.
Jones GM, Stalker J, Humphray S, West A, Cox T, Rogers J, et al. A systematic library for comprehensive overexpression screens in Saccharomyces cerevisiae. Nat Methods. 2008;5:239–41.
Bharucha N, Ma J, Dobry CJ, Lawson SK, Yang Z, Kumar A. Analysis of the yeast kinome reveals a network of regulated protein localization during filamentous growth. Mol Biol Cell. 2008;19:2708–17.
Ho CH, Magtanong L, Barker SL, Gresham D, Nishimura S, Natarajan P, et al. A molecular barcoded yeast ORF library enables mode-of-action analysis of bioactive compounds. Nat Biotechnol. 2009;27:369–77.
Kofoed M, Milbury KL, Chiang JH, Sinha S, Ben-Aroya S, Giaever G, et al. An updated collection of sequence barcoded temperature-sensitive alleles of yeast essential genes. G3 Genes, Genomes, Genet. 2015;5:1879–87.
Tong AHY, Evangelista M, Parsons AB, Xu H, Bader GD, Pagé N, et al. Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science. 2001;294:2364–8.
Pan X, Yuan DS, Ooi S-L, Wang X, Sookhai-Mahadeo S, Meluh P, et al. dSLAM analysis of genome-wide genetic interactions in Saccharomyces cerevisiae. Methods. 2007;41:206–21.
Costanzo M, Baryshnikova A, Bellay J, Kim Y, Spear ED, Sevier CS, et al. The genetic landscape of a cell. Science. 2010;327:425–31.
Costanzo M, Hou J, Messier V, Nelson J, Rahman M, VanderSluis B, et al. Environmental robustness of the global yeast genetic interaction network. Sci. 2021;372:eabf8424.
Schuldiner M, Collins SR, Thompson NJ, Denic V, Bhamidipati A, Punna T, et al. Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell. 2005;123:507–19.
Yofe I, Weill U, Meurer M, Chuartzman S, Zalckvar E, Goldman O, et al. One library to make them all: streamlining the creation of yeast libraries via a SWAp-Tag strategy. Nat Methods. 2016;13:371–8.
Weill U, Yofe I, Sass E, Stynen B, Davidi D, Natarajan J, et al. Genome-wide SWAp-Tag yeast libraries for proteome exploration. Nat Methods. 2018;15:617–22.
Cheung-Ong K, Song KT, Ma Z, Shabtai D, Lee AY, Gallo D, et al. Comparative chemogenomics to examine the mechanism of action of DNA-targeted platinum-acridine anticancer agents. ACS Chem Biol. 2012;7:1892–901.
Cheung-Ong K, Giaever G, Nislow C. DNA-damaging agents in cancer chemotherapy: serendipity and chemical biology. Chem Biol. 2013;20:648–59.
Lee W, St.Onge RP, Proctor M, Flaherty P, Jordan MI, Arkin AP, et al. Genome-wide requirements for resistance to functionally distinct DNA-damaging agents. PLoS Genet. 2005;1:235–46.
Mira NP, Teixeira MC, Sá-Correia I. Adaptive response and tolerance to weak acids in Saccharomyces cerevisiae: a genome-wide view. OMICS. 2010;14:525–40.
Deutschbauer AM, Jaramillo DF, Proctor M, Kumm J, Hillenmeyer ME, Davis RW, et al. Mechanisms of haploinsufficiency revealed by genome-wide profiling in yeast. Genetics. 2005;169:1915–25.
Papp B, Pál C, Hurst LD. Dosage sensitivity and the evolution of gene families in yeast. Nature. 2003;424:194–7.
Roote J, Russell S. Toward a complete Drosophiladeficiency kit. Genome Biol. 2012;13:149.
Narla A, Ebert BL. Ribosomopathies: human disorders of ribosome dysfunction. Blood. 2010;115:3196–205.
Fancello L, Kampen KR, Hofman IJF, Verbeeck J, De Keersmaecker K. The ribosomal protein gene RPL5 is a haploinsufficient tumor suppressor in multiple cancer types. Oncotarget. 2017;8:14462–78.
Berger AH, Pandolfi PP. Haplo-insufficiency: a driving force in cancer. J Pathol. 2011;223:137–46.
Steinmetz LM, Scharfe C, Deutschbauer AM, Mokranjac D, Herman ZS, Jones T, et al. Systematic screen for human disease genes in yeast. Nat Genet. 2002;31:400–4.
Dimmer KS, Fritz S, Fuchs F, Messerschmitt M, Weinbach N, Neupert W, et al. Genetic basis of mitochondrial function and morphology in Saccharomyces cerevisiae. Mol Biol Cell. 2002;13:847–53.
Luban C, Beutel M, Stahl U, Schmidt U. Systematic screening of nuclear encoded proteins involved in the splicing metabolism of group II introns in yeast mitochondria. Gene. 2005;354:72–9.
Merz S, Westermann B. Genome-wide deletion mutant analysis reveals genes required for respiratory growth, mitochondrial genome maintenance and mitochondrial protein synthesis in Saccharomyces cerevisiae. Genome Biol. 2009;10:R95.
Peter JJ, Watson TL, Walker ME, Gardner JM, Lang TA, Borneman A, et al. Use of a wine yeast deletion collection reveals genes that influence fermentation performance under low-nitrogen conditions. FEMS Yeast Res. 2018;18(3):foy009.
Ryan O, Shapiro RS, Kurat CF, Mayhew D, Baryshnikova A, Chin B, et al. Global gene deletion analysis exploring yeast filamentous growth. Science. 2012;337:1352–6.
Deutschbauer AM, Davis RW. Quantitative trait loci mapped to single-nucleotide resolution in yeast. Nat Genet. 2005;37:1333–40.
Acton E, Huei-Yi Lee A, Zhao PJ, Flibotte S, Neira M, Sinha S, et al. Comparative functional genomic screens of three yeast deletion collections reveal unexpected effects of genotype in response to diverse stress. Open Biol. 2017;7:160330. https://0-doi-org.brum.beds.ac.uk/10.1098/rsob.160330
Turco G, Chang C, Wang RY, Kim G, Stoops EH, Richardson B, et al. Global analysis of the yeast knockout phenome. Sci Adv. 2023;9:eadg5702.
Golemis EA, Serebriiskii I, Finley RLJ, Kolonin MG, Gyuris J, Brent R. Interaction trap/two-hybrid system to identify interacting proteins. Curr Protoc Mol Biol. 2008;80(Chapter 20):Unit 20.1.1-20.1.35.
Brückner A, Polge C, Lentze N, Auerbach D, Schlattner U. Yeast two-hybrid, a powerful tool for systems biology. Int J Mol Sci. 2009;10:2763–88.
Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci. 2001;98:4569–74.
Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, et al. A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature. 2000;403:623–7.
Huang H, Jedynak BM, Bader JS. Where have all the interactions gone? Estimating the coverage of two-hybrid protein interaction maps. PLOS Comput Biol. 2007;3:1–20.
Sahtoe DD, Praetorius F, Courbet A, Hsia Y, Wicky BIM, Edman NI, et al. Reconfigurable asymmetric protein assemblies through implicit negative design. Sci. 2022;375:eabj7662.
Johnson KL, Qi Z, Yan Z, Wen X, Nguyen TC, Zaleta-Rivera K, et al. Revealing protein-protein interactions at the transcriptome scale by sequencing. Mol Cell. 2021;81:4091-4103.e9.
Hart GT, Lee I, Marcotte ER. A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality. BMC Bioinformatics. 2007;8:236.
Humphreys IR, Pei J, Baek M, Krishnakumar A, Anishchenko I, Ovchinnikov S, et al. Computed structures of core eukaryotic protein complexes. Sci. 2021;374:eabm4805.
Cao L, Coventry B, Goreshnik I, Huang B, Park JS, Jude KM, et al. Design of protein binding proteins from target structure alone. Nature. 2022. https://0-doi-org.brum.beds.ac.uk/10.1038/s41586-022-04654-9.
Forsburg SL. The art and design of genetic screens: yeast. Nat Rev Genet. 2001;2:659–68.
Basson ME, Moore RL, O’Rear J, Rine J. Identifying mutations in duplicated functions in Saccharomyces cerevisiae: recessive mutations in HMG-CoA reductase genes. Genetics. 1987;117:645–55.
Rine J, Hansen W, Hardeman E, Davis RW. Targeted selection of recombinant clones through gene dosage effects. Proc Natl Acad Sci. 1983;80:6750–4.
Butcher RA, Bhullar BS, Perlstein EO, Marsischky G, LaBaer J, Schreiber SL. Microarray-based method for monitoring yeast overexpression strains reveals small-molecule targets in TOR pathway. Nat Chem Biol. 2006;2:103–9.
Sopko R, Huang D, Preston N, Chua G, Papp B, Kafadar K, et al. Mapping pathways and phenotypes by systematic gene overexpression. Mol Cell. 2006;21:319–30.
Tugendreich S, Perkins E, Couto J, Barthmaier P, Sun D, Tang S, et al. A streamlined process to phenotypically profile heterologous cDNAs in parallel using yeast cell-based assays. Genome Res. 2001;11:1899–912.
Arnoldo A, Curak J, Kittanakom S, Chevelev I, Lee VT, Sahebol-Amri M, et al. Identification of small molecule inhibitors of Pseudomonas aeruginosa exoenzyme S using a yeast phenotypic screen. PLoS Genet. 2008;4(4):10.1371.
Fleming J, Outeiro TF, Slack M, Lindquist SL, Bulawa CE. Detection of compounds that rescue Rab1-synuclein toxicity. Methods Enzymol. 2008;439:339–51.
Giaever G, Shoemaker DD, Jones TW, Liang H, Winzeler EA, Astromoff A, et al. Genomic profiling of drug sensitivities via induced haploinsufficiency. Nat Genet. 1999;21:278–83.
Hillenmeyer ME, Fung E, Wildenhain J, Pierce SE, Hoon S, Lee W, et al. The chemical genomic portrait of yeast: uncovering a phenotype for all genes. Science. 2008;320:362–5.
Hoepfner D, Helliwell SB, Sadlish H, Schuierer S, Filipuzzi I, Brachat S, et al. High-resolution chemical dissection of a model eukaryote reveals targets, pathways and gene functions. Microbiol Res. 2014;169:107–20.
Giaever G, Flaherty P, Kumm J, Proctor M, Nislow C, Jaramillo DF, et al. Chemogenomic profiling: identifying the functional interactions of small molecules in yeast. Proc Natl Acad Sci. 2004;101:793–8.
Hillenmeyer ME, Ericson E, Davis RW, Nislow C, Koller D, Giaever G. Systematic analysis of genome-wide fitness data in yeast reveals novel gene function and drug action. Genome Biol. 2010;11:R30. https://0-doi-org.brum.beds.ac.uk/10.1186/gb-2010-11-3-r30.
Lain S, Hollick JJ, Campbell J, Staples OD, Higgins M, Aoubala M, et al. Discovery, in vivo activity, and mechanism of action of a small-molecule p53 activator. Cancer Cell. 2008;13:454–63.
Hughes TR, Marton MJ, Jones AR, Roberts CJ, Stoughton R, Armour CD, et al. Functional discovery via a compendium of expression profiles. Cell. 2000;102:109–26.
Parsons AB, Brost RL, Ding H, Li Z, Zhang C, Sheikh B, et al. Integration of chemical-genetic and genetic interaction data links bioactive compounds to cellular target pathways. Nat Biotechnol. 2004;22:62–9.
Parsons AB, Lopez A, Givoni IE, Williams DE, Gray CA, Porter J, et al. Exploring the mode-of-action of bioactive compounds by chemical-genetic profiling in yeast. Cell. 2006;126:611–25.
Piotrowski JS, Li SC, Deshpande R, Simpkins SW, Nelson J, Yashiroda Y, et al. Functional annotation of chemical libraries across diverse biological processes. Nat Chem Biol. 2017;13:982–93.
Zhou Y, Li G, Dong J, Xing X, Dai J, Zhang C. MiYA, an efficient machine-learning workflow in conjunction with the YeastFab assembly strategy for combinatorial optimization of heterologous metabolic pathways in Saccharomyces cerevisiae. Metab Eng. 2018;47:294–302.
Culley C, Vijayakumar S, Zampieri G, Angione C. A mechanism-aware and multiomic machine-learning pipeline characterizes yeast cell growth. Proc Natl Acad Sci. 2020;117:18869–79.
Fu C, Zhang X, Veri AO, Iyer KR, Lash E, Xue A, et al. Leveraging machine learning essentiality predictions and chemogenomic interactions to identify antifungal targets. Nat Commun. 2021;12:6497.
Kuzmin E, VanderSluis B, Nguyen Ba AN, Wang W, Koch EN, Usaj M, et al. Exploring whole-genome duplicate gene retention with complex genetic interaction analysis. Science. 2020;368:eaaz5667. https://0-doi-org.brum.beds.ac.uk/10.1126/science.aaz5667
Smith JD, Suresh S, Schlecht U, Wu M, Wagih O, Peltz G, et al. Quantitative CRISPR interference screens in yeast identify chemical-genetic interactions and new rules for guide RNA design. Genome Biol. 2016;17:45.
Momen-Roknabadi A, Oikonomou P, Zegans M, Tavazoie S. An inducible CRISPR interference library for genetic interrogation of Saccharomyces cerevisiae biology. Commun Biol. 2020;3:723.
Roy KR, Smith JD, Vonesch SC, Lin G, Tu CS, Lederer AR, et al. Multiplexed precision genome editing with trackable genomic barcodes in yeast. Nat Biotechnol. 2018;36:512–20.
Lian J, HamediRad M, Hu S, Zhao H. Combinatorial metabolic engineering using an orthogonal tri-functional CRISPR system. Nat Commun. 2017;8:1688.
Lian J, Schultz C, Cao M, HamediRad M, Zhao H. Multi-functional genome-wide CRISPR system for high throughput genotype–phenotype mapping. Nat Commun. 2019;10:5794.
Alford BD, Tassoni-Tsuchida E, Khan D, Work JJ, Valiant G, Brandman O. ReporterSeq reveals genome-wide dynamic modulators of the heat shock response across diverse stressors. Elife. 2021;10:e57376.
Evans-Yamamoto D, Rouleau FD, Nanda P, Makanae K, Liu Y, Després PC, et al. Barcode fusion genetics-protein-fragment complementation assay (BFG-PCA): tools and resources that expand the potential for binary protein interaction discovery. Nucleic Acids Res. 2022;50: e54.
Díaz-Mejía JJ, Celaj A, Mellor JC, Coté A, Balint A, Ho B, et al. Mapping DNA damage-dependent genetic interactions in yeast via party mating and barcode fusion genetics. Mol Syst Biol. 2018;14:1–17.
Yachie N, Petsalaki E, Mellor JC, Weile J, Jacob Y, Verby M, et al. Pooled-matrix protein interaction screens using Barcode Fusion Genetics. Mol Syst Biol. 2016;12:863.
Frishman D, Albermann K, Hani J, Heumann K, Metanomski A, Zollner A, et al. Functional and structural genomics using PEDANT. Bioinformatics. 2001;17:44–57.
Güldener U, Münsterkötter M, Kastenmüller G, Strack N, van Helden J. CYGD: the Comprehensive Yeast Genome Database. Nucleic Acids Res. 2005;33(suppl_1):D364-8.
Issel-Tarver L, Christie KR, Dolinski K, Andrada R, Balakrishnan R, Ball CA, et al. Saccharomyces genome database. Methods Enzymol. 2002;350:329–46.
Oughtred R, Rust J, Chang C, Breitkreutz B-J, Stark C, Willems A, et al. The BioGRID database: a comprehensive biomedical resource of curated protein, genetic, and chemical interactions. Protein Sci. 2021;30:187–200.
Yip KY, Cheng C, Gerstein M. Machine learning and genome annotation: a match meant to be? Genome Biol. 2013;14:205.
Balakrishnan R, Park J, Karra K, Hitz BC, Binkley G, Hong EL, et al. YeastMine--an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit. Database (Oxford). 2012;2012:bar062.
Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, et al. Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 2005;15:1451–5.
Usaj M, Tan Y, Wang W, VanderSluis B, Zou A, Myers CL, et al. TheCellMap.org: a web-accessible database for visualizing and mining the global yeast genetic interaction network. G3 Genes|Genomes|Genetics. 2017;7:1539–49.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–504.
Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q. GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008;9(Suppl 1):S4.
Wu W-S, Wang C-C, Jhou M-J, Wang Y-C. YAGM: a web tool for mining associated genes in yeast based on diverse biological associations. BMC Syst Biol. 2015;9:S1.
Pak TR, Roth FP. ChromoZoom: a flexible, fluid, web-based genome browser. Bioinformatics. 2013;29:384–6.
Beaver JE, Taşan M, Gibbons FD, Tian W, Hughes TR, Roth FP. FuncBase : a resource for quantitative gene function annotation. Bioinformatics. 2010;26:1806–7.
Esposito D, Weile J, Shendure J, Starita LM, Papenfuss AT, Roth FP, et al. MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect. Genome Biol. 2019;20:223.
Iida N, Yamao F, Nakamura Y, Iida T. Mudi, a web tool for identifying mutations by bioinformatics analysis of whole-genome sequence. Genes Cells. 2014;19:517–27.
Mercatanti A, Lodovichi S, Cervelli T, Galli A. CRIMEtoYHU: a new web tool to develop yeast-based functional assays for characterizing cancer-associated missense variants. FEMS Yeast Res. 2017;17(8):fox078. https://0-doi-org.brum.beds.ac.uk/10.1093/femsyr/fox078.
Koonin EV. Orthologs, paralogs, and evolutionary genomics. Annu Rev Genet. 2005;39:309–38.
Gabaldón T, Koonin EV. Functional and evolutionary implications of gene orthology. Nat Rev Genet. 2013;14:360–6.
Kataoka T, Powers S, Cameron S, Fasano O, Goldfarb M, Broach J, et al. Functional homology of mammalian and yeast RAS genes. Cell. 1985;40:19–26.
Tamble CM, St. Onge RP, Giaever G, Nislow C, Williams AG, Stuart JM, et al. The synthetic genetic interaction network reveals small molecules that target specific pathways in Sacchromyces cerevisiae. Mol Biosyst. 2011;7:2019–30.
Brown GW, Andrews B. Setting molecular traps in yeast for identification of anticancer drug targets. Proc Natl Acad Sci. 2021;118(18):e2105547118.
Hamza A, Driessen MRM, Tammpere E, O’Neil NJ, Hieter P. Cross-species complementation of nonessential yeast genes establishes platforms for testing inhibitors of human proteins. Genetics. 2020;214:735–47.
Smith AG, Santana MA, Wallace-Cook AD, Roper JM, Labbe-Bois R. Isolation of a cDNA encoding chloroplast ferrochelatase from Arabidopsis thaliana by functional complementation of a yeast mutant. J Biol Chem. 1994;269:13405–13.
Kachroo AH, Laurent JM, Yellman CM, Meyer AG, Wilke CO, Marcotte EM. Systematic humanization of yeast genes reveals conserved functions and genetic modularity. Science. 2015;348:921–5.
Hamza A, Tammpere E, Kofoed M, Keong C, Chiang J, Giaever G, et al. Complementation of yeast genes with human genes as an experimental platform for functional testing of human genetic variants. Genetics. 2015;201:1263–74.
Yang F, Sun S, Tan G, Costanzo M, Hill DE, Vidal M, et al. Identifying pathogenicity of human variants via paralog-based yeast complementation. PLOS Genet. 2017;13:1–21.
Kachroo AH, Laurent JM, Akhmetov A, Szilagyi-Jones M, McWhite CD, Zhao A, et al. Systematic bacterialization of yeast genes identifies a near-universally swappable pathway. Elife. 2017;6:e25093.
Truong DM, Boeke JD. Resetting the yeast epigenome with human nucleosomes. Cell. 2017;171:1508-1519.e13.
Munkacsi AB, Chen FW, Brinkman MA, Higaki K, Gutiérrez GD, Chaudhari J, et al. An “exacerbate-reverse” strategy in yeast identifies histone deacetylase inhibition as a correction for cholesterol and sphingolipid transport defects in human Niemann-Pick type C disease. J Biol Chem. 2011;286:23842–51.
Cameron DE, Bashor CJ, Collins JJ. A brief history of synthetic biology. Nat Rev Microbiol. 2014;12:381–90.
Ro D-K, Paradise EM, Ouellet M, Fisher KJ, Newman KL, Ndungu JM, et al. Production of the antimalarial drug precursor artemisinic acid in engineered yeast. Nature. 2006;440:940–3.
Martin VJJ, Pitera DJ, Withers ST, Newman JD, Keasling JD. Engineering a mevalonate pathway in Escherichia coli for production of terpenoids. Nat Biotechnol. 2003;21:796–802.
Luo J, Sun X, Cormack BP, Boeke JD. Karyotype engineering by chromosome fusion leads to reproductive isolation in yeast. Nature. 2018;560:392–6.
Shao Y, Lu N, Wu Z, Cai C, Wang S, Zhang L-L, et al. Creating a functional single-chromosome yeast. Nature. 2018;560:331–5.
Paddon CJ, Westfall PJ, Pitera DJ, Benjamin K, Fisher K, McPhee D, et al. High-level semi-synthetic production of the potent antimalarial artemisinin. Nature. 2013;496:528–32.
Lander ES. The Heroes of CRISPR. Cell. 2016;164:18–28.
DiCarlo JE, Norville JE, Mali P, Rios X, Aach J, Church GM. Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems. Nucleic Acids Res. 2013;41:4336–43.
Galanie S, Thodey K, Trenchard IJ, Interrante MF, Smolke CD. Complete biosynthesis of opioids in yeast. Science. 2015;349:1095–100.
Kutyna DR, Onetto CA, Williams TC, Goold HD, Paulsen IT, Pretorius IS, et al. Construction of a synthetic Saccharomyces cerevisiae pan-genome neo-chromosome. Nat Commun. 2022;13:3628.
Botstein D, Fink GR. Yeast: an experimental organism for 21st century biology. Genetics. 2011;189:695–704.
Botstein D, Fink GR. Yeast: an experimental organism for modern biology. Science. 1988;240:1439–43.
Zea L, Piper SS, Gaikani H, Khoshnoodi M, Niederwieser T, Hoehn A, et al. Experiment verification test of the Artemis I ‘Deep Space Radiation Genomics’ experiment. Acta Astronaut. 2022;198:702–6.
King RD, Whelan KE, Jones FM, Reiser PGK, Bryant CH, Muggleton SH, et al. Functional genomic hypothesis generation and experimentation by a robot scientist. Nature. 2004;427:247–52.
Coutant A, Roper K, Trejo-Banos D, Bouthinon D, Carpenter M, Grzebyta J, et al. Closed-loop cycles of experiment design, execution, and learning accelerate systems biology model development in yeast. Proc Natl Acad Sci U S A. 2019;116:18142–7.
Beal J, Rogers M. Levels of autonomy in synthetic biology engineering. Mol Syst Biol. 2020;16: e10019.
Yachie N, Natsume T. Robotic crowd biology with Maholo LabDroids. Nat Biotechnol. 2017;35:310–2.
Wong BG, Mancuso CP, Kiriakov S, Bashor CJ, Khalil AS. Precise, automated control of conditions for high-throughput growth of yeast and bacteria with eVOLVER. Nat Biotechnol. 2018;36:614–23.
Review history
The review history is available as Additional file 1.
Peer review information
Andrew Cosgrove was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Funding
This work was supported in part by a CRC tier 1 chair to CN.
Author information
Authors and Affiliations
Contributions
Conceived of the review, HKG, CN, and GG. Writing, HKG, DK, CN, and GG. Figures, MS and CN. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Review history.
Glossary
- ARS
-
Autonomously replicating sequence
- ATCC
-
American Type Culture Collection
- COSMIC
-
Catalogue Of Somatic Mutations In Cancer
- CRIMEtoYHU
-
Choosing the Right Cancer-Associated Mutation for Evaluation to Yeast HUmanization
- CRISPR
-
Clustered regularly interspaced short palindromic repeats
- CRISPR-AID
-
Trifunctional CRISPR system comprising CRISPRa, CRISPRi, and CRISPRd
- CRISPRi
-
CRISPR interference
- CYGD
-
Comprehensive Yeast Genome Database
- DAmP
-
Decreased abundance by mRNA perturbation
- DDCR
-
DNA damage checkpoint repair
- dSLAM
-
Diploid-based synthetic lethality analysis on microarray
- HIP–HOP
-
Haploinsufficiency profiling–homozygous profiling
- MAGESTIC
-
Multiplexed accurate genome editing with short, trackable, integrated cellular barcodes
- MAGIC
-
Multifunctional genome-wide CRISPR
- MAVE
-
Multiplexed assays of variant effect
- MoBY-ORF
-
Molecular-barcoded yeast ORF
- MSP
-
Multicopy suppression profiling
- NGS
-
Next-Generation Sequencing
- NHEJ
-
Nonhomologous end joining
- PAMs
-
Protospacer adjacent motifs
- PPIs
-
Protein–protein interactions
- SGA
-
Synthetic genetic array
- SGD
-
Saccharomyces Genome Database
- SNP
-
Single-nucleotide polymorphisms
- STRING
-
Search Tool for the Retrieval of Interacting Genes
- TAP
-
Tandem affinity purification
- TEV
-
Tobacco etch virus
- Tn
-
Transposon
- ts
-
Temperature sensitive
- UMI
-
Unique molecular identifier
- uORFs
-
Upstream open reading frames
- Y2H
-
Yeast two-hybrid
- YAGM
-
Yeast-Associated Genes Miner
- YKO
-
Yeast knockout
- YPD
-
Yeast Protein Database
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Gaikani, H.K., Stolar, M., Kriti, D. et al. From beer to breadboards: yeast as a force for biological innovation. Genome Biol 25, 10 (2024). https://0-doi-org.brum.beds.ac.uk/10.1186/s13059-023-03156-9
Received:
Accepted:
Published:
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s13059-023-03156-9