 Your new post is loading...
|
Scooped by
mhryu@live.com
Today, 5:28 PM
|
Methanol has emerged as a sustainable C1 feedstock owing to its compatibility with existing infrastructure and the potential for renewable production from CO₂ and green hydrogen. Methylotrophic yeasts, including Komagataella phaffii (Pichia pastoris) and Ogataea polymorpha, can natively assimilate methanol and therefore represent attractive hosts for biomanufacturing. However, industrial application of methanol-based processes remains constrained by cytotoxicity, redox imbalance, and limited productivity compared to sugar-based fermentations. To address these challenges, extensive metabolic engineering strategies have been implemented to enhance methanol assimilation and redirect carbon flux toward value-added products. Over the past decade, remarkable progress has been achieved through the development of synthetic methylotrophy in non-methylotrophic yeasts, the expansion of product portfolios to glycans, fatty acid derivatives, polyketides, terpenoids, organic acids, and polyols, and the integration of multi-omics tools for systems-level design. This review summarizes recent advances in methanol assimilation enhancement, synthetic pathway construction, and fermentation engineering, highlighting strategies such as metabolic engineering and dynamic bioprocess control. In addition, current challenges and future perspectives are discussed with an emphasis on overcoming toxicity, improving efficiency, and establishing advanced methylotrophic yeasts as robust cell factories for sustainable C1-based biomanufacturing.
|
Scooped by
mhryu@live.com
Today, 5:21 PM
|
Precise regulation of protein abundance is essential for cellular function and physiology. Conventional approaches are often limited by insufficient resolution or unintended crosstalk. In contrast, orthogonal control technologies enable programmable and precise modulation of protein abundance while remaining insulated from native networks. In this review, we summarize the development and application of regulation technologies with different orthogonality across multiple levels. Orthogonal transcriptional control primarily involves the design and engineering of orthogonal RNA polymerases and transcription factors; orthogonal translational regulation focuses on advances in genetic codon expansion and post-translational modifications; targeted protein degradation and compartmentalized regulation are also discussed. Finally, we highlight the integration across the different levels described above. This review might bring disruptive insights and conceptual breakthroughs to precision medicine and sustainable biomanufacturing.
|
Scooped by
mhryu@live.com
Today, 5:16 PM
|
Quorum sensing (QS) is a cell–cell communication mechanism that enables bacteria to coordinate gene expression in response to population density and community composition. In many pathogens, QS plays a central role in host colonization and virulence, making it an attractive target for antimicrobial intervention. Synthetic biology offers powerful tools to exploit this vulnerability by either disrupting QS signaling or engineering microorganisms with QS-based circuits to detect and eliminate pathogens. In this review, we examine how QS and QS interference can be harnessed for QS circuit engineering and translated into applications such as therapeutic microorganisms. We also highlight the transition of QS research from fundamental microbiology to translational biotechnology, underscoring its potential to drive innovative strategies against microbial virulence and antimicrobial resistance.
|
Scooped by
mhryu@live.com
Today, 5:07 PM
|
Fungal exopolysaccharides (EPSs) are increasingly recognized as structurally programmable microbial polymers with applications spanning biomedicine, materials engineering, food systems, and environmental technologies. While previous reviews have often addressed fungal EPS diversity, production variables, or application domains separately, an integrated framework linking biosynthesis, molecular architecture, process control, and translational manufacturing remains underdeveloped. This review positions fungal EPSs as next-generation biomaterials by integrating (i) biochemical and genetic regulation of EPS biosynthesis, (ii) structure–function mapping across major polymer classes, (iii) cultivation and downstream processing workflows that enable reproducible product specifications, and (iv) industrial translation pathways within scalable and sustainability-aligned biomanufacturing systems. Gene-cluster–resolved case studies and process-to-product design principles illustrate how metabolic flux, fermentation parameters, and polymer modification shape functional performance. Current bottlenecks—including strain-dependent variability, purification complexity, quality harmonization, and techno-economic constraints—are critically evaluated to distinguish laboratory potential from scalable feasibility. By shifting from descriptive cataloging toward platform-based engineering logic, this review provides a translational roadmap for rational fungal EPS design within standardized and application-driven manufacturing frameworks.
|
Scooped by
mhryu@live.com
Today, 3:46 PM
|
The tricarboxylic acid (TCA) cycle is an essential part of the central metabolic hub that provides energy and biosynthetic precursors. Efficient regulation of central carbon flux is critical for maintaining optimal productivity of microbial cell factories (MCFs). However, biosensors capable of sensing TCA intermediates remain limited. Here, we engineered the catabolite control protein C (CcpC) from Bacillus species to reconstruct citrate-responsive biosensors in E. coli. Through hybrid promoter engineering, we systematically characterized and identified the functional roles of two CcpC binding sites. By applying the hybrid promoter, the engineered biosensor BcCcpC-PLBs exhibited the broadest dynamic range and highest expression level among its counterparts. Ligand profiling revealed the diverse responsiveness of BcCcpC to multiple metabolites of the TCA cycle. By structure-guided mutagenesis of BcCcpC, the obtained variant BcCcpC(S138L) exhibited an improved dynamic range of up to 3.02-fold under 80 mM citrate induction. This work establishes the first transcription factor (TF)-based citrate-responsive biosensor, which broadens the regulatory toolkit for central metabolism engineering.
|
Scooped by
mhryu@live.com
Today, 1:31 AM
|
DNA methylation plays critical roles in gene regulation in bacteria, from regulating essential processes like the cell cycle to phenotypes of practical interest like pathogenicity and motility. Synthetic manipulation of global methylation levels has broad impacts on cellular physiology, changing expression patterns of hundreds of genes. However, whether or how environmental variation in natural settings similarly impacts DNA methylation patterns has been unclear. In this work, using the alphaproteobacteria Methylobacterium extorquens and Caulobacter crescentus as model systems, we discover the methylome is highly fluid in response to environmental variation, with different environments leading to distinct patterns of increased or decreased methylation levels along the chromosome. Despite a heterogeneous effect of different environments on methylation patterns, we find a general principle where the dependence of methylation states on position in the genome decreases in proportion to growth rate. A simple model that considers the methylation state through different phases of the cell cycle as a function of distance from an origin provides a framework to interpret the effects of different stressors upon the observed environmental responsiveness of the methylation patterns. Our work highlights how sequencing data alone can shed light on important aspects of microbial physiology.
|
Scooped by
mhryu@live.com
Today, 1:24 AM
|
Plasmids frequently impose measurable fitness costs on their bacterial hosts, yet they remain abundant across clinical and environmental microbiomes. This apparent contradiction, known as the plasmid paradox, has traditionally been explained through mechanisms such as horizontal gene transfer, compensatory evolution, addiction systems, and fluctuating selection. Here we suggest that part of the paradox may arise from implicit physiological assumptions embedded in most empirical measurements—specifically, the assumption that growth rate is a direct proxy for fitness and that plasmid burden necessarily reduces it. We argue that these assumptions may not hold under many ecological conditions. We formalize cell division time as the maximum of several required cellular modules, including cytoplasmic biosynthesis and membrane or envelope synthesis. If plasmid carriage primarily increases cytoplasmic demand, its cost will be expressed only when cytoplasmic processes constitute the dominant bottleneck for growth. When other modules limit division, plasmid-associated burdens may be physiologically real yet evolutionarily silent. More broadly, equating fitness with maximal exponential growth rate overlooks well-established growth-survival trade-offs in bacteria, suggesting that plasmid costs measured under optimized laboratory conditions may systematically overestimate ecological selection against plasmid carriage.
|
Scooped by
mhryu@live.com
Today, 12:41 AM
|
Bottom-up manufacturing of structural DNA nanotechnology requires a long single-stranded DNA (ssDNA) scaffold and hundreds of short (~30 nt) ssDNA staples. However, scaling production remains bottlenecked by the high economic cost and environmental footprint of solid-phase chemical staple synthesis. To address these limitations, a phage-free, biological nanomanufacturing platform engineered in E. coli is developed here. Two intracellular strategies for producing programmable ssDNA were systematically evaluated: retron-based multicopy ssDNA (msDNA) synthesis via the Ec67 system and plasmid-encoded rolling circle replication (RCR). While native structural topology constraints within the retron (msd) cassette limit its sequence-design flexibility, the alternative RCR-driven engine successfully decouples ssDNA replication from sequence secondary structures, enabling the synthesis of arbitrary staples. This RCR platform reliably generates long circular ssDNA (cssDNA) precursors of at least 1.8 kb with exceptional sequence fidelity (>99%). Integrating programmable BseGI cleavage sites allows targeted strand-selective enzymatic processing to cleanly release stoichiometric, origami-grade pools of 32-nt staple strands. Atomic force microscopy (AFM) confirms that these biologically produced staples successfully drive the high-fidelity self-assembly of complex DNA tiles and hollow tubules. Crucially, robust structural folding is demonstrated directly within crude, unpurified cellular lysates, establishing a green, cost-effective framework for the one-pot fabrication of advanced DNA-based nanomaterials.
|
Scooped by
mhryu@live.com
Today, 12:15 AM
|
Climate change increasingly threatens global agriculture by intensifying abiotic stresses and destabilizing crop productivity, necessitating a deeper understanding of root-mediated traits governing resource acquisition and stress resilience. Here, we synthesize recent advances in root-centered plant phenomics, emphasizing how high-throughput phenotyping (HTP) enables high-resolution, scalable characterization of complex root traits and robust comparative analysis across diverse genotypes and environments. Innovations in multimodal imaging notably X-ray computed tomography (CT), magnetic resonance imaging (MRI), and machine learning-integrated rhizotrons facilitate detailed reconstruction of root system architecture and its temporal dynamics under both controlled and semi-field conditions. Furthermore, root phenotyping is increasingly interpreted within an integrated whole-plant framework. The integration of organ-specific assessments with physiological phenomics leveraging spectral and thermal data enables the characterization of developmental plasticity and root-mediated processes, including water-use dynamics, nutrient acquisition, and canopy stress responses under heterogeneous field conditions. These approaches link root traits such as rooting depth and spatial distribution to canopy-level physiological responses under stress. Despite these advances, significant bottlenecks persist in data interoperability, analytical scalability, and protocol standardization. Future progress will require integration of root phenomics with genomics, predictive modeling, and digital twin frameworks to improve resource-use efficiency, yield stability, and climate resilience in global cropping systems.
|
Scooped by
mhryu@live.com
Today, 12:04 AM
|
Methodological inconsistencies hinder the identification of gut keystone bacteria and their functional mechanisms. This work critically compares identification approaches, elucidates their interactions within the microbiota and host, and systematically details targeted dietary interventions. By explicitly linking keystone identification to diet-driven modulation, it provides a practical and precise framework for developing gut health strategies and managing microbiota-related diseases. review
|
Scooped by
mhryu@live.com
May 25, 11:37 PM
|
Amino acid-based phylogenetics usually relies on first clustering and aligning orthologous proteins. This approach is powerful but computationally demanding. Here, we present kamino, a reference-free and alignment-free method that builds amino acid phylogenomic alignments directly from proteomes. kamino adapts a local graph-based variant-calling algorithm to efficiently identify variable homologous positions among proteins and concatenate these polymorphic regions. Across diverse prokaryotic and eukaryotic datasets, we showed that kamino is able to generate good quality alignments. Phylogenetic analyses revealed that kamino generally recovered signals broadly similar to those obtained from marker-based approaches, while being much faster. Its main limitations are reduced performance on deeply divergent prokaryotic datasets and substantial memory requirements for large eukaryotic datasets. kamino therefore provides a fast and simple approach for constructing phylogenomic amino acid alignments, complementing classical marker-based workflows. The program is implemented in Rust and is freely available at https://github.com/rderelle/kamino.
|
Scooped by
mhryu@live.com
May 25, 11:15 PM
|
Liquid–liquid phase separation (LLPS) has emerged as a fundamental mechanism underlying the formation and regulation of membraneless cellular compartments and is increasingly implicated in diverse physiological processes and diseases. Alongside rapid experimental and high-throughput advances, bioinformatics data resources and computational models have expanded substantially, enabling systematic cataloguing of LLPS-associated components and prediction of phase-separation behavior from molecular features. However, the resulting computational landscape remains highly fragmented. In this review, we provide a comprehensive and critical synthesis of bioinformatics resources and predictive modelling approaches for LLPS. We examine and compare major LLPS databases, highlighting differences in evidence types, curation strategies, coverage, and cross-resource inconsistencies that limit integrative analysis. We then survey computational models across core LLPS prediction tasks, encompassing more than 40 representative algorithms and tracing methodological evolution from classical machine learning to deep learning and large language model-based frameworks. By integrating these advances, we identify a fundamental mismatch between molecule-centric data abstractions and the inherently multicomponent, context-dependent organization of LLPS phenomena. We argue that future progress may benefit from event-centric frameworks that explicitly represent molecular assemblies, contextual conditions and observable phase behaviors, thereby providing a coherent foundation for next-generation LLPS datasets and computational models with improved mechanistic interpretability and translational relevance.
|
Scooped by
mhryu@live.com
May 25, 10:34 PM
|
RNA molecules interact extensively with each other and with RNA-binding proteins (RBPs) to regulate gene expression. To improve detection of RBP–RNA and RNA–RNA interactions, we developed the rbsSeeker and rriScan tools and integrated the interactions into the Encyclopedia of RNA Interactomes (ENCORI), providing web-based modules to explore RNA interactions. Using this resource, we identified and validated novel N6-methyladenosine (m6A)-associated proteins, an orphan small nucleolar RNA guiding rRNA pseudouridines and target-directed microRNA degradation events, establishing ENCORI as a framework for studying RNA interactomes. ENCORI is a comprehensive analysis platform and resource for exploring RNA–protein and RNA–RNA interactomes.
|
|
Scooped by
mhryu@live.com
Today, 5:25 PM
|
Rhodobacter sphaeroides, a purple nonsulfur photosynthetic bacterium, displays exceptional metabolic versatility, enabling growth under both aerobic and anaerobic conditions and utilization of diverse carbon sources. Its flexible metabolism, combined with native pathways for terpenoid and tetrapyrrole biosynthesis, makes it a highly promising microbial chassis for the production of valuable compounds. Advances in metabolic and synthetic biology have allowed the engineering of R. sphaeroides for the efficient synthesis of coenzyme Q10 (CoQ10) and porphyrin derivatives through strategies such as precursor supply enhancement, pathway optimization, modulation of redox and energy balance, manipulation of global regulatory systems, and fermentation control. Beyond CoQ10 and porphyrins, this organism holds the potential to produce hydrogen, carotenoids, and other high-value terpenoids. This review summarizes the metabolic features, native regulatory networks, and engineering approaches in R. sphaeroides, highlighting its versatility and robustness as a platform organism. The insights provided here underscore its potential as a chassis for synthetic biology applications and industrial bioproduction of a wide range of bioactive compounds.
|
Scooped by
mhryu@live.com
Today, 5:18 PM
|
Microbial communities deliver essential functions in ecosystems. In plant environments, the plant microbiome facilitates nutrient uptake, supports plants during abiotic stress, and counteracts disease. As implementation of synthetic microbial communities becomes more of a realistic strategy for mitigating the effects of biotic and abiotic stressors on plant productivity, it is increasingly important to understand how interactions between microbes, which are essential for ecosystem function (hub microbes), are maintained. Recent research highlights the ecological role of bacteriophages, the viruses of bacteria, in host-associated microbial communities. Current evidence demonstrates the influence of the phageome on microbiomes, ranging from effects on an individual (transduction, lysogenic conversion, and evolutionary pressure) to entire populations and communities, such as Kill-the-Winner dynamics. These dynamics appear to affect the overall function of microbial communities and support plant growth. In this review, we lay out recent insights on the role of bacteriophages in plant-associated microbiomes through an eco-evolutionary lens and future directions of research to broaden our understanding of the ecological implications of bacteriophages.
|
Scooped by
mhryu@live.com
Today, 5:09 PM
|
Circularly permuted green fluorescent protein (cpGFP)-based high-performance cAMP sensors have enabled real-time monitoring of cAMP dynamics with high spatiotemporal resolution in living animals. However, their utility is hampered by significant spectral overlap with other green/yellow fluorescent indicators and blue/cyan light-activated optogenetic actuators, limiting their compatibility in multiplexed imaging applications. While existing red cAMP sensors offer great spectral separation, they often suffer from a limited dynamic range ( < 1.5-fold in HEK293T cells), low cellular brightness, aggregation, or significant blue-light-induced photoactivation. Here, we report R-Flamp1, a red cAMP sensor with a large dynamic range ( > 10-fold in HEK293T cells), enhanced cellular brightness, appropriate cAMP affinity (Kd ~1.9 μM), subsecond response kinetics, and minimal photoactivation under blue or cyan light exposure. Using R-Flamp1, we visualized region-specific cAMP dynamics, and when paired with green indicators, revealed differential activation patterns between cAMP and neuromodulators or calcium using two-photon imaging and fiber photometry during various behaviors. These findings provide valuable insights into the role of cAMP signaling in complex behaviors. R-Flamp1 is a high-performance red fluorescent cAMP sensor and the authors monitor region-specific cAMP dynamics in vivo and reveal distinct activation patterns between cAMP, calcium, and neuromodulators via two-photon imaging and fiber photometry in animals.
|
Scooped by
mhryu@live.com
Today, 3:47 PM
|
Lactic acid bacteria (LAB), as one of the key microorganisms in the food industry, play a crucial role in functional food fermentation. However, current LAB strains are limited by challenges, such as plasmid instability, low gene expression efficiency, and complex regulation of metabolic fluxes, which hinder their broader application. This review provides an overview of the traditional applications of LAB while highlighting current limitations that constrain their effective use. Then, it focuses on synthetic biology-driven strategies for precisely designing and expanding functions through gene editing, metabolic engineering, and genetic circuits. Finally, this review discusses how to improve gene expression efficiency in LAB and the use of directed evolution to optimize exogenous genes, offering perspectives for future research in the development of personalized food applications. The emerging tools of synthetic biology will further improve the production efficiency and product diversity and meet the needs of consumers for high-quality, multifunctional, and personalized food.
|
Scooped by
mhryu@live.com
Today, 1:37 AM
|
The endosymbiotic evolution of plastids and mitochondria was central to the origin and success of eukaryotes. One of the most prominent molecular machineries thought to have disappeared early in eukaryote evolution is the multi-subunit bacterial DNA polymerase III (DNApol-III), which is the principal enzyme complex supporting DNA replication in bacteria. Here, we combined worldwide metagenomics and cultivation to characterisz the mosaic genomic landscape of abundant phytoplankton lineages of Teleaulax (Cryptophyceae), which contain an endosymbiotically-derived nucleomorph genome. Unexpectedly, the nuclear, plastid and nucleomorph genomes of Teleaulax contain ubiquitously expressed genes for plastid-targeted DNApol-III subunits. These genes shed light on the functioning of Teleaulax genomes when sequestered by the ciliate Mesodinium during its kleptoplastidic photosynthetic activity. In particular, the alpha subunit gene (encoding the polymerase activity), which resides in the nucleomorph genome, is continuously expressed in Mesodinium in controlled laboratory experiments. This provides a mechanistic explanation for the replication of Teleaulax plastid genomes weeks after the nuclear genome is lost. Beyond Teleaulax and close relatives, we also identified genes encoding plastid-targeted DNApol-III subunits (including alpha) in nuclear genomes of unicellular and multicellular lineages of Archaeplastida that form, along with those of Cryptophyceae, monophyletic clades firmly positioned within Cyanobacteria. Together, our results reveal a previously overlooked retention of bacterial DNA replication machinery from plastid primary endosymbiosis in Archaeplastida, its acquisition by Cryptophyceae during secondary endosymbiosis, and its direct role in contemporary plankton as a facilitator of kleptoplastidic photosynthetic activity by heterotrophic ciliates.
|
Scooped by
mhryu@live.com
Today, 1:26 AM
|
Bacteriophages (phages) play essential roles in microbial systems, yet most phage proteins remain poorly characterized. Protein tertiary and quaternary structure information contributes valuable information about protein function. As many phage proteins function as homooligomers, complexes that consist of multiple identical subunits, there is great interest in computationally predicting their configurations. Here we present a computational framework, the Phage Homomer Level Estimate and Generation Method (PHLEGM) for inferring homooligomeric states directly from the protein sequence by combining AlphaFold-Multimer modelling with inter-subunit interface quality assessment. We proceeded to experimentally validate two out of nine predicted homooligomers using size exclusion chromatography and complementary hydrodynamic techniques. These efforts confirmed our predictions for a dimer and a trimer, highlighting the value of experimentally benchmarked computational predictions and showing the challenges of heterologous phage protein production. Applied to >22,000 phage protein sequences in the PHROGs database, our approach revealed extensive diversity in phage homooligomeric protein complexes. Benchmarking against protein language model-based predictors on a curated reference set of known phage homooligomers demonstrated superior accuracy of our structure-based method, achieving robust performance in classifying protein homooligomeric states, with the highest accuracy observed for trimers and higher-order complexes. These results highlight the value of computational predictions to decipher the complexities of the vast viral sequence space. All predicted complex structures and functional inferences are made publicly available to support structural and functional studies of phage proteins.
|
Scooped by
mhryu@live.com
Today, 12:45 AM
|
Cross-feeding interactions, in which a producer species release by-products that serve as resources for a consumer species, play an important role in shaping microbial community diversity. Producers create opportunities for consumers by supplying high-energy resources that are often scarce in the environment. However, they also exert strong top-down effects by releasing metabolites in pulses and generating spatial gradients of resource availability. How these spatiotemporal constraints shape consumer evolution remains poorly understood. To address this question, we used a two-species cross-feeding system in which Acinetobacter johnsonii excretes benzoate (a by-product of benzyl alcohol oxidation) into the external environment where it is consumed by Pseudomonas putida. To assess how the origin of benzoate (externally supplied or produced by cross-feeding) shapes consumer evolution, we evolved P. putida for 200 generations in monoculture or in co-culture with A. johnsonii. Populations evolved in monoculture exhibited improved growth relative to the ancestor, whereas populations evolved under cross-feeding showed little to no growth improvement. Whole-genome sequencing revealed pervasive loss-of-function mutations in flagellar genes among populations evolved in monoculture, but not under cross-feeding conditions. High-throughput imaging assays showed that populations evolved under cross-feeding not only maintained but also enhanced functional motility. Competition experiments with single mutants revealed context-dependent fitness effects: loss-of-function mutations were highly beneficial when benzoate was externally supplied but deleterious when benzoate was supplied by A. johnsonii, highlighting the importance of motility in cross-feeding interactions. Together, our results show that resource origin fundamentally reshapes selective pressures and alters evolutionary outcomes in microbial communities.
|
Scooped by
mhryu@live.com
Today, 12:18 AM
|
Heme is an essential cofactor involved in diverse cellular processes and is an important target for microbial biosynthesis. However, engineering microbial heme production remains challenging due to the requirement for coordinated activity of multiple pathway enzymes and the lack of scalable strategies for enzyme-level screening. In this study, we developed a growth-coupled screening system in Corynebacterium glutamicum by establishing a ChrSA-based heme sensor derived from the native heme-responsive two-component system, which directly converts intracellular heme levels into a selectable growth phenotype. Using this biosensor-enabled system, random mutagenesis libraries were constructed for four key enzymes of the coproporphyrin-dependent (CPD) heme biosynthesis pathway, and improved variants were identified through biosensor-guided selection. The selected variants were subsequently validated by chromosomal allelic replacement, resulting in an overall increase in heme production of approximately 38.9% and a final titer of 308.7 mg/L under native, single-copy regulation. Sequence and structural analyses indicated that the beneficial substitutions were predominantly located outside catalytic residues, suggesting that changes in enzyme stability and conformational properties contributed to the enhanced heme biosynthesis. This work established a heme biosensor-guided strategy for enzyme variant screening in tightly coupled metabolic pathways and provided a practical systems-level approach for microbial heme pathway engineering.
|
Scooped by
mhryu@live.com
Today, 12:11 AM
|
We identify a peptide discovered through fluorescence-based screening of a cell-penetrating peptide (CPP) library that preferentially associates with bacterial spores and evaluate its utility as a fluorescence-guided tool for spore isolation. Specifically, an eight-ornithine peptide conjugated with 5(6)-carboxyfluorescein (Orn8-K-FAM) generates a distinct and spatially confined fluorescence signal in spores. Across multiple spore-forming bacterial species, more than 90% of spores were labelled by Orn8-K-FAM, whereas only a minor fraction of vegetative cells from non-spore-forming bacteria exhibited detectable fluorescence, indicating a high degree of preference at the cellular level under the conditions tested. To assess practical applicability, Orn8-K-FAM labeling was combined with flow-cytometric sorting FACS, enabling fluorescence-guided isolation of spores from mixed microbial populations, including both a defined synthetic community and an environmentally derived bacterial fraction. Compared with conventional spore isolation approaches based on selective cultivation or heat-mediated inactivation of vegetative cells, this workflow provides a cultivation-independent means to separate spore-associated cellular states from heterogeneous microbial samples. This work highlights the potential of CPP-derived probes as analytical tools for spore-focused studies in applied and environmental microbiology, while providing a foundation for further investigation into the scope and mechanistic basis of CPP-spore interactions.
|
Scooped by
mhryu@live.com
Today, 12:03 AM
|
Human-associated microbial genomes encode extensive strain-level diversity and niche-specific gene repertoires that are critical to host health. However, these complex sequence features remain difficult to capture using general-purpose DNA foundation models, highlighting the need for dedicated representation learning tailored to the human microbiome. Here, we introduce Genos-m, an open-source foundation model for human-associated microbial genome representation. Genos-m was pretrained on approximately 1.2 trillion nucleotide tokens from a curated microbial genome corpus, including human-associated prokaryotic isolates, high-quality metagenome-assembled genomes (MAGs) and bacteriophages, supplemented with GTDB species-level representative genomes to broaden prokaryotic taxonomic breadth. The model uses a sparsely activated Mixture-of-Experts (MoE) Transformer architecture, with 4.7 billion total parameters, approximately 330 million activated parameters per forward pass and a maximum context length of one million base pairs. We evaluated frozen Genos-m representations across short-sequence and gene-level tasks, biosynthetic gene cluster (BGC)-based regional sequence tasks, whole-genome strain phenotype prediction, and zero-shot transfer on prokaryote-related RNAfitness assays. Across these benchmarks, Genos-m consistently ranked among the leading comparison models, with the best performance in five of eight gene-fitness regression tasks and in BGC type classification. Using sparse autoencoders, we identified sparse features in Genos-m hidden activations that aligned with annotated ORFs, intergenic regions, and tRNA and rRNA loci. In downstream applications, Genos-m-derived genome-informed species representations incorporated into a human microbiome self-supervised learning model improved colorectal cancer (CRC)-control classification over conventional species-abundance random forest models. Genos-m also generated stable sample-level embeddings from as few as 10,000 metagenomic reads, retaining gut microbial community structure that distinguished geographic origin and aligned with enterotypes defined from full-depth taxonomic profiles. Together, these results support Genos-m as a reusable representation model for microbial genomes and metagenomes, with conclusions bounded by the reported datasets, task definitions and evaluation protocols. Genos-m model weights, inference code, and usage documentation are publicly available on GitHub (https://github.com/BGI-HangzhouAI/Genos-m) and HuggingFace (https://huggingface.co/BGI-HangzhouAI/Genos-m).
|
Scooped by
mhryu@live.com
May 25, 11:35 PM
|
Horizontal gene transfer (HGT), the movement of genetic material between unrelated organisms, is widely recognized as an important driver of genome evolution in bacteria. In eukaryotes, however, the evolutionary impact of HGT remains debated. The identification of interkingdom HGT (iHGT) is especially challenging due to the lack of gold standard methods. Traditionally, iHGT identification has relied on manual inspection of phylogenetic trees, a process that is subjective, difficult to reproduce, and not scalable to large datasets. In this study, we present a computational framework that formalizes phylogenetic tree interpretation into a supervised machine-learning problem. We define five recurrent phylogenetic patterns—iHGT, NoHGT, Limited donor evidence, Multiple major clades (Multiple MC), and Patchy phylogeny—capturing clear and ambiguous evolutionary scenarios. To operationalize these patterns, we developed a feature-extraction pipeline that quantifies taxonomic composition and phylogenetic topology using seven biological descriptors derived from gene trees. These features were used to train and evaluate multiple machine-learning models, among which a Random Forest (RF) classifier achieved the best performance (AUC–ROC = 0.98; accuracy = 0.89). Model interpretability analyses revealed that topological distance to additional clades and lineage diversity are the most informative predictors, reflecting key signals used in expert-driven phylogenetic interpretation. The RF model was further validated using 1,000 simulated phylogenies and 1,438 real iHGT candidates, achieving low misclassification rates (7.8% and 10.43%, respectively). Benchmarking against AVP (Alienness vs. Predictor), a comparable tool for iHGT detection, demonstrated improved performance across all evaluation metrics, highlighting the advantages of incorporating global phylogenetic structure into the classification process. This study provides a reproducible and scalable framework for phylogenetic pattern classification that captures complex evolutionary signals while maintaining biological interpretability. Beyond improving iHGT detection, the approach offers a more nuanced representation of evolutionary scenarios by explicitly accounting for inconclusive cases, supporting more robust inference in comparative genomics.
|
Scooped by
mhryu@live.com
May 25, 11:03 PM
|
Lactoferrin is a multifunctional bioactive glycoprotein currently extracted primarily from bovine milk serum. However, traditional extraction methods rely on livestock farming, making the process inefficient and raising environmental and resource concerns. This study developed a cell factory for the biosynthetic production of bovine lactoferrin (BLF) using Aspergillus niger as the host. First, an efficient multigene editing toolkit was developed by optimizing the CRISPR/Cas9 system, achieving a 30.9% simultaneous integration efficiency for five genes. Next, BLF was fused with the GlaA fragment, an endogenous protein with high secretion capacity, and targeted to multiple high-expression sites and protease sites. Moreover, chaperone protein modification guided by a multi-cis-element construct, combined with hyphal morphology modification, elevated BLF titer to 178.6 mg/L. Fed-batch fermentation in a 5-L bioreactor yielded 1057.6 mg/L BLF. This represented a record titer for engineered filamentous fungi producing recombinant BLF. Finally, life cycle assessment indicated that producing BLF using A. niger exhibits strong potential advantages over naturally extracted BLF under the modeled cradle-to-gate scenarios. This engineered A. niger strain shows remarkable prospects in the sustainable industrial applications of recombinant lactoferrin.
|
laub mt