Home | << 1 >> |
Aite, M., Chevallier, M., Frioux, C., Trottier, C., Got, J., Cortes, M. P., et al. (2018). Traceability, reproducibility and wiki-exploration for “a-la-carte” reconstructions of genome-scale metabolic models. PLoS Comput. Biol., 14(5), 25 pp.
Abstract: Genome-scale metabolic models have become the tool of choice for the global analysis of microorganism metabolism, and their reconstruction has attained high standards of quality and reliability. Improvements in this area have been accompanied by the development of some major platforms and databases, and an explosion of individual bioinformatics methods. Consequently, many recent model s result from “a la carte” pipelines, combining the use of platforms, individual tools and biological expertise to enhance the quality of the reconstruction. Although very useful, introducing heterogeneous tools, that hardly interact with each other, causes loss of traceability and reproducibility in the reconstruction process. This represents a real obstacle, especially when considering less studied species whose metabolic reconstruction can greatly benefit from the comparison to good quality models of related organisms. This work proposes an adaptable workspace, AuReMe, for sustainable reconstructions or improvements of genome-scale metabolic models involving personalized pipelines. At each step, relevant information related to the modifications brought to the model by a method is stored. This ensures that the process is reproducible and documented regardless of the combination of tools used. Additionally, the workspace establishes a way to browse metabolic models and their metadata through the automatic generation of ad-hoc local wikis dedicated to monitoring and facilitating the process of reconstruction. AuReMe supports exploration and semantic query based on RDF databases. We illustrate how this workspace allowed handling, in an integrated way, the metabolic reconstructions of non-model organisms such as an extremophile bacterium or eukaryote algae. Among relevant applications, the latter reconstruction led to putative evolutionary insights of a metabolic pathway.
|
Cortes, M. P., Mendoza, S. N., Travisany, D., Gaete, A., Siegel, A., Cambiazo, V., et al. (2017). Analysis of Piscirickettsia salmonis Metabolism Using Genome-Scale Reconstruction, Modeling, and Testing. Front. Microbiol., 8, 15 pp.
Abstract: Piscirickettsia salmonis is an intracellular bacterial fish pathogen that causes piscirickettsiosis, a disease with highly adverse impact in the Chilean salmon farming industry. The development of effective treatment and control methods for piscireckttsiosis is still a challenge. To meet it the number of studies on P. salmonis has grown in the last couple of years but many aspects of the pathogen's biology are still poorly understood. Studies on its metabolism are scarce and only recently a metabolic model for reference strain LF-89 was developed. We present a new genomescale model for P. salmonis LF-89 with more than twice as many genes as in the previous model and incorporating specific elements of the fish pathogen metabolism. Comparative analysis with models of different bacterial pathogens revealed a lower flexibility in P. salmonis metabolic network. Through constraint-based analysis, we determined essential metabolites required for its growth and showed that it can benefit from different carbon sources tested experimentally in new defined media. We also built an additional model for strain A1-15972, and together with an analysis of P. salmonis pangenome, we identified metabolic features that differentiate two main species clades. Both models constitute a knowledge-base for P. salmonis metabolism and can be used to guide the efficient culture of the pathogen and the identification of specific drug targets.
Keywords: pathogen; genome-scale; metabolism; constraint-based; Piscirickettsia; salmonis
|
Di Genova, A., Ruz, G. A., Sagot, M. F., & Maass, A. (2018). Fast-SG: an alignment-free algorithm for hybrid assembly. GigaScience, 7(5), 15 pp.
Abstract: Background: Long-read sequencing technologies are the ultimate solution for genome repeats, allowing near reference-level reconstructions of large genomes. However, long-read de novo assembly pipelines are computationally intense and require a considerable amount of coverage, thereby hindering their broad application to the assembly of large genomes. Alternatively, hybrid assembly methods that combine short-and long-read sequencing technologies can reduce the time and cost required to produce de novo assemblies of large genomes. Results: Here, we propose a new method, called Fast-SG, that uses a new ultrafast alignment-free algorithm specifically designed for constructing a scaffolding graph using light-weight data structures. Fast-SG can construct the graph from either short or long reads. This allows the reuse of efficient algorithms designed for short-read data and permits the definition of novel modular hybrid assembly pipelines. Using comprehensive standard datasets and benchmarks, we show how Fast-SG outperforms the state-of-the-art short-read aligners when building the scaffolding graph and can be used to extract linking information from either raw or error-corrected long reads. We also show how a hybrid assembly approach using Fast-SG with shallow long-read coverage (5X) and moderate computational resources can produce long-range and accurate reconstructions of the genomes of Arabidopsis thaliana (Ler-0) and human (NA12878). Conclusions: Fast-SG opens a door to achieve accurate hybrid long-range reconstructions of large genomes with low effort, high portability, and low cost.
Keywords: hybrid assembly; genome scaffolding; alignment-free
|
Loira, N., Mendoza, S., Cortes, M. P., Rojas, N., Travisany, D., Di Genova, A., et al. (2017). Reconstruction of the microalga Nannochloropsis salina genome-scale metabolic model with applications to lipid production. BMC Syst. Biol., 11, 17 pp.
Abstract: Background: Nannochloropsis salina (= Eustigmatophyceae) is a marine microalga which has become a biotechnological target because of its high capacity to produce polyunsaturated fatty acids and triacylglycerols. It has been used as a source of biofuel, pigments and food supplements, like Omega 3. Only some Nannochloropsis species have been sequenced, but none of them benefit from a genome-scale metabolic model (GSMM), able to predict its metabolic capabilities. Results: We present iNS934, the first GSMM for N. salina, including 2345 reactions, 934 genes and an exhaustive description of lipid and nitrogen metabolism. iNS934 has a 90% of accuracy when making simple growth/no-growth predictions and has a 15% error rate in predicting growth rates in different experimental conditions. Moreover, iNS934 allowed us to propose 82 different knockout strategies for strain optimization of triacylglycerols. Conclusions: iNS934 provides a powerful tool for metabolic improvement, allowing predictions and simulations of N. salina metabolism under different media and genetic conditions. It also provides a systemic view of N. salina metabolism, potentially guiding research and providing context to -omics data.
Keywords: Genome-scale Metabolic model; Nannochloropsis salina; TAG; Microalg ae
|
Narum, S. R., Di Genova, A., Micheletti, S. J., & Maass, A. (2018). Genomic variation underlying complex life-history traits revealed by genome sequencing in Chinook salmon. Proc. R. Soc. B-Biol. Sci., 285(1883), 9 pp.
Abstract: A broad portfolio of phenotypic diversity in natural organisms can buffer against exploitation and increase species persistence in disturbed ecosystems. The study of genomic variation that accounts for ecological and evolutionary adaptation can represent a powerful approach to extend understanding of phenotypic variation in nature. Here we present a chromosome-level reference genome assembly for Chinook salmon (Oncorhynchus tshawytscha; 2.36 Gb) that enabled association mapping of life-history variation and phenotypic traits for this species. Whole-genome re-sequencing of populations with distinct life-history traits provided evidence that divergent selection was extensive throughout the genome within and among phylogenetic lineages, indicating that a broad portfolio of phenotypic diversity exists in this species that is related to local adaptation and life-history variation. Association mapping with millions of genome-wide SNPs revealed that a genomic region of major effect on chromosome 28 was associated with phenotypes for premature and mature arrival to spawning grounds and was consistent across three distinct phylogenetic lineages. Our results demonstrate how genomic resources can enlighten the genetic basis of known phenotypes in exploited species and assist in clarifying phenotypic variation that may be difficult to observe in naturally occurring organisms.
|
Travisany, D., Goles, E., Latorre, M., Cort?s, M. P., & Maass, A. (2020). Generation and robustness of Boolean networks to model Clostridium difficile infection. Nat. Comput., 19(1), 111–134.
Abstract: One of the more common healthcare associated infection is Chronic diarrhea. This disease is caused by the bacterium Clostridium difficile which alters the normal composition of the human gut flora. The most successful therapy against this infection is the fecal microbial transplant (FMT). They displace C. difficile and contribute to gut microbiome resilience, stability and prevent further episodes of diarrhea. The microorganisms in the FMT their interactions and inner dynamics reshape the gut microbiome to a healthy state. Even though microbial interactions play a key role in the development of the disease, currently, little is known about their dynamics and properties. In this context, a Boolean network model for C. difficile infection (CDI) describing one set of possible interactions was recently presented. To further explore the space of possible microbial interactions, we propose the construction of a neutral space conformed by a set of models that differ in their interactions, but share the final community states of the gut microbiome under antibiotic perturbation and CDI. To begin with the analysis, we use the previously described Boolean network model and we demonstrate that this model is in fact a threshold Boolean network (TBN). Once the TBN model is set, we generate and use an evolutionary algorithm to explore to identify alternative TBNs. We organize the resulting TBNs into clusters that share similar dynamic behaviors. For each cluster, the associated neutral graph is constructed and the most relevant interactions are identified. Finally, we discuss how these interactions can either affect or prevent CDI.
|