Searched for: in-biosketch:yes
person:deustp01
Advancing curation of viral life cycles, host interactions, and therapeutics in Reactome
Matthews, Lisa; Cook, Justin; Stephan, Ralf; Milacic, Marija; Rothfels, Karen; Shamovsky, Veronica; Jassal, Bijay; Haw, Robin; Sevilla, Cristoffer; Gong, Chuqiao; Ragueneau, Eliot; May, Bruce; Wright, Adam; Weiser, Joel; Beavers, Deidre; Tiwari, Krishna; Senff-Ribeiro, Andrea; Varusai, Thawfeek; Hermjakob, Henning; D'Eustachio, Peter; Wu, Guanming; Stein, Lincoln; Gillespie, Marc E
Reactome (reactome.org) is a manually curated, peer-reviewed, open-source, open-access pathway knowledgebase of essential human cellular functions. Reactome includes viral life cycles that capture a broad range of virus-induced human pathology. Here, we describe a workflow using collaborative curation strategies, orthoinference procedures, and literature triage to rapidly create reliable molecular models of emergent viruses. The resulting pathway data set rigorously details viral infection pathways, interactions with normal human biological processes, and potential therapeutic compounds.
PMID: 40265842
ISSN: 1098-5514
CID: 5830242
Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes
Orlic-Milacic, Marija; Rothfels, Karen; Matthews, Lisa; Wright, Adam; Jassal, Bijay; Shamovsky, Veronica; Trinh, Quang; Gillespie, Marc E; Sevilla, Cristoffer; Tiwari, Krishna; Ragueneau, Eliot; Gong, Chuqiao; Stephan, Ralf; May, Bruce; Haw, Robin; Weiser, Joel; Beavers, Deidre; Conley, Patrick; Hermjakob, Henning; Stein, Lincoln D; D'Eustachio, Peter; Wu, Guanming
Germline and somatic mutations can give rise to proteins with altered activity, including both gain and loss-of-function. The effects of these variants can be captured in disease-specific reactions and pathways that highlight the resulting changes to normal biology. A disease reaction is defined as an aberrant reaction in which a variant protein participates. A disease pathway is defined as a pathway that contains a disease reaction. Annotation of disease variants as participants of disease reactions and disease pathways can provide a standardized overview of molecular phenotypes of pathogenic variants that is amenable to computational mining and mathematical modeling. Reactome (https://reactome.org/), an open source, manually curated, peer-reviewed database of human biological pathways, in addition to providing annotations for >11 000 unique human proteins in the context of ∼15 000 wild-type reactions within more than 2000 wild-type pathways, also provides annotations for >4000 disease variants of close to 400 genes as participants of ∼800 disease reactions in the context of ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, described in wild-type reactions and pathways, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Reactome's data model enables mapping of disease variant datasets to specific disease reactions within disease pathways, providing a platform to infer pathway output impacts of numerous human disease variants and model organism orthologs, complementing computational predictions of variant pathogenicity. Database URL: https://reactome.org/.
PMID: 38713862
ISSN: 1758-0463
CID: 5658352
Plant Reactome Knowledgebase: empowering plant pathway exploration and OMICS data analysis
Gupta, Parul; Elser, Justin; Hooks, Elizabeth; D'Eustachio, Peter; Jaiswal, Pankaj; Naithani, Sushma
Plant Reactome (https://plantreactome.gramene.org) is a freely accessible, comprehensive plant pathway knowledgebase. It provides curated reference pathways from rice (Oryza sativa) and gene-orthology-based pathway projections to 129 additional species, spanning single-cell photoautotrophs, non-vascular plants, and higher plants, thus encompassing a wide-ranging taxonomic diversity. Currently, Plant Reactome houses a collection of 339 reference pathways, covering metabolic and transport pathways, hormone signaling, genetic regulations of developmental processes, and intricate transcriptional networks that orchestrate a plant's response to abiotic and biotic stimuli. Beyond being a mere repository, Plant Reactome serves as a dynamic data discovery platform. Users can analyze and visualize omics data, such as gene expression, gene-gene interaction, proteome, and metabolome data, all within the rich context of plant pathways. Plant Reactome is dedicated to fostering data interoperability, upholding global data standards, and embracing the tenets of the Findable, Accessible, Interoperable and Re-usable (FAIR) data policy.
PMID: 37986220
ISSN: 1362-4962
CID: 5608382
The Reactome Pathway Knowledgebase 2024
Milacic, Marija; Beavers, Deidre; Conley, Patrick; Gong, Chuqiao; Gillespie, Marc; Griss, Johannes; Haw, Robin; Jassal, Bijay; Matthews, Lisa; May, Bruce; Petryszak, Robert; Ragueneau, Eliot; Rothfels, Karen; Sevilla, Cristoffer; Shamovsky, Veronica; Stephan, Ralf; Tiwari, Krishna; Varusai, Thawfeek; Weiser, Joel; Wright, Adam; Wu, Guanming; Stein, Lincoln; Hermjakob, Henning; D'Eustachio, Peter
The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.
PMID: 37941124
ISSN: 1362-4962
CID: 5610282
Perspectives on tracking data reuse across biodata resources [Editorial]
Ross, Karen E; Bastian, Frederic B; Buys, Matt; Cook, Charles E; D'Eustachio, Peter; Harrison, Melissa; Hermjakob, Henning; Li, Donghui; Lord, Phillip; Natale, Darren A; Peters, Bjoern; Sternberg, Paul W; Su, Andrew I; Thakur, Matthew; Thomas, Paul D; Bateman, Alex; ,
MOTIVATION/UNASSIGNED:Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge. RESULTS/UNASSIGNED:The article reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources. AVAILABILITY AND IMPLEMENTATION/UNASSIGNED:Summaries of survey results are available at: https://docs.google.com/forms/d/1j-VU2ifEKb9C-sW6l3ATB79dgHdRk5v_lESv2hawnso/viewanalytics (survey of data providers) and https://docs.google.com/forms/d/18WbJFutUd7qiZoEzbOytFYXSfWFT61hVce0vjvIwIjk/viewanalytics (survey of users).
PMCID:11076920
PMID: 38721398
ISSN: 2635-0041
CID: 5733972
Biochemical pathways represented by Gene Ontology-Causal Activity Models identify distinct phenotypes resulting from mutations in pathways
Hill, David P; Drabkin, Harold J; Smith, Cynthia L; Van Auken, Kimberly M; D'Eustachio, Peter
Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process descriptions in the Reactome Knowledgebase and causal activity flows between molecular functions in Gene Ontology-Causal Activity Models (GO-CAMs). A computational process has been developed to convert Reactome pathways to GO-CAMs. Laboratory mice are widely used models of normal and pathological human processes. We have converted human Reactome GO-CAMs to orthologous mouse GO-CAMs, as a resource to transfer pathway knowledge between humans and model organisms. These mouse GO-CAMs allowed us to define sets of genes that function in a causally connected way. To demonstrate that individual variant genes from connected pathways result in similar but distinguishable phenotypes, we used the genes in our pathway models to cross-query mouse phenotype annotations in the Mouse Genome Database (MGD). Using GO-CAM representations of 2 related but distinct pathways, gluconeogenesis and glycolysis, we show that individual causal paths in gene networks give rise to discrete phenotypic outcomes resulting from perturbations of glycolytic and gluconeogenic genes. The accurate and detailed descriptions of gene interactions recovered in this analysis of well-studied processes suggest that this strategy can be applied to less well-understood processes in less well-studied model systems to predict phenotypic outcomes of novel gene variants and to identify potential gene targets in altered processes.
PMID: 37579192
ISSN: 1943-2631
CID: 5609842
Illuminate the Functions of Dark Proteins Using the Reactome-IDG Web Portal
Beavers, Deidre; Brunson, Timothy; Sanati, Nasim; Matthews, Lisa; Haw, Robin; Shorser, Solomon; Sevilla, Cristoffer; Viteri, Guilherme; Conley, Patrick; Rothfels, Karen; Hermjakob, Henning; Stein, Lincoln; D'Eustachio, Peter; Wu, Guanming
Understudied or dark proteins have the potential to shed light on as-yet undiscovered molecular mechanisms that underlie phenotypes and suggest innovative therapeutic approaches for many diseases. The Reactome-IDG (Illuminating the Druggable Genome) project aims to place dark proteins in the context of manually curated, highly reliable pathways in Reactome, the most comprehensive, open-source biological pathway knowledgebase, facilitating the understanding functions and predicting therapeutic potentials of dark proteins. The Reactome-IDG web portal, deployed at https://idg.reactome.org, provides a simple, interactive web page for users to search pathways that may functionally interact with dark proteins, enabling the prediction of functions of dark proteins in the context of Reactome pathways. Enhanced visualization features implemented at the portal allow users to investigate the functional contexts for dark proteins based on tissue-specific gene or protein expression, drug-target interactions, or protein or gene pairwise relationships in the original Reactome's systems biology graph notation (SBGN) diagrams or the new simplified functional interaction (FI) network view of pathways. The protocols in this chapter describe step-by-step procedures to use the web portal to learn biological functions of dark proteins in the context of Reactome pathways. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Search for interacting pathways of a protein Support Protocol: Interacting pathway results for an annotated protein Alternate Protocol: Use individual pairwise relationships to predict interacting pathways of a protein Basic Protocol 2: Using the IDG pathway browser to study interacting pathways Basic Protocol 3: Overlaying tissue-specific expression data Basic Protocol 4: Overlaying protein/gene pairwise relationships in the pathway context Basic Protocol 5: Visualizing drug/target interactions.
PMID: 37467006
ISSN: 2691-1299
CID: 5535802
Biocuration of a Transcription Factors Network Involved in Submergence Tolerance during Seed Germination and Coleoptile Elongation in Rice (Oryza sativa)
Naithani, Sushma; Mohanty, Bijayalaxmi; Elser, Justin; D'Eustachio, Peter; Jaiswal, Pankaj
Modeling biological processes and genetic-regulatory networks using in silico approaches provides a valuable framework for understanding how genes and associated allelic and genotypic differences result in specific traits. Submergence tolerance is a significant agronomic trait in rice; however, the gene-gene interactions linked with this polygenic trait remain largely unknown. In this study, we constructed a network of 57 transcription factors involved in seed germination and coleoptile elongation under submergence. The gene-gene interactions were based on the co-expression profiles of genes and the presence of transcription factor binding sites in the promoter region of target genes. We also incorporated published experimental evidence, wherever available, to support gene-gene, gene-protein, and protein-protein interactions. The co-expression data were obtained by re-analyzing publicly available transcriptome data from rice. Notably, this network includes OSH1, OSH15, OSH71, Sub1B, ERFs, WRKYs, NACs, ZFP36, TCPs, etc., which play key regulatory roles in seed germination, coleoptile elongation and submergence response, and mediate gravitropic signaling by regulating OsLAZY1 and/or IL2. The network of transcription factors was manually biocurated and submitted to the Plant Reactome Knowledgebase to make it publicly accessible. We expect this work will facilitate the re-analysis/re-use of OMICs data and aid genomics research to accelerate crop improvement.
PMCID:10255735
PMID: 37299125
ISSN: 2223-7747
CID: 5609822
The Gene Ontology knowledgebase in 2023
,; Aleksander, Suzi A; Balhoff, James; Carbon, Seth; Cherry, J Michael; Drabkin, Harold J; Ebert, Dustin; Feuermann, Marc; Gaudet, Pascale; Harris, Nomi L; Hill, David P; Lee, Raymond; Mi, Huaiyu; Moxon, Sierra; Mungall, Christopher J; Muruganugan, Anushya; Mushayahama, Tremayne; Sternberg, Paul W; Thomas, Paul D; Van Auken, Kimberly; Ramsey, Jolene; Siegele, Deborah A; Chisholm, Rex L; Fey, Petra; Aspromonte, Maria Cristina; Nugnes, Maria Victoria; Quaglia, Federica; Tosatto, Silvio; Giglio, Michelle; Nadendla, Suvarna; Antonazzo, Giulia; Attrill, Helen; Dos Santos, Gil; Marygold, Steven; Strelets, Victor; Tabone, Christopher J; Thurmond, Jim; Zhou, Pinglei; Ahmed, Saadullah H; Asanitthong, Praoparn; Luna Buitrago, Diana; Erdol, Meltem N; Gage, Matthew C; Ali Kadhum, Mohamed; Li, Kan Yan Chloe; Long, Miao; Michalak, Aleksandra; Pesala, Angeline; Pritazahra, Armalya; Saverimuttu, Shirin C C; Su, Renzhi; Thurlow, Kate E; Lovering, Ruth C; Logie, Colin; Oliferenko, Snezhana; Blake, Judith; Christie, Karen; Corbani, Lori; Dolan, Mary E; Drabkin, Harold J; Hill, David P; Ni, Li; Sitnikov, Dmitry; Smith, Cynthia; Cuzick, Alayne; Seager, James; Cooper, Laurel; Elser, Justin; Jaiswal, Pankaj; Gupta, Parul; Jaiswal, Pankaj; Naithani, Sushma; Lera-Ramirez, Manuel; Rutherford, Kim; Wood, Valerie; De Pons, Jeffrey L; Dwinell, Melinda R; Hayman, G Thomas; Kaldunski, Mary L; Kwitek, Anne E; Laulederkind, Stanley J F; Tutaj, Marek A; Vedi, Mahima; Wang, Shur-Jen; D'Eustachio, Peter; Aimo, Lucila; Axelsen, Kristian; Bridge, Alan; Hyka-Nouspikel, Nevila; Morgat, Anne; Aleksander, Suzi A; Cherry, J Michael; Engel, Stacia R; Karra, Kalpana; Miyasato, Stuart R; Nash, Robert S; Skrzypek, Marek S; Weng, Shuai; Wong, Edith D; Bakker, Erika; Berardini, Tanya Z; Reiser, Leonore; Auchincloss, Andrea; Axelsen, Kristian; Argoud-Puy, Ghislaine; Blatter, Marie-Claude; Boutet, Emmanuel; Breuza, Lionel; Bridge, Alan; Casals-Casas, Cristina; Coudert, Elisabeth; Estreicher, Anne; Livia Famiglietti, Maria; Feuermann, Marc; Gos, Arnaud; Gruaz-Gumowski, Nadine; Hulo, Chantal; Hyka-Nouspikel, Nevila; Jungo, Florence; Le Mercier, Philippe; Lieberherr, Damien; Masson, Patrick; Morgat, Anne; Pedruzzi, Ivo; Pourcel, Lucille; Poux, Sylvain; Rivoire, Catherine; Sundaram, Shyamala; Bateman, Alex; Bowler-Barnett, Emily; Bye-A-Jee, Hema; Denny, Paul; Ignatchenko, Alexandr; Ishtiaq, Rizwan; Lock, Antonia; Lussi, Yvonne; Magrane, Michele; Martin, Maria J; Orchard, Sandra; Raposo, Pedro; Speretta, Elena; Tyagi, Nidhi; Warner, Kate; Zaru, Rossana; Diehl, Alexander D; Lee, Raymond; Chan, Juancarlos; Diamantakis, Stavros; Raciti, Daniela; Zarowiecki, Magdalena; Fisher, Malcolm; James-Zorn, Christina; Ponferrada, Virgilio; Zorn, Aaron; Ramachandran, Sridhar; Ruzicka, Leyla; Westerfield, Monte
The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.
PMCID:10158837
PMID: 36866529
ISSN: 1943-2631
CID: 5609782
Using the Reactome Database
Rothfels, Karen; Milacic, Marija; Matthews, Lisa; Haw, Robin; Sevilla, Cristoffer; Gillespie, Marc; Stephan, Ralf; Gong, Chuqiao; Ragueneau, Eliot; May, Bruce; Shamovsky, Veronica; Wright, Adam; Weiser, Joel; Beavers, Deidre; Conley, Patrick; Tiwari, Krishna; Jassal, Bijay; Griss, Johannes; Senff-Ribeiro, Andrea; Brunson, Timothy; Petryszak, Robert; Hermjakob, Henning; D'Eustachio, Peter; Wu, Guanming; Stein, Lincoln
Pathway databases provide descriptions of the roles of proteins, nucleic acids, lipids, carbohydrates, and other molecular entities within their biological cellular contexts. Pathway-centric views of these roles may allow for the discovery of unexpected functional relationships in data such as gene expression profiles and somatic mutation catalogues from tumor cells. For this reason, there is a high demand for high-quality pathway databases and their associated tools. The Reactome project (a collaboration between the Ontario Institute for Cancer Research, New York University Langone Health, the European Bioinformatics Institute, and Oregon Health & Science University) is one such pathway database. Reactome collects detailed information on biological pathways and processes in humans from the primary literature. Reactome content is manually curated, expert-authored, and peer-reviewed and spans the gamut from simple intermediate metabolism to signaling pathways and complex cellular events. This information is supplemented with likely orthologous molecular reactions in mouse, rat, zebrafish, worm, and other model organisms. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Browsing a Reactome pathway Basic Protocol 2: Exploring Reactome annotations of disease and drugs Basic Protocol 3: Finding the pathways involving a gene or protein Alternate Protocol 1: Finding the pathways involving a gene or protein using UniProtKB (SwissProt), Ensembl, or Entrez gene identifier Alternate Protocol 2: Using advanced search Basic Protocol 4: Using the Reactome pathway analysis tool to identify statistically overrepresented pathways Basic Protocol 5: Using the Reactome pathway analysis tool to overlay expression data onto Reactome pathway diagrams Basic Protocol 6: Comparing inferred model organism and human pathways using the Species Comparison tool Basic Protocol 7: Comparing tissue-specific expression using the Tissue Distribution tool.
PMID: 37053306
ISSN: 2691-1299
CID: 5464262