Searched for: in-biosketch:yes
person:deustp01
Toll-like receptor signaling in vertebrates: testing the integration of protein, complex, and pathway data in the protein ontology framework
Arighi, Cecilia; Shamovsky, Veronica; Masci, Anna Maria; Ruttenberg, Alan; Smith, Barry; Natale, Darren A; Wu, Cathy; D'Eustachio, Peter
The Protein Ontology (PRO) provides terms for and supports annotation of species-specific protein complexes in an ontology framework that relates them both to their components and to species-independent families of complexes. Comprehensive curation of experimentally known forms and annotations thereof is expected to expose discrepancies, differences, and gaps in our knowledge. We have annotated the early events of innate immune signaling mediated by Toll-Like Receptor 3 and 4 complexes in human, mouse, and chicken. The resulting ontology and annotation data set has allowed us to identify species-specific gaps in experimental data and possible functional differences between species, and to employ inferred structural and functional relationships to suggest plausible resolutions of these discrepancies and gaps.
PMCID:4404318
PMID: 25894391
ISSN: 1932-6203
CID: 1542542
The Reactome pathway knowledgebase
Croft, David; Mundo, Antonio Fabregat; Haw, Robin; Milacic, Marija; Weiser, Joel; Wu, Guanming; Caudy, Michael; Garapati, Phani; Gillespie, Marc; Kamdar, Maulik R; Jassal, Bijay; Jupe, Steven; Matthews, Lisa; May, Bruce; Palatnik, Stanislav; Rothfels, Karen; Shamovsky, Veronica; Song, Heeyeon; Williams, Mark; Birney, Ewan; Hermjakob, Henning; Stein, Lincoln; D'Eustachio, Peter
Reactome (http://www.reactome.org) is a manually curated open-source open-data resource of human pathways and reactions. The current version 46 describes 7088 human proteins (34% of the predicted human proteome), participating in 6744 reactions based on data extracted from 15 107 research publications with PubMed links. The Reactome Web site and analysis tool set have been completely redesigned to increase speed, flexibility and user friendliness. The data model has been extended to support annotation of disease processes due to infectious agents and to mutation.
PMCID:3965010
PMID: 24243840
ISSN: 0305-1048
CID: 782562
Gramene 2013: comparative plant genomics resources
Monaco, Marcela K; Stein, Joshua; Naithani, Sushma; Wei, Sharon; Dharmawardhana, Palitha; Kumari, Sunita; Amarasinghe, Vindhya; Youens-Clark, Ken; Thomason, James; Preece, Justin; Pasternak, Shiran; Olson, Andrew; Jiao, Yinping; Lu, Zhenyuan; Bolser, Dan; Kerhornou, Arnaud; Staines, Dan; Walts, Brandon; Wu, Guanming; D'Eustachio, Peter; Haw, Robin; Croft, David; Kersey, Paul J; Stein, Lincoln; Jaiswal, Pankaj; Ware, Doreen
Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.
PMCID:3964986
PMID: 24217918
ISSN: 0305-1048
CID: 782552
Protein Ontology: a controlled structured network of protein entities
Natale, Darren A; Arighi, Cecilia N; Blake, Judith A; Bult, Carol J; Christie, Karen R; Cowart, Julie; D'Eustachio, Peter; Diehl, Alexander D; Drabkin, Harold J; Helfer, Olivia; Huang, Hongzhan; Masci, Anna Maria; Ren, Jia; Roberts, Natalia V; Ross, Karen; Ruttenberg, Alan; Shamovsky, Veronica; Smith, Barry; Yerramalla, Meher Shruti; Zhang, Jian; Aljanahi, Aisha; Celen, Irem; Gan, Cynthia; Lv, Mengxi; Schuster-Lezell, Emily; Wu, Cathy H
The Protein Ontology (PRO; http://proconsortium.org) formally defines protein entities and explicitly represents their major forms and interrelations. Protein entities represented in PRO corresponding to single amino acid chains are categorized by level of specificity into family, gene, sequence and modification metaclasses, and there is a separate metaclass for protein complexes. All metaclasses also have organism-specific derivatives. PRO complements established sequence databases such as UniProtKB, and interoperates with other biomedical and biological ontologies such as the Gene Ontology (GO). PRO relates to UniProtKB in that PRO's organism-specific classes of proteins encoded by a specific gene correspond to entities documented in UniProtKB entries. PRO relates to the GO in that PRO's representations of organism-specific protein complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology. The past few years have seen growth and changes to the PRO, as well as new points of access to the data and new applications of PRO in immunology and proteomics. Here we describe some of these developments.
PMCID:3964965
PMID: 24270789
ISSN: 0305-1048
CID: 652102
Pathway databases: making chemical and biological sense of the genomic data flood
D'Eustachio, Peter
Pathway databases are a means to systematically associate proteins with their functions and link them into networks that describe the reaction space of an organism. Here, the Reactome Knowledgebase provides a convenient example to illustrate strategies used to assemble such a reaction space based on manually curated experimental data, approaches to semiautomated extension of these manual annotations to infer annotations for a large fraction of a species' proteins, and the use of networks of functional annotations to infer pathway relationships among variant proteins that have been associated with disease risk through genome-wide surveys and resequencing studies of tumors.
PMCID:3678733
PMID: 23706629
ISSN: 1074-5521
CID: 361782
Gene Ontology annotations and resources
Blake, J A; Dolan, M; Drabkin, H; Hill, D P; Li, Ni; Sitnikov, D; Bridges, S; Burgess, S; Buza, T; McCarthy, F; Peddinti, D; Pillai, L; Carbon, S; Dietze, H; Ireland, A; Lewis, S E; Mungall, C J; Gaudet, P; Chrisholm, R L; Fey, P; Kibbe, W A; Basu, S; Siegele, D A; McIntosh, B K; Renfro, D P; Zweifel, A E; Hu, J C; Brown, N H; Tweedie, S; Alam-Faruque, Y; Apweiler, R; Auchinchloss, A; Axelsen, K; Bely, B; Blatter, M -C; Bonilla, C; Bouguerleret, L; Boutet, E; Breuza, L; Bridge, A; Chan, W M; Chavali, G; Coudert, E; Dimmer, E; Estreicher, A; Famiglietti, L; Feuermann, M; Gos, A; Gruaz-Gumowski, N; Hieta, R; Hinz, C; Hulo, C; Huntley, R; James, J; Jungo, F; Keller, G; Laiho, K; Legge, D; Lemercier, P; Lieberherr, D; Magrane, M; Martin, M J; Masson, P; Mutowo-Muellenet, P; O'Donovan, C; Pedruzzi, I; Pichler, K; Poggioli, D; Porras Millán, P; Poux, S; Rivoire, C; Roechert, B; Sawford, T; Schneider, M; Stutz, A; Sundaram, S; Tognolli, M; Xenarios, I; Foulgar, R; Lomax, J; Roncaglia, P; Khodiyar, V K; Lovering, R C; Talmud, P J; Chibucos, M; Giglio, M Gwinn; Chang, H -Y; Hunter, S; McAnulla, C; Mitchell, A; Sangrador, A; Stephan, R; Harris, M A; Oliver, S G; Rutherford, K; Wood, V; Bahler, J; Lock, A; Kersey, P J; McDowall, D M; Staines, D M; Dwinell, M; Shimoyama, M; Laulederkind, S; Hayman, T; Wang, S -J; Petri, V; Lowry, T; D'Eustachio, P; Matthews, L; Balakrishnan, R; Binkley, G; Cherry, J M; Costanzo, M C; Dwight, S S; Engel, S R; Fisk, D G; Hitz, B C; Hong, E L; Karra, K; Miyasato, S R; Nash, R S; Park, J; Skrzypek, M S; Weng, S; Wong, E D; Berardini, T Z; Huala, E; Mi, H; Thomas, P D; Chan, J; Kishore, R; Sternberg, P; Van Auken, K; Howe, D; Westerfield, M
The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources.
PMCID:3531070
PMID: 23161678
ISSN: 1362-4962
CID: 2953432
Annotating cancer variants and anti-cancer therapeutics in reactome
Milacic, Marija; Haw, Robin; Rothfels, Karen; Wu, Guanming; Croft, David; Hermjakob, Henning; D'Eustachio, Peter; Stein, Lincoln
Reactome describes biological pathways as chemical reactions that closely mirror the actual physical interactions that occur in the cell. Recent extensions of our data model accommodate the annotation of cancer and other disease processes. First, we have extended our class of protein modifications to accommodate annotation of changes in amino acid sequence and the formation of fusion proteins to describe the proteins involved in disease processes. Second, we have added a disease attribute to reaction, pathway, and physical entity classes that uses disease ontology terms. To support the graphical representation of "cancer" pathways, we have adapted our Pathway Browser to display disease variants and events in a way that allows comparison with the wild type pathway, and shows connections between perturbations in cancer and other biological pathways. The curation of pathways associated with cancer, coupled with our efforts to create other disease-specific pathways, will interoperate with our existing pathway and network analysis tools. Using the Epidermal Growth Factor Receptor (EGFR) signaling pathway as an example, we show how Reactome annotates and presents the altered biological behavior of EGFR variants due to their altered kinase and ligand-binding properties, and the mode of action and specificity of anti-cancer therapeutics.
PMCID:3712731
PMID: 24213504
ISSN: 2072-6694
CID: 3663682
The Gene Ontology: enhancements for 2011
Blake, JA; Dolan, M; Drabkin, H; Hill, DP; Ni, L; Sitnikov, D; Burgess, S; Buza, T; Gresham, C; McCarthy, F; Pillai, L; Wang, H; Carbon, S; Lewis, SE; Mungall, CJ; Gaudet, P; Chisholm, RL; Fey, P; Kibbe, WA; Basu, S; Siegele, DA; McIntosh, BK; Renfro, DP; Zweifel, AE; Hu, JC; Brown, NH; Tweedie, S; Alam-Faruque, Y; Apweiler, R; Auchinchloss, A; Axelsen, K; Argoud-Puy, G; Bely, B; Blatter, M-C; Bougueleret, L; Boutet, E; Branconi-Quintaje, S; Breuza, L; Bridge, A; Browne, P; Chan, WM; Coudert, E; Cusin, I; Dimmer, E; Duek-Roggli, P; Eberhardt, R; Estreicher, A; Famiglietti, L; Ferro-Rojas, S; Feuermann, M; Gardner, M; Gos, A; Gruaz-Gumowski, N; Hinz, U; Hulo, C; Huntley, R; James, J; Jimenez, S; Jungo, F; Keller, G; Laiho, K; Legge, D; Lemercier, P; Lieberherr, D; Magrane, M; Martin, MJ; Masson, P; Moinat, M; O'Donovan, C; Pedruzzi, I; Pichler, K; Poggioli, D; Millan, PPorras; Poux, S; Rivoire, C; Roechert, B; Sawford, T; Schneider, M; Sehra, H; Stanley, E; Stutz, A; Sundaram, S; Tognolli, M; Xenarios, I; Foulger, R; Lomax, J; Roncaglia, P; Camon, E; Khodiyar, VK; Lovering, RC; Talmud, PJ; Chibucos, M; Giglio, MGwinn; Dolinski, K; Heinicke, S; Livstone, MS; Stephan, R; Harris, MA; Oliver, SG; Rutherford, K; Wood, V; Bahler, J; Lock, A; Kersey, PJ; McDowall, MD; Staines, DM; Dwinell, M; Shimoyama, M; Laulederkind, S; Hayman, T; Wang, S-J; Petri, V; Lowry, T; D'Eustachio, P; Matthews, L; Amundsen, CD; Balakrishnan, R; Binkley, G; Cherry, JM; Christie, KR; Costanzo, MC; Dwight, SS; Engel, SR; Fisk, DG; Hirschman, JE; Hitz, BC; Hong, EL; Karra, K; Krieger, CJ; Miyasato, SR; Nash, RS; Park, J; Skrzypek, MS; Weng, S; Wong, ED; Berardini, TZ; Li, D; Huala, E; Slonim, D; Wick, H; Thomas, P; Chan, J; Kishore, R; Sternberg, P; Van Auken, K; Howe, D; Westerfield, M; Gene Ontology Consortium
The Gene Ontology (GO) (http://www.geneontology.org) is a community bioinformatics resource that represents gene product function through the use of structured, controlled vocabularies. The number of GO annotations of gene products has increased due to curation efforts among GO Consortium (GOC) groups, including focused literature-based annotation and ortholog-based functional inference. The GO ontologies continue to expand and improve as a result of targeted ontology development, including the introduction of computable logical definitions and development of new tools for the streamlined addition of terms to the ontology. The GOC continues to support its user community through the use of e-mail lists, social media and web-based resources.
ISI:000298601300084
ISSN: 0305-1048
CID: 2337532
Recent advances in biocuration: meeting report from the Fifth International Biocuration Conference [Meeting Abstract]
Gaudet, Pascale; Arighi, Cecilia; Bastian, Frederic; Bateman, Alex; Blake, Judith A; Cherry, Michael J; D'Eustachio, Peter; Finn, Robert; Giglio, Michelle; Hirschman, Lynette; Kania, Renate; Klimke, William; Martin, Maria Jesus; Karsch-Mizrachi, Ilene; Munoz-Torres, Monica; Natale, Darren; O'Donovan, Claire; Ouellette, Francis; Pruitt, Kim D; Robinson-Rechavi, Marc; Sansone, Susanna-Assunta; Schofield, Paul; Sutton, Granger; Van Auken, Kimberly; Vasudevan, Sona; Wu, Cathy; Young, Jasmine; Mazumder, Raja
The 5th International Biocuration Conference brought together over 300 scientists to exchange on their work, as well as discuss issues relevant to the International Society for Biocuration's (ISB) mission. Recurring themes this year included the creation and promotion of gold standards, the need for more ontologies, and more formal interactions with journals. The conference is an essential part of the ISB's goal to support exchanges among members of the biocuration community. Next year's conference will be held in Cambridge, UK, from 7 to 10 April 2013. In the meanwhile, the ISB website provides information about the society's activities (http://biocurator.org), as well as related events of interest.
PMCID:3483532
PMID: 23110974
ISSN: 1758-0463
CID: 2953412
The representation of protein complexes in the Protein Ontology (PRO)
Bult, Carol J; Drabkin, Harold J; Evsikov, Alexei; Natale, Darren; Arighi, Cecilia; Roberts, Natalia; Ruttenberg, Alan; D'Eustachio, Peter; Smith, Barry; Blake, Judith A; Wu, Cathy
ABSTRACT: BACKGROUND: Representing species-specific proteins and protein complexes in ontologies that are both human- and machine-readable facilitates the retrieval, analysis, and interpretation of genome-scale data sets. Although existing protin-centric informatics resources provide the biomedical research community with well-curated compendia of protein sequence and structure, these resources lack formal ontological representations of the relationships among the proteins themselves. The Protein Ontology (PRO) Consortium is filling this informatics resource gap by developing ontological representations and relationships among proteins and their variants and modified forms. Because proteins are often functional only as members of stable protein complexes, the PRO Consortium, in collaboration with existing protein and pathway databases, has launched a new initiative to implement logical and consistent representation of protein complexes. DESCRIPTION: We describe here how the PRO Consortium is meeting the challenge of representing species-specific protein complexes, how protein complex representation in PRO supports annotation of protein complexes and comparative biology, and how PRO is being integrated into existing community bioinformatics resources. The PRO resource is accessible at http://pir.georgetown.edu/pro/. CONCLUSION: PRO is a unique database resource for species-specific protein complexes. PRO facilitates robust annotation of variations in composition and function contexts for protein complexes within and between species
PMCID:3189193
PMID: 21929785
ISSN: 1471-2105
CID: 139625