Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
???displayArticle.abstract???
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and Application Programming Interfaces (APIs). Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse software. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
Fig. 1. MOD landing pages at the Alliance portal. A common look and feel that allows community-specific content.
Fig. 2. Paralog table for C. elegans hlh-25. The table presents a ranking of paralogs for the hlh-25 gene, based on a weighted scoring algorithm that incorporates sequence conservation metrics. It lists the gene symbols, provides the alignment length in amino acids, and quantifies the similarity and identity percentages of genes paralogous to hlh-25. The methodology count, indicating the number of algorithms supporting the paralogous relationship, is also included. In this ranking, hlh-27 is identified as the primary paralog due to its high similarity and identity scores, despite being recognized by fewer methods than hlh-28.
Fig. 3. Sequence detail widget. Chosen views of a specific gene are readily available for copying as plain text or with highlights. 5′ region of the human PLAA gene.
Fig. 4. Screenshot of results from the Alliance SequenceServer BLAST tool. The results have been enhanced relative to the default SequenceServer results page by the addition of links to Alliance JBrowse and to the corresponding gene page (in this case C. elegans abi-1) at the Alliance website for each BLAST hit.
Fig. 5. Output of a BLAST search. After a user clicks on the JBrowse link for a BLAST hit, they are directed to the web service where they will see a track for the BLAST hit and how the hit aligns with other tracks.
Fig. 6. AllianceMine example. Using a simple template, a disease ontology (DO) term, in this case “autism,” is chosen, and all genes associated with this DO term are returned in a downloadable table.
Fig. 7. Alliance pathway viewer. The pathway widget displays gene products (rectangles with gene names) and chemicals (rectangles with chemical abbreviations) and the flow of information and material between them (relations). These relations, shown in legend, indicate direct or indirect regulation that can be positive, negative, or of unknown effect direction. For metabolites that mediate the information flow between gene products, distinct shading distinguishes metabolites that are the inputs or outputs of a reaction.
Fig. 8. Evolution of data flow. Graphical summary showing the design of short-term infrastructure initially deployed to support rapid delivery of unified data to the community and the planned production system. Red, data quartermasters at MODs; yellow, data; brown, database; green, transformations; blue, user interface.
Fig. 9. Alliance curation tool. Screenshot of the Alliance curation tool interface showing an example of curated annotations of AGMs managed in the persistent store.
Fig. 10. Textpresso for SGD literature at the Alliance (http://sgd-textpresso.alliancegenome.org/tpc/search).
Fig. 11. Swagger interface for the Alliance APIs.
Fig. 12. Alliance community forum home page.
Fig. 13. Mockup of an expression detail page. This example shows one of the current features of WormBase—single-cell data from 2 studies—displayed on what will be part of an Alliance gene expression detail page.
Fig. 14. Mockup of the AD portal showing the home page and the data access page. These views illustrate the type of information that will be available with a disease focus.
Aleksander,
The Gene Ontology knowledgebase in 2023.
2023, Pubmed,
Xenbase
Aleksander,
The Gene Ontology knowledgebase in 2023.
2023,
Pubmed
,
Xenbase
Alliance of Genome Resources Consortium,
Harmonizing model organism data in the Alliance of Genome Resources.
2022,
Pubmed
Altenhoff,
OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more.
2021,
Pubmed
Altschul,
Basic local alignment search tool.
1990,
Pubmed
Anderson,
Data management: A global coalition to sustain core data.
2017,
Pubmed
Arnaboldi,
Text mining meets community curation: a newly designed curation platform to improve author experience and participation at WormBase.
2020,
Pubmed
Bornstein,
The NIH Comparative Genomics Resource: addressing the promises and challenges of comparative genomics on human health.
2023,
Pubmed
Bowes,
The Xenbase literature curation process.
2013,
Pubmed
,
Xenbase
Bradford,
From multiallele fish to nonstandard environments, how ZFIN assigns phenotypes, human disease models, and gene expression annotations to genes.
2023,
Pubmed
Bult,
The alliance of genome resources: transforming comparative genomics.
2023,
Pubmed
Bunt,
Directly e-mailing authors of newly published papers encourages community curation.
2012,
Pubmed
Carotenuto,
Xenopus laevis (Daudin, 1802) as a Model Organism for Bioscience: A Historic Review and Perspective.
2023,
Pubmed
,
Xenbase
Cohen,
Genome editing of Caenorhabditis briggsae using CRISPR/Cas9 co-conversion marker dpy-10.
2019,
Pubmed
Cohen,
Formation and function of dauer ascarosides in the nematodes Caenorhabditis briggsae and Caenorhabditis elegans.
2022,
Pubmed
Cosentino,
SonicParanoid: fast, accurate and easy orthology inference.
2019,
Pubmed
Davis,
WormBase in 2022-data, processes, and tools for analyzing Caenorhabditis elegans.
2022,
Pubmed
Dunn,
Apollo: Democratizing genome annotation.
2019,
Pubmed
Emms,
OrthoFinder: phylogenetic orthology inference for comparative genomics.
2019,
Pubmed
Engel,
New data and collaborations at the Saccharomyces Genome Database: updated reference genome, alleles, and the Alliance of Genome Resources.
2022,
Pubmed
Fang,
Automatic categorization of diverse experimental information in the bioscience literature.
2012,
Pubmed
Fisher,
Xenbase: key features and resources of the Xenopus model organism knowledgebase.
2023,
Pubmed
,
Xenbase
FlyBase Consortium,
The FlyBase database of the Drosophila Genome Projects and community literature.
1999,
Pubmed
Fuentes,
PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies.
2022,
Pubmed
Gramates,
FlyBase: a guided tour of highlighted features.
2022,
Pubmed
Howe,
Model organism data evolving in support of translational medicine.
2018,
Pubmed
Hu,
FlyRNAi.org-the database of the Drosophila RNAi screening center and transgenic RNAi project: 2021 update.
2021,
Pubmed
Hu,
An integrative approach to ortholog prediction for disease-focused and other functional studies.
2011,
Pubmed
Inoue,
Genetic analysis of dauer formation in Caenorhabditis briggsae.
2007,
Pubmed
Ivanova,
Orthologs of the Caenorhabditis elegans heterochronic genes have divergent functions in Caenorhabditis briggsae.
2023,
Pubmed
Jhaveri,
Genome annotation of Caenorhabditis briggsae by TEC-RED identifies new exons, paralogs, and conserved and novel operons.
2022,
Pubmed
Jiang,
Integrating image caption information into biomedical document classification in support of biocuration.
2020,
Pubmed
Kishore,
Automated generation of gene summaries at the Alliance of Genome Resources.
2020,
Pubmed
Kostiuk,
Xenopus as a platform for discovery of genes relevant to human disease.
2021,
Pubmed
,
Xenbase
Larkin,
FlyBase: updates to the Drosophila melanogaster knowledge base.
2021,
Pubmed
Liu,
OntoMate: a text-mining tool aiding curation at the Rat Genome Database.
2015,
Pubmed
Milacic,
The Reactome Pathway Knowledgebase 2024.
2024,
Pubmed
Mitros,
A chromosome-scale genome assembly and dense genetic map for Xenopus tropicalis.
2019,
Pubmed
,
Xenbase
Moya,
Novel and improved Caenorhabditis briggsae gene models generated by community curation.
2023,
Pubmed
Müller,
Textpresso: an ontology-based information retrieval and extraction system for biological literature.
2004,
Pubmed
Müller,
Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature.
2018,
Pubmed
Nevers,
The Quest for Orthologs orthology benchmark service in 2022.
2022,
Pubmed
Nevers,
OrthoInspector 3.0: open portal for comparative genomics.
2019,
Pubmed
Oliver,
Model organism databases: essential resources that need the support of both funders and users.
2016,
Pubmed
Persson,
InParanoid-DIAMOND: faster orthology analysis with the InParanoid algorithm.
2022,
Pubmed
Priyam,
Sequenceserver: A Modern Graphical User Interface for Custom BLAST Databases.
2019,
Pubmed
Ringwald,
Mouse Genome Informatics (MGI): latest news from MGD and GXD.
2022,
Pubmed
Sargent,
G-OnRamp: Generating genome browsers to facilitate undergraduate-driven collaborative genome annotation.
2020,
Pubmed
Session,
Genome evolution in the allotetraploid frog Xenopus laevis.
2016,
Pubmed
,
Xenbase
Sharanya,
Genetic control of vulval development in Caenorhabditis briggsae.
2012,
Pubmed
Smith,
InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data.
2012,
Pubmed
Sternberg,
WormBase 2024: status and transitioning to Alliance infrastructure.
2024,
Pubmed
Thomas,
PANTHER: Making genome-scale phylogenetics accessible to all.
2022,
Pubmed
Thomas,
Gene Ontology Causal Activity Modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems.
2019,
Pubmed
UniProt Consortium,
UniProt: the Universal Protein Knowledgebase in 2023.
2023,
Pubmed
Van Auken,
Text mining in the biocuration workflow: applications for literature curation at WormBase, dictyBase and TAIR.
2012,
Pubmed
Vedi,
2022 updates to the Rat Genome Database: a Findable, Accessible, Interoperable, and Reusable (FAIR) resource.
2023,
Pubmed
Wood,
Making biological knowledge useful for humans and machines.
2022,
Pubmed