Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, and laboratory rat; the Gene Ontology Consortium is also a member. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and APIs. Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the-art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
Figure 2. Paralog table for C. elegans hlh-25. The table presents a ranking of paralogs for the hlh-25 gene, based on a weighted scoring algorithm that incorporates sequence conservation metrics. It lists the gene symbols, provides the alignment length in amino acids, and quantifies the similarity and identity percentages of genes paralogous to hlh-25. The method count, indicating the number of algorithms supporting the paralogous relationship, is also included. In this ranking, hlh-27 is identified as the primary paralog due to its high similarity and identity scores, despite being recognized by fewer methods than hlh-28.
Figure 3. Sequence detail widget. Selected views of a specific gene are readily available for copying as plain text or with highlights. The example shows the 5’ region of the human PLAA gene.
Figure 4. Screenshot of results from the Alliance SequenceServer BLAST tool. The results have been enhanced relative to the default SequenceServer results page by the addition of links to Alliance JBrowse and to the corresponding gene page (in this case, C. elegans abi-1) at the Alliance website for each BLAST hit.
Figure 5. Output of a BLAST search. After a user clicks on the JBrowse link for a BLAST hit, they are directed to the web service, where they will see a track for the BLAST hit and how the hit aligns with other tracks.
Figure 6. AllianceMine example. Using a simple template, a disease ontology (DO) term is chosen, and all genes associated with this DO term are returned in a downloadable table.
Figure 7. Alliance Pathway Viewer. The pathway widget displays gene products (light purple rectangles), protein complexes (light grey rectangles) and chemicals (light blue rectangles) and the flow of information and material between them (relations). These relations, shown in the legend, indicate direct or indirect regulation that can be positive, negative or of unknown effect direction.
Figure 8. Evolution of Data Flow. Graphical summary showing the design of the short-term infrastructure initially deployed to support rapid delivery of unified data to the community, and of the planned production system. Red, data quartermasters at MODs; Yellow, data; Brown, database; Green, transformations; Blue, user interface.
Figure 9. Screenshot of the Alliance curation tool interface showing an example of curated annotations of Affected Genomic Models managed in the persistent store.
Figure 10. Textpresso for SGD literature at the Alliance. (http://sgdtextpresso.alliancegenome.org/tpc/search)
Figure 11. Swagger interface for the Alliance APIs.
Figure 12. Example of API output.
Figure 13. Mockup of the Alzheimer’s Disease Portal showing the Home page and the Data access page. These views illustrate the type of information that will be available with a disease focus.
Figure 14. Alliance community forum home page.
Figure 15. Mockup of an Expression Detail page. This example shows one of the current features of WormBase – single cell data from two studies – displayed on what will be part of an Alliance Gene Expression detail page.
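The paralog ranking described in the Figure 2 caption (a weighted score over alignment length, similarity, identity, and method count) can be illustrated with a minimal sketch. The weights, the normalization, the names `ParalogCandidate`, `score`, and `rank_paralogs`, and the example values are all assumptions made for illustration; the source does not give the Alliance's actual formula.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParalogCandidate:
    symbol: str
    alignment_length: int  # aligned amino acids
    similarity_pct: float  # 0-100
    identity_pct: float    # 0-100
    method_count: int      # number of algorithms supporting the call


# Hypothetical weights, chosen only for illustration (they sum to 1.0).
WEIGHTS = {"identity": 0.5, "similarity": 0.3, "length": 0.1, "methods": 0.1}


def score(c: ParalogCandidate, max_length: int, max_methods: int) -> float:
    """Weighted sum of metrics, each scaled to the range 0-1."""
    length_term = c.alignment_length / max_length if max_length else 0.0
    method_term = c.method_count / max_methods if max_methods else 0.0
    return (
        WEIGHTS["identity"] * c.identity_pct / 100.0
        + WEIGHTS["similarity"] * c.similarity_pct / 100.0
        + WEIGHTS["length"] * length_term
        + WEIGHTS["methods"] * method_term
    )


def rank_paralogs(candidates: list[ParalogCandidate]) -> list[ParalogCandidate]:
    """Return candidates sorted from highest to lowest score."""
    if not candidates:
        return []
    max_length = max(c.alignment_length for c in candidates)
    max_methods = max(c.method_count for c in candidates)
    return sorted(
        candidates,
        key=lambda c: score(c, max_length, max_methods),
        reverse=True,
    )


if __name__ == "__main__":
    # Placeholder genes with made-up values, not taken from the Alliance table.
    candidates = [
        ParalogCandidate("gene-b", 240, 60.0, 45.0, 4),
        ParalogCandidate("gene-a", 250, 90.0, 80.0, 2),
    ]
    for c in rank_paralogs(candidates):
        print(c.symbol)
```

Under these assumed weights, a candidate with high similarity and identity can outrank one supported by more methods, which is the kind of outcome the caption reports for hlh-27 and hlh-28.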