
The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes


Summary

- phylogenetic breadth of the underlying OrthoDB data set.
- To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
- The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
- While the BUSCO completeness level has become a widely used quality measure with similar importance as the N50 measure for assembly contiguity, its informative value is highly dependent on the quality and sampling of the underlying orthology data, which may differ vastly across taxonomic groups.
- Currently, multiple representative genomes of the highly diverse and rapidly evolving nematode phylum are still poorly annotated.
- To overcome this problem in the case of the nematode model organism Pristionchus pacificus, community-based curations have recently been initiated to improve the quality of gene annotations [3, 4].
- However, further strand-specific RNA-seq and Iso-seq data pointed towards the presence of numerous artificial gene fusions in gene-dense regions of the genome [3].
- Here, I make use of the community-curated annotations of the P. pacificus genome to improve the annotations of related Pristionchus and other genomes of the family Diplogastridae, which were recently sequenced to study the evolutionary dynamics of novel gene families [9].
- The few exceptions are the genomes of the free-living Oscheius tipulae and the parasitic Haemonchus contortus [18], Dirofilaria immitis [19], Loa loa [20], Brugia malayi [21], and Onchocerca volvulus [22].
- Please note that the most recent updates of the P.
- 1) and these do not represent the full range of genomic diversity of the nematode phylum [14, 23].
- To this end, I reannotated nine nematode genomes of the family Diplogastridae including seven other Pristionchus species, which were sequenced previously as part of a phylogenomic study to investigate the evolutionary dynamics of novel gene families [9, 26].
- Specifically, predicted open reading frames (ORFs) in assembled RNA-seq transcripts [27] as well as protein sequences of the community-curated P.
- gene annotations [9].
- annotation accuracy occurs already within the same nematode family, I reevaluated the BUSCO completeness of the gene annotations only inferred from homology data (Fig.
- Moreover, the evaluation of the contribution of homology-inferred vs.
- The new gene annotations of the nine diplogastrid genomes contain between 19 and 39 thousand gene models that are completely evidence-based as they are either supported by transcriptional evidence or by protein conservation with P.
- The third most abundant pattern arises from 204 genes that were not found in any of the.
- As mentioned above, an alternative explanation would be that these genes are present but could not be detected by the BUSCO pipeline as the nematode odb10 data set does not represent the full diversity of the nematode phylum [14, 23].
- To further test the possibility of undetected orthologs, I used a complementary approach to find one-to-one orthologs of the corresponding C.
- This revealed that 101 (50%) of these 204 genes have predicted one-to-one orthologs in all diplogastrid genomes, which points to a failure of detection of the BUSCO pipeline (Fig.
- Comparison of the bitscores from BLASTP searches between BUSCO genes in C.
- Thus, I conclude that the insufficient sample size and phylogenetic breadth of the nematode odb10 data set may cause failures in the ortholog detection by the BUSCO pipeline and that the quality of divergent nematode genomes might therefore be underestimated.
- In this study, I have demonstrated the benefit of the community-curated P.
- In this context, community annotation seems to be one of the most effective methods to increase gene annotation quality beyond what can be achieved using automated pipelines.
- For example, in the case of the nine diplogastrid genomes, BUSCO genes that are found in the genome or transcriptome (Additional file 1, Tables S1, S2), but not in the final gene annotations can be taken as candidates to further improve gene annotation quality by manual curation.
- In the case of the diplogastrid genomes, the completeness level is likely underestimated by up to 3% (101 out of 3131.
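
The figure of "up to 3%" can be checked with a line of arithmetic. The sketch below assumes that 3131 is the total number of BUSCO genes in the data set and that 101 is the number of genes present but missed by the pipeline; the source sentence is truncated, so this reading is an assumption, and the function name is illustrative only.

```python
def underestimate_percent(undetected: int, total_buscos: int) -> float:
    """Percentage of the BUSCO set that is present but was not detected."""
    if total_buscos <= 0:
        raise ValueError("total_buscos must be positive")
    return 100.0 * undetected / total_buscos


# Numbers quoted in the summary: 101 undetected orthologs out of 3131 genes.
print(f"{underestimate_percent(101, 3131):.1f}%")  # prints "3.2%"
```
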
- a The left heatmap shows the bitscores for 101 randomly subsampled BUSCO orthologs derived from a BLASTP search of the C. elegans proteins against annotated protein sets of the ten diplogastrid genomes.
- The generation of high quality genomes of further members of the Pristionchus genus may therefore help to characterize and compare the genomic basis of these convergent patterns.
- In the case of the nematode model organism Pristionchus pacificus, community-based gene curations have previously been presented as an effective means to lift annotation quality above the level of what could be obtained by automated pipelines.
- With BUSCO completeness levels between 83 and 86%, the reannotated Pristionchus genomes are more complete than most other members of the nematode phylum.
- Third, the insufficient sample size and phylogenetic breadth of the BUSCO and OrthoDB data sets may prohibit the detection of orthologs and thus cause an underestimation of nematode genome quality.
- These transcribed ORFs were aligned against the respective reference assembly using the exonerate protein2genome program with the following parameter settings: --bestn 2, --dnawordlen 20, and --maxintron 20000 (version .
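
As a rough illustration of the alignment step, the sketch below assembles an exonerate command line from the settings quoted above. The three parameter values come from the text; the file names and the function name are hypothetical, the `--query` and `--target` options are the standard way to pass input files to exonerate rather than something the summary states, and the exonerate version remains unknown because it is cut off in the source.

```python
import shlex
from typing import List


def exonerate_command(protein_fasta: str, genome_fasta: str) -> List[str]:
    """Build an exonerate protein2genome call with the quoted settings."""
    return [
        "exonerate",
        "--model", "protein2genome",   # spliced protein-to-genome alignment
        "--bestn", "2",                # report at most two hits per query
        "--dnawordlen", "20",
        "--maxintron", "20000",        # written as 20,000 in the text
        "--query", protein_fasta,      # hypothetical input: translated ORFs
        "--target", genome_fasta,      # hypothetical input: genome assembly
    ]


cmd = exonerate_command("orfs.fa", "assembly.fa")
print(shlex.join(cmd))
```

Returning the arguments as a list keeps the sketch usable with `subprocess.run(cmd)` without any shell quoting; the printed string is for inspection only.
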
- The complexity of the joint annotation was reduced by a simple heuristic to generate a set of non-redundant annotations.
- From the result files of the BUSCO pipeline, genes that were classified to be missing were extracted and compared.
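
One way to carry out such a comparison is sketched below. It assumes the result files are BUSCO `full_table.tsv` files, which are tab-separated with `#` comment lines and the status in the second column, and it tallies in which genomes each missing gene is absent. The function names, genome labels, and BUSCO identifiers are made up for illustration; this is not the author's script.

```python
from collections import Counter
from typing import Dict, Iterable, Set, Tuple


def missing_busco_ids(full_table_lines: Iterable[str]) -> Set[str]:
    """Collect BUSCO ids whose status column reads 'Missing'."""
    missing = set()
    for line in full_table_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header comments
        fields = line.rstrip("\n").split("\t")
        if len(fields) >= 2 and fields[1] == "Missing":
            missing.add(fields[0])
    return missing


def missing_patterns(missing_by_genome: Dict[str, Set[str]]) -> Counter:
    """Count, for each set of genomes, how many BUSCO ids are missing in exactly that set."""
    genomes = sorted(missing_by_genome)
    all_ids = set().union(*missing_by_genome.values())
    patterns: Counter = Counter()
    for busco_id in all_ids:
        key: Tuple[str, ...] = tuple(
            g for g in genomes if busco_id in missing_by_genome[g]
        )
        patterns[key] += 1
    return patterns


# Toy input standing in for two full_table.tsv files (ids are invented).
tables = {
    "genome_A": [
        "# Busco id\tStatus\tSequence\tScore\tLength",
        "1at6231\tComplete\tg1\t100\t300",
        "2at6231\tMissing",
        "3at6231\tMissing",
    ],
    "genome_B": [
        "1at6231\tMissing",
        "2at6231\tMissing",
        "3at6231\tFragmented\tg9\t50\t120",
    ],
}
missing = {name: missing_busco_ids(lines) for name, lines in tables.items()}
patterns = missing_patterns(missing)
for genomes, count in sorted(patterns.items()):
    print(genomes, count)
```

Sorting the resulting counter by count would rank the absence patterns, for example to find the group of genes missing from every genome.
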
- The online version contains supplementary material available at https://doi.org/10.1186/s x.
- a The left heatmap shows the normalized bitscores (bitscore / alignment length) for 1490 BUSCO orthologs derived from a BLASTP search of the C.
- b The aligned proportion was computed as the length of the BLASTP alignment divided by the protein length of the C.
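
The two quantities defined in these captions are simple ratios and can be sketched as follows. The class and field names are hypothetical, and the example values are invented rather than taken from the study.

```python
from dataclasses import dataclass


@dataclass
class BlastHit:
    bitscore: float
    alignment_length: int   # length of the BLASTP alignment
    query_length: int       # length of the query protein


def normalized_bitscore(hit: BlastHit) -> float:
    """Bitscore divided by alignment length, as in panel a."""
    if hit.alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return hit.bitscore / hit.alignment_length


def aligned_proportion(hit: BlastHit) -> float:
    """Alignment length divided by query protein length, as in panel b."""
    if hit.query_length <= 0:
        raise ValueError("query_length must be positive")
    return hit.alignment_length / hit.query_length


hit = BlastHit(bitscore=450.0, alignment_length=300, query_length=400)
print(normalized_bitscore(hit))  # 1.5
print(aligned_proportion(hit))   # 0.75
```

Because the alignment length reported by BLASTP includes gap columns, the aligned proportion computed this way can slightly exceed 1 for gapped alignments.
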
- The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
- All data sets were submitted to WormBase ParaSite and are also available at http://pristionchus.org/download/.
- https://doi.org/10.1186/s .
- https://doi.org/10.1093/bioinformatics/btv351.
- https://doi.org/10.1038/s .
- doi.org .
- https://doi.org/10.1371/journal.pgen.1008687.
- https://doi.org/10.1534/.
- https://doi.org/10.1101/gr .
- https://doi.org/10.1093/molbev/msaa207.
- Single-molecule sequencing reveals the chromosome-scale genomic architecture of the nematode model organism Pristionchus pacificus.
- https://doi.org/10.1016/j.celrep .
- https://doi.org/10.1016/j.molbiopara .
- https://doi.org/10.1186/s x.
- https://doi.org/10.1002/.
- https://doi.org/10.1038/.
- The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets.
- https://doi.org/10.1096/fj.12-205096.
- https://doi.org/10.1038/ng.2585.
- https://doi.org/10.1038/nmicrobiol.2016.216.
- https://doi.org/10.1093/nar/gky1053.
- Expanding the view on the evolution of the nematode dauer signalling pathways: refinement through gene gain and pathway co-option.
- https://doi.org/10.1186/s .
- Improving the annotation of the Heterorhabditis bacteriophora genome.
- https://doi.org/10.1093/gigascience/giy034.
- https://doi.org/10.1534/genetics .
- https://doi.org .
- https://doi.org/10.1093/.
- https://doi.org/10.1093/molbev/msaa235.
- https://doi.org/10.1016/j.tig .
- Comparative genomics of the major parasitic worms.
- https://doi.org/10.1093/bioinformatics/bti310.
- https://doi.org/10.1093/bioinformatics/btaa1016.
- Structure, function and evolution of the nematode genome.
- https://doi.org a 0024603.
- https://doi.org/10.7554/elife.55687.
- https://doi.org/10.1093/molbev/msw093.
- https://doi.org/10.1126/science.aav9856.
- https://doi.org/10.1371/journal.pgen.1005146.
- https://doi.org/10.1073/pnas .
- Genome sequence of the metazoan plant-parasitic nematode Meloidogyne incognita.
- https://doi.org/10.1038/nbt.1482.
- https://doi.org/10.1038/srep20316.
- The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families.
- https://doi.org/10.1038/ng.3237.
- Genome of the human hookworm Necator americanus.
- https://doi.org/10.1038/ng.2875.
- https://doi.org/10.1371/journal.pone.0069618.
- https://doi.org/10.1186/gb-2014-15-3-r43.
- The genome of the yellow potato cyst nematode, Globodera rostochiensis, reveals insights into the basis of parasitism and virulence.
- https://doi.org/10.1371/journal.ppat.1002219.
- Population genomics of the filarial nematode parasite Wuchereria bancrofti from mosquitoes.
- https://doi.org/10.1111/mec.13574.
- https://doi.org/10.1016/j.devcel .
- The draft genome of the parasitic nematode Trichinella spiralis.
- https://doi.org/10.1038/ng.769.
- https://doi.org/10.1126/science.aao0827.
- https://doi.org/10.1093/bioinformatics/btq706.
