WeChat Mini Program
Old Version Features

Evolution of Genome Structure in Thedrosophila Simulansspecies Complex

Genome research(2020)SCI 1区SCI 2区

Cited 9|Views1
Abstract
ABSTRACT The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex ( D. simulans, D. mauritiana , and D. sechellia ), which speciated ∼250,000 years ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster due to structural divergence—twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, while the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
More
Translated text
Key words
Chromosome Duplication,Phylogenetic Analysis,phylogenetic tree,Adaptive Evolution,metagenomics assembly
PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Try using models to generate summary,it takes about 60s
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本研究构建了高质量的大果蝇属物种复合群(Drosophila simulans物种复合群)的参考基因组,揭示了基因组结构在物种分化过程中的快速演变及其对表型进化的影响。

方法】:通过de novo方法生成了高度连续的参考基因组,并利用这些基因组组装对比分析了D. simulans、D. mauritiana和D. sechellia与模式生物D. melanogaster之间的重复序列差异。

实验】:实验中使用了Drosophila simulans物种复合群的基因组数据,通过比较分析发现至少15%的D. simulans复合群物种基因组因结构差异而无法与D. melanogaster对齐,这一比例是单核苷酸替换数量的两倍。研究还发现卫星DNA的快速更迭和异染色质区域的大量结构分化,而常染色质基因含量则相对保守。尽管基因排列顺序总体上保持不变,但各物种的常染色质区域受到种内特有的倒位、转座子元素、卫星和tRNA串联数组的扩张与收缩以及基因复制的影响。研究还发现了Y连锁基因的快速分化,包括拷贝数变异和来自自动族的最近基因复制。