{"id":182906,"date":"2024-02-17T01:35:03","date_gmt":"2024-02-17T07:35:03","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2024\/02\/large-scale-gene-expression-alterations-introduced-by-structural-variation-drive-morphotype-diversification-in-brassica-oleracea"},"modified":"2024-02-17T01:35:03","modified_gmt":"2024-02-17T07:35:03","slug":"large-scale-gene-expression-alterations-introduced-by-structural-variation-drive-morphotype-diversification-in-brassica-oleracea","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2024\/02\/large-scale-gene-expression-alterations-introduced-by-structural-variation-drive-morphotype-diversification-in-brassica-oleracea","title":{"rendered":"Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea"},"content":{"rendered":"<p style=\"padding-right: 20px\"><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/large-scale-gene-expression-alterations-introduced-by-structural-variation-drive-morphotype-diversification-in-brassica-oleracea2.jpg\"><\/a><\/p>\n<p>To construct a pan-genome that encompasses the full range of genetic diversity in <i><i>B. ole <\/i>racea<\/i>, we analyzed the resequencing data of 704 globally distributed <i><i>B. ole <\/i>racea<\/i> accessions covering all different morphotypes and their wild relatives (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM4\">1<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM4\">2<\/a>). We identified 3,792,290 SNPs and 528,850 InDels in these accessions using cabbage JZS as reference genome<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Cai, X. et al. Improved Brassica oleracea JZS assembly reveals significant changing of LTR-RT dynamics in different morphotypes. Theor. Appl. Genet. 133, 3187&ndash;3199 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR22\" id=\"ref-link-section-d181667972e792\">22<\/a><\/sup>. A phylogenetic tree was then constructed using SNPs, which classified the 704 accessions into the following three main groups: wild <i><i>B. ole <\/i>racea<\/i> and kales, arrested inflorescence lineage (AIL) and leafy head lineage (LHL; Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Fig1\">1a<\/a> and Supplementary Note <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM1\">2<\/a>). The phylogenetic relationship revealed in our study was generally consistent with those reported previously<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\" title=\"Cheng, F. et al. Subgenome parallel selection is associated with morphotype diversification and convergent crop domestication in Brassica rapa and Brassica oleracea. Nat. Genet. 48, 1218&ndash;1224 (2016).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR4\" id=\"ref-link-section-d181667972e806\">4<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\" title=\"Guo, N. et al. Genome sequencing sheds light on the contribution of structural variants to Brassica oleracea diversification. BMC Biol. 19, 93 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR5\" id=\"ref-link-section-d181667972e809\">5<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Cai, C., Bucher, J., Bakker, F. T. & Bonnema, G. Evidence for two domestication lineages supporting a middle-eastern origin for Brassica oleracea crops from diversified kale populations. Hortic. Res 9, uhac033 (2022).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR24\" id=\"ref-link-section-d181667972e812\">24<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Mabry, M. E. et al. The evolutionary history of wild, domesticated, and feral Brassica oleracea (Brassicaceae). Mol. Biol. Evol. 38, 4419&ndash;4434 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR25\" id=\"ref-link-section-d181667972e815\">25<\/a><\/sup>. Based on the phylogeny and morphotype diversity, we selected 22 representative accessions for de novo genome assembly (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Tab1\">1<\/a>).<\/p>\n<p>We assembled genome sequences of the 22 accessions by integrating long-reads (PacBio or Nanopore sequencing), optical mapping molecules (BioNano) or high-throughput chromosome conformation capture data (Hi-C) and Illumina short-reads (Methods; Supplementary Note <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM1\">2<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM1\">3\u2013<\/a><a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM4\">7<\/a>). The total genome size of these assemblies ranged from 539.87 to 584.16 Mb with an average contig N50 of 19.18 Mb (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Tab1\">1<\/a>). An average of 98% contig sequences were anchored to the nine pseudochromosomes of <i><i>B. ole <\/i>racea<\/i>. The completeness of these genome assemblies was assessed using benchmarking universal single-copy orthologs (BUSCO), with an average of 98.70% complete score in these genomes (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#MOESM1\">8<\/a>).<\/p>\n<p>To minimize artifacts that could arise from different gene prediction approaches, we predicted gene models of both the 22 newly assembled genomes and the five reported high-quality genomes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\" title=\"Guo, N. et al. Genome sequencing sheds light on the contribution of structural variants to Brassica oleracea diversification. BMC Biol. 19, 93 (2021).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR5\" id=\"ref-link-section-d181667972e2759\">5<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Lv, H. et al. A high-quality reference genome for cabbage obtained with SMRT reveals novel genomic features and evolutionary characteristics. Sci. Rep. 10, 12394 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR21\" id=\"ref-link-section-d181667972e2762\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Cai, X. et al. Improved Brassica oleracea JZS assembly reveals significant changing of LTR-RT dynamics in different morphotypes. Theor. Appl. Genet. 133, 3187&ndash;3199 (2020).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR22\" id=\"ref-link-section-d181667972e2762_1\">22<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Belser, C. et al. Chromosome-scale assemblies of plant genomes using nanopore long reads and optical maps. Nat. Plants. 4879&ndash;887 (2018).\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#ref-CR23\" id=\"ref-link-section-d181667972e2765\">23<\/a><\/sup> using the same annotation pipeline (Methods). Using an integrated strategy combining ab initio, homology-based and transcriptome-assisted prediction, we obtained a range of 50,346 to 55,003 protein-coding genes with a mean BUSCO value of 97.9% in these genomes (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Tab1\">1<\/a>). After gene prediction, a phylogenetic tree constructed based on single-copy orthologous genes clustered the 27 genomes into three groups, similar to the results observed in the population (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Fig1\">1a<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"https:\/\/www.nature.com\/articles\/s41588-024-01655-4#Fig1\">b<\/a>).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>To construct a pan-genome that encompasses the full range of genetic diversity in B. ole racea, we analyzed the resequencing data of 704 globally distributed B. ole racea accessions covering all different morphotypes and their wild relatives (Supplementary Tables 1 and 2). We identified 3,792,290 SNPs and 528,850 InDels in these accessions using cabbage JZS [\u2026]<\/p>\n","protected":false},"author":661,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,412],"tags":[],"class_list":["post-182906","post","type-post","status-publish","format-standard","hentry","category-biotech-medical","category-genetics"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/182906","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/661"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=182906"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/182906\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=182906"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=182906"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=182906"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}