Genetic Diversity Analysis of Seven Natural Populations of Procambarus clarkii from Jiangsu Province Based on Whole-Genome SNP Variations

XU Yu, XU Zhiqiang, YAN Weihui, LI Jiajia, HUANG Hongbing, LI Xuguang, LIU Yuanzhu, PAN Jianlin

Chin Agric Sci Bull ›› 2025, Vol. 41 ›› Issue (17) : 144-151.

Abbreviation (ISO4): Chin Agric Sci Bull      Editor in chief: Yulong YIN

DOI: 10.11924/j.issn.1000-6850.casb2024-0725

Abstract

To explore the influence of artificial breeding on the genetic diversity of natural Procambarus clarkii populations, SNP markers developed by SLAF-seq were used to analyze the genetic diversity and genetic structure of seven natural P. clarkii populations in Jiangsu Province. The populations came from the main production areas (Weishan Lake, Gaoyou Lake, Hongze Lake, and Luoma Lake) and the non-main production areas (Ge Lake, Gucheng Lake, and Yangcheng Lake). In total, 176,486 SNPs and 24,828 InDels were detected in 210 individuals. The average Q30 value and GC content of the sequencing data were 92.77% and 44.31%, respectively. The average polymorphic information content (PIC) of the seven populations ranged from 0.2070 to 0.2246. Expected heterozygosity (He) ranged from 0.2388 to 0.2854 and observed heterozygosity (Ho) from 0.3204 to 0.3387; because Ho exceeded He throughout, the populations show a degree of heterozygote excess. The coefficient of genetic differentiation (Fst) between populations ranged from 0.033 to 0.124, indicating moderate differentiation. The seven populations were divided into four lineages: the Hongze Lake, Luoma Lake, Gaoyou Lake, and Ge Lake populations formed one group, while the Weishan Lake, Gucheng Lake, and Yangcheng Lake populations each formed a separate group, showing a clear geographical pattern. Linkage disequilibrium (LD) analysis showed that LD decayed over a short distance in P. clarkii, with r² falling below 0.1 within a physical distance of 100 bp. Selective sweep analysis further identified 153 genomic regions under strong selection between the main and non-main production area populations. Genes in these regions were significantly enriched in pathways such as Cell cycle, AMPK signaling pathway, and Meiosis, which are closely related to the biological functions of P. clarkii. This study characterizes the genetic diversity of natural P. clarkii populations in both the main and non-main production areas of Jiangsu Province, providing a theoretical basis for the development, utilization, and conservation of P. clarkii germplasm resources.
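The diversity indices reported in the abstract (PIC, He, Ho) can be illustrated for a single biallelic SNP. The sketch below is not the authors' pipeline. It assumes genotypes coded as 0, 1, or 2 copies of the alternate allele, and the function names and toy genotypes are hypothetical. PIC follows the standard formula for a two-allele locus.

```python
from typing import Sequence


def alt_allele_frequency(genotypes: Sequence[int]) -> float:
    """Alternate-allele frequency from diploid genotypes coded 0/1/2 (assumed coding)."""
    return sum(genotypes) / (2 * len(genotypes))


def expected_heterozygosity(p: float) -> float:
    """He for a biallelic locus: 1 - p^2 - q^2, which equals 2pq."""
    return 2 * p * (1 - p)


def observed_heterozygosity(genotypes: Sequence[int]) -> float:
    """Ho: fraction of individuals that are heterozygous (genotype code 1)."""
    return sum(1 for g in genotypes if g == 1) / len(genotypes)


def polymorphic_information_content(p: float) -> float:
    """PIC for a biallelic locus: 1 - (p^2 + q^2) - 2 * p^2 * q^2."""
    q = 1 - p
    return 1 - (p * p + q * q) - 2 * (p * p) * (q * q)


# Toy data for one SNP in ten individuals (illustrative values, not from the study).
toy_genotypes = [0, 1, 1, 2, 1, 0, 1, 1, 2, 1]
p_alt = alt_allele_frequency(toy_genotypes)
he = expected_heterozygosity(p_alt)
ho = observed_heterozygosity(toy_genotypes)
pic = polymorphic_information_content(p_alt)
print(f"p={p_alt:.3f} He={he:.3f} Ho={ho:.3f} PIC={pic:.3f}")
```

In this toy locus Ho is larger than He, the same direction of difference that the abstract interprets as heterozygote excess. Population-level values such as those reported would be averages of these per-locus quantities over all SNPs.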

Key words

Procambarus clarkii / SLAF-seq / SNP / genetic structure / genetic diversity / Jiangsu Province

Cite this article

XU Yu, XU Zhiqiang, YAN Weihui, et al. Genetic Diversity Analysis of Seven Natural Populations of Procambarus clarkii from Jiangsu Province Based on Whole-Genome SNP Variations[J]. Chinese Agricultural Science Bulletin, 2025, 41(17): 144-151. https://doi.org/10.11924/j.issn.1000-6850.casb2024-0725
