2.4 K-mer analyses of clean Illumina reads
The remaining clean Illumina reads were used to estimate the genomic characteristics of C. japonica before genome assembly. In the present study, K-mer-based analysis was used to estimate the genome size, heterozygosity rate, and repeat sequence content of the C. japonica genome (Liu et al., 2013). A K-mer length of 17 was selected to ensure that the number of possible K-mers (4^17, approximately 1.7 × 10^10) was large enough to cover the entire C. japonica genome.
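The estimate described above can be illustrated with a minimal Python sketch. It is not the pipeline used in this study: the function names, the `min_depth` error cutoff, and the in-memory counting are assumptions made for illustration, and a real analysis would use a dedicated K-mer counter and would typically count canonical (strand-collapsed) K-mers. The sketch counts 17-mers in a set of reads, builds the depth histogram, and applies the common approximation that genome size is the total number of K-mers divided by the depth of the main histogram peak.

```python
from collections import Counter

K = 17  # K-mer length used in the text

# Number of distinct sequences of length 17 over the alphabet A, C, G, T.
POSSIBLE_KMERS = 4 ** K  # 17,179,869,184, i.e. about 1.7e10


def count_kmers(reads, k=K):
    """Count every overlapping window of length k across all reads."""
    counts = Counter()
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if "N" not in kmer:  # skip windows containing ambiguous bases
                counts[kmer] += 1
    return counts


def depth_histogram(counts):
    """Map each depth to the number of distinct K-mers observed at that depth."""
    return Counter(counts.values())


def estimate_genome_size(hist, min_depth=2):
    """Return (estimated genome size, main peak depth).

    Depths below min_depth are discarded as likely sequencing errors;
    this cutoff is an assumed value, not one taken from the study.
    """
    usable = {d: n for d, n in hist.items() if d >= min_depth}
    if not usable:
        raise ValueError("no K-mers at or above min_depth")
    # Main peak: the depth shared by the largest number of distinct K-mers.
    peak_depth = max(usable, key=lambda d: usable[d])
    # Total K-mer occurrences, summed over the retained depths.
    total_kmers = sum(d * n for d, n in usable.items())
    return total_kmers / peak_depth, peak_depth
```

In this simplified form, heterozygosity and repeat content are not estimated; in a depth histogram they usually appear as a secondary peak near half the main depth and as a tail at multiples of the main depth, respectively.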