Data Collection
In Southeast Asia, a total of 411 complete or near complete genomes from India (n=355), Bangladesh (n=20), Indonesia (n=9), Thailand (n=122), Sri Lanka (n=4), Nepal (n=1) and Myanmar (n=1) have been submitted to GISAID (www.gisaid.org) as of 23rd May, 2020. This study analyzed 60 genome sequences of SARS-CoV-2 with 30 genome sequences coming from India, 10 from Bangladesh, 8 from Indonesia, 7 from Thailand, 4 from Sri Lanka and one from Nepal. While another sequence from Myanmar was available in GISAID, this sequence was excluded from this study due to the poor quality of the data. The ratio of sequence by country was determined based on the available number of sequences and the total number of Covid-19 cases up to 23rd of May, 2020. The selection of sequences analyzed was based on genome quality, discreteness of their location and random sample collection dates. hCoV19 / Wuhan / WIV04 / 2019 (Accession: EPI_ISL_ 402124) was used as a reference genome for phylogenetic and mutation analysis. Accessions ID, collection dates and locations are provided in supplementary Table-s1.