Data Collection
In Southeast Asia, a total of 411 complete or near complete genomes from
India (n=355), Bangladesh (n=20), Indonesia (n=9), Thailand (n=122), Sri
Lanka (n=4), Nepal (n=1) and Myanmar (n=1) have been submitted to GISAID
(www.gisaid.org) as of 23rd May, 2020. This study
analyzed 60 genome sequences of SARS-CoV-2 with 30 genome sequences
coming from India, 10 from Bangladesh, 8 from Indonesia, 7 from
Thailand, 4 from Sri Lanka and one from Nepal. While another sequence
from Myanmar was available in GISAID, this sequence was excluded from
this study due to the poor quality of the data. The ratio of sequence by
country was determined based on the available number of sequences and
the total number of Covid-19 cases up to 23rd of May,
2020. The selection of sequences analyzed was based on genome quality,
discreteness of their location and random sample collection dates.
hCoV19 / Wuhan / WIV04 / 2019 (Accession: EPI_ISL_ 402124) was used as
a reference genome for phylogenetic and mutation analysis. Accessions
ID, collection dates and locations are provided in supplementary
Table-s1.