* Ancestries were estimated using 1000 Genomes Project Phase3 as the reference population. GCTA (doi: 10.1016/j.ajhg.2010.11.011) was used to project HostSeq sample genotypes onto the reference population via precomputed SNP loadings, and PCs were calculated. The samples were assigned to five super-populations (AFR, AMR, EAS, EUR, and SAS)