Here's a flowchart overview :))
A little note on which of the FASTAs to use: the HRC data we got access to was aligned to an older genome build, and many variants were lost during liftover. Those 50k+ variants are in the 1000 Genomes databases.