Haky Im
hakyim.bsky.social
Statistician doing genomic data science, faculty at the University of Chicago, Korean, Argentinean, American. Love kimchi, math, science, books with beautiful prose.
Ughhh, suffering through the PLOS system now. My collaborators won’t let me submit future papers to PLOS
February 11, 2025 at 9:06 PM
with invaluable contributions from @temicrates Lisha Zhu @ssalazar_02 Sarah Sumner Hyunki Kim Saideep Gona @Festus_nyasimi Rohit Kulkarni @drjosephpowell @madduri
@boxiangliu
November 15, 2024 at 4:34 AM
Consistency is key
November 13, 2024 at 3:26 AM
I’m at UChicago, where I develop methods to understand the biology of complex traits and diseases, and I aspire to help real people with my research. Currently very optimistic about predicting molecular traits from DNA sequences using models with millions to billions of parameters
November 13, 2024 at 3:25 AM
Reposted by Haky Im
Also worth noting: the US NIH offers financial support for those from ANY underrepresented group at ANY career stage to join an NIH funded lab!

Find a professor whose research you’re interested in and see if they are willing to host you! grants.nih.gov/grants/guide...
PA-23-189: Research Supplements to Promote Diversity in Health-Related Research (Admin Supp Clinical...
NIH Funding Opportunities and Notices in the NIH Guide for Grants and Contracts: Research Supplements to Promote Diversity in Health-Related Research (Admin Supp Clinical Trial Not Allowed) PA-23-189....
grants.nih.gov
November 5, 2023 at 11:02 AM
TWAS (transcriptome-wide association study) is a statistical method that prioritizes genes that are more likely to cause a disease. It uses data from GWAS (genome-wide association studies), which identify genomic loci associated with diseases
October 23, 2023 at 2:53 PM
October 21, 2023 at 1:44 PM
Preprint should be up shortly
October 18, 2023 at 5:48 PM
4) LD is not necessary for the inflation to occur (our simulations were done using independent SNPs)

5) The inflation can be corrected by using the noncentral χ² distribution with noncentrality parameter N h²_δ Φ, where the factor Φ can be pre-calculated independent of the GWAS
October 18, 2023 at 5:48 PM
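A minimal sketch of the correction in point 5, assuming a 1-degree-of-freedom test and the noncentrality parameter taken as the product of N, the h² term, and Φ. The function names are hypothetical, the numbers are illustrative placeholders rather than values from the thread, and this is not the authors' implementation. It uses the identity that a 1-df noncentral χ² variable can be written as (Z + √λ)² with Z standard normal, so only the standard library is needed.

```python
import math


def normal_sf(t):
    """Upper-tail probability of a standard normal variable."""
    return 0.5 * math.erfc(t / math.sqrt(2.0))


def ncx2_sf_1df(x, nc):
    """P(X > x) for a noncentral chi-square with 1 degree of freedom.

    Uses X = (Z + sqrt(nc))**2 with Z standard normal, which gives
    P(X > x) = P(Z > sqrt(x) - sqrt(nc)) + P(Z < -sqrt(x) - sqrt(nc)).
    """
    if x <= 0:
        return 1.0
    root_x, root_nc = math.sqrt(x), math.sqrt(nc)
    return normal_sf(root_x - root_nc) + normal_sf(root_x + root_nc)


def corrected_pvalue(z, n, h2, phi):
    """P-value for z-score `z` against the noncentral reference distribution.

    n   : GWAS sample size (assumed meaning of N)
    h2  : the h-squared term in the noncentrality parameter (assumed input)
    phi : the pre-calculated factor Phi
    """
    nc = n * h2 * phi  # noncentrality parameter N * h2 * Phi
    return ncx2_sf_1df(z * z, nc)


# Illustrative placeholder inputs only.
naive = corrected_pvalue(4.0, n=0, h2=0.0, phi=0.0)  # nc = 0: central chi-square
corrected = corrected_pvalue(4.0, n=500_000, h2=0.5, phi=1e-5)
print(f"naive p = {naive:.3g}, corrected p = {corrected:.3g}")
```

With a zero noncentrality parameter the function reduces to the usual two-sided z-test p-value, so the corrected p-value is never smaller than the naive one.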
2) Uncertainty in the prediction of the mediator does not cause inflation

3) Uncertainty in the prediction of the mediator reduces the power of the test
October 18, 2023 at 5:47 PM
In summary

1) Polygenicity of the target trait induces inflation in the test statistics regardless of the genetic architecture of the mediating trait
October 18, 2023 at 5:47 PM
Does this inflation affect other mediator-based *WAS?

Yes

What if we use PRS of GWAS traits to correlate with target traits? Is this going to be inflated?

Yes. You need to estimate Φ and use the noncentral χ² distribution
October 18, 2023 at 5:46 PM
Back to the effect of precision
Precision of prediction improves power; equivalently, prediction error reduces power but doesn’t increase inflation under the null
October 18, 2023 at 5:46 PM
How well does your formula work under the alternative with finite N?

Pretty well
October 18, 2023 at 5:46 PM
What happens under the alternative?

See figure for formula under the alternative

τ² is the precision of the prediction
October 18, 2023 at 5:45 PM
Can you estimate Φ?

Yes

See figure: most of the Φ are around 10e-5
October 18, 2023 at 5:45 PM