Leyan Wang
leyan-wang.bsky.social
Leyan Wang
@leyan-wang.bsky.social
Overall, ARG inference tools have demonstrated remarkable robustness to computational phasing errors. While various practical challenges may limit ARG inference quality, we believe that computational phasing inaccuracies should not be a problem of concern.
7/7
November 26, 2025 at 5:09 PM
Under a selective sweep model, we saw little difference in branch-length-based diversity between ARGs inferred from true and computationally phased haplotypes, demonstrating the practicality of applying ARGs inferred from unphased data in evolutionary analyses.
6/7
November 26, 2025 at 5:09 PM
Under the CEU bottleneck model, we observed consistently minor effects of phasing errors. The difference between ARG inference methods is much larger than the difference in performance between using true haplotypes and computationally phased haplotypes.
5/7
November 26, 2025 at 5:09 PM
In constant population size models, we observed only a slight reduction in ARG inference accuracy in terms of estimates of coalescence times or recombination breakpoint counts when using computationally phased haplotypes. The effect also reduced as the sample size increased.
4/7
November 26, 2025 at 5:09 PM
Here, we show that ARG inference remains robust to phasing errors even at incredibly small sample sizes (8 haplotypes). We simulated ARGs and VCFs under various demographic models, simulated phasing errors, and compared ARGs inferred from true and computationally phased haplotypes.
3/7
November 26, 2025 at 5:09 PM
ARGs are powerful tools in population genetics. However, ARG inference generally requires phased haplotypes. Phasing quality is limited at small sample sizes, making it difficult in non-model organisms due to the usually limited sample size and a lack of reference panels.
2/7
November 26, 2025 at 5:09 PM
We are excited to share our recent work on the surprising robustness of Ancestral Recombination Graph (ARG) inference tools to computational phasing errors, now available on BioRxiv: biorxiv.org/content/10.1....
This work is co-advised by @yundeng.bsky.social and Rasmus Nielsen.
1/7
Robustness of Ancestral Recombination Graph Inference Tools to Phasing Errors
Ancestral Recombination Graphs (ARGs) are fundamental population genetic structures that encode the genealogical history of a sample of haplotypes along the genome. They have recently received substan...
biorxiv.org
November 26, 2025 at 5:09 PM
Overall, ARG inference tools have demonstrated remarkable robustness to computational phasing errors. While various practical challenges may limit ARG inference quality, we believe that computational phasing inaccuracies should not be a problem of concern.
7/7
November 26, 2025 at 4:48 PM
Under a selective sweep model, we saw little difference in branch-length-based diversity between ARGs inferred from true and computationally phased haplotypes, demonstrating the practicality of applying ARGs inferred from unphased data in evolutionary analyses.
6/7
November 26, 2025 at 4:48 PM
Under the CEU bottleneck model, we observed consistently minor effects of phasing errors. The difference between ARG inference methods is much larger than the difference in performance between using true haplotypes and computationally phased haplotypes.
5/7
November 26, 2025 at 4:48 PM
In constant population size models, we observed only a slight reduction in ARG inference accuracy in terms of estimates of coalescence times or recombination breakpoint counts when using computationally phased haplotypes. The effect also reduced as the sample size increased.
4/7
November 26, 2025 at 4:48 PM
Here, we show that ARG inference remains robust to phasing errors even at incredibly small sample sizes (8 haplotypes). We simulated ARGs and VCFs under various demographic models, simulated phasing errors, and compared ARGs inferred from true and computationally phased haplotypes.
3/7
November 26, 2025 at 4:48 PM
ARGs are powerful tools in population genetics. However, ARG inference generally requires phased haplotypes. Phasing quality is limited at small sample sizes, making it difficult in non-model organisms due to the usually limited sample size and a lack of reference panels.
2/7
November 26, 2025 at 4:48 PM