Jae Young Choi
jychoi.bsky.social
Jae Young Choi
@jychoi.bsky.social
Assistant professor of evolutionary genomics at KU. Has a toddler 🙂🙃🙂🙃🇰🇷🇨🇦🇺🇲
https://jychoilab.github.io/
Huh I remember seeing a similar thread at twitter eons ago. Is nature healing?
November 17, 2025 at 7:10 PM
If you're interested in our method please check out the study and the method on github.
github.com/jaeyoungchoi...
GitHub - jaeyoungchoilab/Topsicle: Topsicle utilizes abundance of telomere pattern k-mers to estimate telomere length in long read.
Topsicle utilizes abundance of telomere pattern k-mers to estimate telomere length in long read. - jaeyoungchoilab/Topsicle
github.com
September 22, 2025 at 2:18 PM
Details of our study can be found in my previous threads from our biorxiv preprint.
bsky.app/profile/jych...
September 22, 2025 at 2:18 PM
Toxin antidote system found in speciation research is good examples of this.
July 30, 2025 at 9:33 PM
Topsicle can work on either Nanopore or Pacbio data. And you don't need to have a reference genome. Just get your FASTQ and analyze with Topsicle! Our package can be found on github. If you're interested in the telomere length of your organism please try!
github.com/jaeyoungchoi...
GitHub - jaeyoungchoilab/Topsicle: Topsicle utilizes abundance of telomere pattern k-mers to estimate telomere length in long read.
Topsicle utilizes abundance of telomere pattern k-mers to estimate telomere length in long read. - jaeyoungchoilab/Topsicle
github.com
July 15, 2025 at 2:14 PM
We also used our method on 8 human cancer cell lines and compared it to Telogator which is a method developed mostly for mammalian telomeres. And results shows that our method is well correlated with Telogator.
July 15, 2025 at 2:14 PM
When we compare Topsicle estimates with gold standard direct estimates from Southern blots its fairly well correlated suggesting our method works pretty well. We tried our method in A. thaliana, maize, and monkeyflower and they all worked well.
July 15, 2025 at 2:14 PM
Topsicle performs fairly well under simulated reads of varying sizes and error rates. Interestingly under simulated coverages we saw that the genome coverage didnt have too much of an effect on telomere length estimates
July 15, 2025 at 2:14 PM
This is done by implementing a change point method. These are commonly used in electrical engineering to model signal intensity data and it also works well for our purpose. We implemented our approach into a program called Topsicle.
July 15, 2025 at 2:14 PM
Because long reads can be noisy looking exact matching telomere repeat sequence (e.g. AAACCCT) can miss repeat matches. Instead we search for k-mers of different sizes to fill in the gaps. From here telomere length is measured by finding the point where theres a sudden drop in telo density
July 15, 2025 at 2:14 PM
Instead we thought what about a genomics approach? Technically if a long read was sequenced from the telomere we should be able to use that read to estimate the telomere length. And thats what we did! We came up with a way to search for telomeric long reads
July 15, 2025 at 2:14 PM
My lab is interested in understanding what drives the genetic variation in telomeres of plants. We are interested in telomere length variation and to study this we need to measure length of the telomere but this protocol is often difficult to implement in a typical lab. Think of Southern blots
July 15, 2025 at 2:14 PM
Please give it a try if you're interested in measuring the telomere length of the organism you long read sequenced. Hopefully the peer reviewed manuscript of our method will come out shortly!
June 28, 2025 at 5:54 PM
And it performs fairly well against various error rates and genome coverage. We tested it against gold standard telomere length measurements made from several A. thaliana ecotypes and found it was highly correlated with Topsicle predictions as well.
June 28, 2025 at 5:54 PM
Topsicle searches for long reads from the telomere and then uses those as candidate telomere reads. Then we quantify telomere repeats with k-mer analysis and identify the telomere-subtelomere boundary using change point analysis.
June 28, 2025 at 5:54 PM
Traditional way of measuring telomere length is thru Southern blot but its a lost art not many know how to do. Instead we thought what about long read sequencing since long read from telomere could be used to identify the telomere and measure its length. So we developed Topsicle. *image not related*
June 28, 2025 at 5:54 PM
Telomere length is a really interesting phenotype where despite having crucial roles in maintaining genome stability its length can vary alot between individuals and even between species. Why is that? Is a major question many like to answer.
June 28, 2025 at 5:54 PM
June 25, 2025 at 5:54 PM