Clint Valentine
moonlight.bio
Clint Valentine
@moonlight.bio
Interested in the molecular patterns of carcinogenesis, adventure racing, bioinformatics, and hand-rolled pasta. Professional crud & stuff wrangler, aspiring artisanal software author.
I do this! I encode first, and provide a "was UTF-8 encoded" marker in the INFO header so I auto-detect when I've done it and decode when reading. Helpers here:

github.com/clintval/tp5...

^ not actually encoding JSON in that tool, but other kinds of structured text with chars that are illegal.
tp53/scala/seshat/src/io/cvbio/seshat/VcfUtil.scala at 400b3e35f83175dfe59891cbc6906f2cfb7738c4 · clintval/tp53
Tools for programmatically annotating VCFs with the Seshat TP53 database - clintval/tp53
github.com
November 7, 2025 at 8:51 PM
This is pretty cool. Thanks for sharing! Yes it indeed looks like me :)
October 30, 2024 at 8:35 PM