John Chodera
@jchodera.bsky.social
Achira | http://achira.ai
Research laboratory | http://choderalab.org
Antiviral drug discovery for pandemics | http://asapdiscovery.org
OpenADMET | http://openadmet.org
Employer-mandated disclaimer: http://choderalab.org/disclaimer
Pronouns: he/him
Research laboratory | http://choderalab.org
Antiviral drug discovery for pandemics | http://asapdiscovery.org
OpenADMET | http://openadmet.org
Employer-mandated disclaimer: http://choderalab.org/disclaimer
Pronouns: he/him
Chat, is this good?
October 11, 2025 at 1:53 AM
Chat, is this good?
Excited to be joining the AI for Science Symposium in San Francisco on May 16 organized by ML Foundry!
Check it out here:
mlfoundry.com/ai-for-scien...
Check it out here:
mlfoundry.com/ai-for-scien...
May 2, 2025 at 4:54 PM
Excited to be joining the AI for Science Symposium in San Francisco on May 16 organized by ML Foundry!
Check it out here:
mlfoundry.com/ai-for-scien...
Check it out here:
mlfoundry.com/ai-for-scien...
I have so many of these!
February 21, 2025 at 9:31 PM
I have so many of these!
Oh, have we got coffee cup ideas for you...
February 21, 2025 at 5:11 PM
Oh, have we got coffee cup ideas for you...
As a peek toward where we're headed:
Right now, CADD scientists are forced to use the same model week after week, even if new experimental data says the model is inaccurate.
If we can fine- models, we can exploit that data to systematically improve our predictions week by week!
Right now, CADD scientists are forced to use the same model week after week, even if new experimental data says the model is inaccurate.
If we can fine- models, we can exploit that data to systematically improve our predictions week by week!
February 19, 2025 at 7:36 PM
As a peek toward where we're headed:
Right now, CADD scientists are forced to use the same model week after week, even if new experimental data says the model is inaccurate.
If we can fine- models, we can exploit that data to systematically improve our predictions week by week!
Right now, CADD scientists are forced to use the same model week after week, even if new experimental data says the model is inaccurate.
If we can fine- models, we can exploit that data to systematically improve our predictions week by week!
While this strategy is still highly limited by the constraints of legacy molecular mechanics force fields which were developed for late-1970s era hardware.
Since then, compute per dollar has increased by 160 billion times. We need a totally new approach. Stay tuned for something exciting...
Since then, compute per dollar has increased by 160 billion times. We need a totally new approach. Stay tuned for something exciting...
February 19, 2025 at 7:30 PM
While this strategy is still highly limited by the constraints of legacy molecular mechanics force fields which were developed for late-1970s era hardware.
Since then, compute per dollar has increased by 160 billion times. We need a totally new approach. Stay tuned for something exciting...
Since then, compute per dollar has increased by 160 billion times. We need a totally new approach. Stay tuned for something exciting...
This simple regularization strategy was surprisingly effective at ensuring the reweighted free energy estimates at new parameters matched free energies from expensive new simulations at the new parameter set.
February 19, 2025 at 7:30 PM
This simple regularization strategy was surprisingly effective at ensuring the reweighted free energy estimates at new parameters matched free energies from expensive new simulations at the new parameter set.
Dominic showed that a regularized low-rank adjustment to the espaloma-0.3 MM foundation simulation model dramatically reduced errors in predicted hydration free energies in the held-out test set, shifting the error CDF far to the left, improving the RMSE by 0.82 kcal/mol--in just ten minutes!
February 19, 2025 at 7:30 PM
Dominic showed that a regularized low-rank adjustment to the espaloma-0.3 MM foundation simulation model dramatically reduced errors in predicted hydration free energies in the held-out test set, shifting the error CDF far to the left, improving the RMSE by 0.82 kcal/mol--in just ten minutes!
But this can be avoided! By adding a regularization term that penalizes effective sample size collapse, optimization takes a different path that maintains the ability of reweighting strategies to estimate accurate free energies for the new model parameters.
February 19, 2025 at 7:30 PM
But this can be avoided! By adding a regularization term that penalizes effective sample size collapse, optimization takes a different path that maintains the ability of reweighting strategies to estimate accurate free energies for the new model parameters.
Dominic also wanted it to be FAST. Could we do this in one step, avoiding need for additional simulations?
Unfortunately, once you start training the LoRA model, the conformational probability density can change rapidly, quickly escaping the region where free energy reweighting gives low errors.
Unfortunately, once you start training the LoRA model, the conformational probability density can change rapidly, quickly escaping the region where free energy reweighting gives low errors.
February 19, 2025 at 7:30 PM
Dominic also wanted it to be FAST. Could we do this in one step, avoiding need for additional simulations?
Unfortunately, once you start training the LoRA model, the conformational probability density can change rapidly, quickly escaping the region where free energy reweighting gives low errors.
Unfortunately, once you start training the LoRA model, the conformational probability density can change rapidly, quickly escaping the region where free energy reweighting gives low errors.
Borrowing ideas from fine-tuning LLMs, Dominic asked whether a low-rank adaptation (LoRA) could map from atom embedding vectors to learned perturbations to the charge-equilibration model parameters (electronegativity and hardness) to modify the small molecule partial charges.
February 19, 2025 at 7:30 PM
Borrowing ideas from fine-tuning LLMs, Dominic asked whether a low-rank adaptation (LoRA) could map from atom embedding vectors to learned perturbations to the charge-equilibration model parameters (electronegativity and hardness) to modify the small molecule partial charges.
For example, hydration free energies can be computed with high precision using alchemical free energy methods with Ken's espaloma-0.3 foundation model, but are relatively inaccurate, with an RMSE ~ 1.72 kcal/mol and a disappointing error cumulative distribution function (CDF).
February 19, 2025 at 7:30 PM
For example, hydration free energies can be computed with high precision using alchemical free energy methods with Ken's espaloma-0.3 foundation model, but are relatively inaccurate, with an RMSE ~ 1.72 kcal/mol and a disappointing error cumulative distribution function (CDF).
What can you do with a foundation simulation model? It's good at many tasks, but may not achieve the performance you want for every application.
Former PhD student Dominic Rufa wondered whether we could fine-tune these models to deliver better performance.
www.linkedin.com/in/dominic-r...
Former PhD student Dominic Rufa wondered whether we could fine-tune these models to deliver better performance.
www.linkedin.com/in/dominic-r...
February 19, 2025 at 7:30 PM
What can you do with a foundation simulation model? It's good at many tasks, but may not achieve the performance you want for every application.
Former PhD student Dominic Rufa wondered whether we could fine-tune these models to deliver better performance.
www.linkedin.com/in/dominic-r...
Former PhD student Dominic Rufa wondered whether we could fine-tune these models to deliver better performance.
www.linkedin.com/in/dominic-r...
In a tour de force, visiting scientist Ken Takaba [https://kntkb.github.io/] built the first true molecular mechanics foundation simulation model for proteins, nucleic acids, and small molecules that excels at at protein:ligand binding free energy benchmarks.
pubs.rsc.org/en/content/a...
pubs.rsc.org/en/content/a...
February 19, 2025 at 7:30 PM
In a tour de force, visiting scientist Ken Takaba [https://kntkb.github.io/] built the first true molecular mechanics foundation simulation model for proteins, nucleic acids, and small molecules that excels at at protein:ligand binding free energy benchmarks.
pubs.rsc.org/en/content/a...
pubs.rsc.org/en/content/a...
Aside: Yuanqing Wang is tremendously talented, and is on the faculty job market right now!
Check out his website: www.wangyq.net
and papers: scholar.google.com/citations?us...
Check out his website: www.wangyq.net
and papers: scholar.google.com/citations?us...
February 19, 2025 at 7:30 PM
Aside: Yuanqing Wang is tremendously talented, and is on the faculty job market right now!
Check out his website: www.wangyq.net
and papers: scholar.google.com/citations?us...
Check out his website: www.wangyq.net
and papers: scholar.google.com/citations?us...
Recently, Yuanqing Wang [https://www.wangyq.net/] (now a Simons and Schmidt Fellow at NYU) demonstrated how molecular mechanics force field construction can be cast as an end-to-end differentiable machine learning problem, presenting espaloma: espaloma.wangyq.net
February 19, 2025 at 7:30 PM
Recently, Yuanqing Wang [https://www.wangyq.net/] (now a Simons and Schmidt Fellow at NYU) demonstrated how molecular mechanics force field construction can be cast as an end-to-end differentiable machine learning problem, presenting espaloma: espaloma.wangyq.net
Everything is chaos, but I wanted to share some awesome recent science from the lab that hints at where the future of biomolecular simulation is headed:
Foundation simulation models that can be fine-tuned to experimental free energy data to produce systematically more accurate predictions.
Foundation simulation models that can be fine-tuned to experimental free energy data to produce systematically more accurate predictions.
February 19, 2025 at 7:30 PM
Everything is chaos, but I wanted to share some awesome recent science from the lab that hints at where the future of biomolecular simulation is headed:
Foundation simulation models that can be fine-tuned to experimental free energy data to produce systematically more accurate predictions.
Foundation simulation models that can be fine-tuned to experimental free energy data to produce systematically more accurate predictions.
In case you want to do something about it:
resist.bot/petitions/PM...
resist.bot/petitions/PM...
January 28, 2025 at 1:20 PM
In case you want to do something about it:
resist.bot/petitions/PM...
resist.bot/petitions/PM...
Happy Caturday to all who celebrate.
December 22, 2024 at 4:57 AM
Happy Caturday to all who celebrate.
Daniel Probst (@skepteis.bsky.social) is up next with "Less is More" at the ML4Molecules Workshop in Berlin, taking his own advice by presenting without the need for corporeal form.
December 6, 2024 at 12:45 PM
Daniel Probst (@skepteis.bsky.social) is up next with "Less is More" at the ML4Molecules Workshop in Berlin, taking his own advice by presenting without the need for corporeal form.
Cecilia Clementi (@cecclementi.bsky.social) kicks off the afternoon session of the ELLIS ML4Molecules Workshop in Berlin!
December 6, 2024 at 12:03 PM
Cecilia Clementi (@cecclementi.bsky.social) kicks off the afternoon session of the ELLIS ML4Molecules Workshop in Berlin!
Excited to speak at the ELLIS ML4Molecules Workshop 2024 in Berlin!
moleculediscovery.github.io/workshop2024/
moleculediscovery.github.io/workshop2024/
December 6, 2024 at 8:08 AM
Excited to speak at the ELLIS ML4Molecules Workshop 2024 in Berlin!
moleculediscovery.github.io/workshop2024/
moleculediscovery.github.io/workshop2024/