Research Engineering @ Turing
hut23.bsky.social
We are research software engineers and data scientists connecting research to applications at The Alan Turing Institute, the UK's national institute for data science and AI.
SSG is available on GitHub at github.com/alan-turing-... Check it out!

We also have a longer-form write-up with additional context here: sites.computer.org/debull/A24ju...
GitHub - alan-turing-institute/sqlsynthgen: Synthetic data for SQL databases
May 28, 2025 at 10:19 AM
In close collaboration with the UCLH Trust, we've developed SqlSynthGen (SSG): a Python library for generating synthetic data from relational databases. SSG is designed with transparency in mind, so data owners can control and audit the data they choose to expose.
May 28, 2025 at 10:19 AM
One solution to this problem is to generate synthetic data that shares the statistical properties of the original dataset, without including any personal information. We've been working on something to do just that!
May 28, 2025 at 10:19 AM