Gordon
banner
gordonmurray.com
Gordon
@gordonmurray.com
AWS Systems Engineer | Cloud Infrastructure and Terraform Enthusiast | Focused on scalable, secure and compliant solutions | Always exploring new tools

GitHub https://github.com/gordonmurray
I really like using fly.io for small projects. I like knowing cost up front though, so I wrote a CLI tool called flycost.
Like www.infracost.io but for fly.io. It reads your fly.toml file and gives a monthly cost estimate
I’d love feedback from anyone running on Fly.io
github.com/gordonmurray...
Deploy app servers close to your users · Fly
fly.io
September 16, 2025 at 8:01 PM
I built a simple to-do list app called www.notebare.com. All the data is stored in Lance format from @lancedb. Performance seems quick, Anyone can try it out, you can sign in with Github. Feedback more than welcome!
notebare.com
www.notebare.com
September 15, 2025 at 7:21 PM
Cleaned up my old project: CDC from MariaDB → Flink → back into MariaDB using Docker Compose. Flink’s CDC connectors (powered by Debezium) keeps the target table up to date in real time. Code on GitHub: github.com/gordonmurray...
GitHub - gordonmurray/apache_flink_and_docker_compose: A complete real-time Change Data Capture (CDC) pipeline using Apache Flink, MariaDB, and Docker Compose. This project demonstrates how to build a...
A complete real-time Change Data Capture (CDC) pipeline using Apache Flink, MariaDB, and Docker Compose. This project demonstrates how to build a modern streaming analytics system that processes da...
github.com
September 7, 2025 at 5:17 PM
Apache Fluss is a storage engine:
• Fast key lookups (e.g. user status)
• Instant historic queries (via Paimon on S3/ORC)
• Works with Flink SQL for heavy lifting
• Tablets replicate & recover automatically

Made a small POC to try it out: github.com/gordonmurray...
GitHub - gordonmurray/apache_fluss_flink_and_paimon: This project integrates Apache Fluss (stream-batch unified storage) with Apache Paimon (data lake storage) using Apache Flink
This project integrates Apache Fluss (stream-batch unified storage) with Apache Paimon (data lake storage) using Apache Flink - gordonmurray/apache_fluss_flink_and_paimon
github.com
September 5, 2025 at 5:30 PM
I’ve refreshed a small hobby project from ~2 years ago: Apache Flink & Paimon.
It picked up a few early stars back then. I’ve cleaned it up so it’s runnable again.
I’m hoping to experiment with Apache Fluss on top of this. Code on Github github.com/gordonmurray...
GitHub - gordonmurray/apache_flink_and_paimon: Trying out Apache Paimon with Apache Flink using Docker Compose
Trying out Apache Paimon with Apache Flink using Docker Compose - gordonmurray/apache_flink_and_paimon
github.com
September 4, 2025 at 7:02 PM
I took some time this weekend to learn TLA+. It lets you test distributed systems by exploring all possible states. Meaning you can catch bugs before production and likely save costs. I built a small FastAPI + MinIO demo to try it out. Repo: github.com/gordonmurray...
GitHub - gordonmurray/fastapi-temporal-logic-of-actions: A small introduction to verification using TLA+ through a file upload idempotency problem
A small introduction to verification using TLA+ through a file upload idempotency problem - gordonmurray/fastapi-temporal-logic-of-actions
github.com
August 17, 2025 at 5:56 PM
I tried to build a minimal image vector store using Lance and Cloudflare R2. I Learned a lot, hit a hard limit fast.

"When Lance hits a wall at 70 images"

gordonmurray.com/data/2025/05...
When Lance Hits the Wall at 70 Images on Cloudflare R2 | Gordon Murray
First impressions of using Lance with Python were excellent. It took almost no code at all to upload images, vectorize them, save the data in Lance format in...
gordonmurray.com
May 14, 2025 at 12:56 PM
Trying out Lance for storing vector data on R2. I’ve got Python saving the image and its CLIP embedding to a Lance dataset on R2. It’s append-only, columnar format kind of like Parquet, but optimized for updates, metadata, and search. Compared to Iceberg, feels like a better fit for vector mappings
QuickVectors - Image Vectorization
quickvectors.com
May 9, 2025 at 7:54 PM
I made a thing: upload an image, get back a vector embedding. That’s it.

Built with Python, FastAPI, and OpenAI’s CLIP model (ViT-B/32). Deployed on Fly.io.

quickvectors.com

Want to try it out? DM me for an API key.
May 8, 2025 at 10:46 PM
Built a small DAM-style pipeline to handle image uploads, async processing, and ML enrichment. No DB, just new tools I’ve been meaning to try (fly.io, R2, Parquet). Wrote a post about what I’ve built so far and what’s next gordonmurray.com/2025/05/05/B...
Building A Digital Asset Management Project With Different Tools | Gordon Murray
Building a Digital Asset Management (DAM) project with different tools
gordonmurray.com
May 5, 2025 at 8:57 PM
Built a Python pipeline to upload images to R2 and generate captions + vectors using BLIP and CLIP from Hugging Face. Metadata stored as Parquet. Code on GitHub: github.com/gordonmurray...
GitHub - gordonmurray/dam-pipeline-bento: A decoupled processing pipeline for a DAM using FastAPI, RabbitMQ, Cloudflare R2, and BentoML + CLIP for image vectorization
A decoupled processing pipeline for a DAM using FastAPI, RabbitMQ, Cloudflare R2, and BentoML + CLIP for image vectorization - gordonmurray/dam-pipeline-bento
github.com
May 2, 2025 at 10:42 PM
Trying out Fly.io and Cloudflare R2 - long overdue on my part. Made a small Python app to receive an image, store it in R2 with Parquet metadata, and query via DuckDB+SQL.

Very basic, but next I want to try R2's data catalog / managed Apache Iceberg.

Code on github github.com/gordonmurray...
April 30, 2025 at 9:23 PM
I created a small Terraform project that will set up Deepseek R1 for you on your own GPU EC2 instance, seems to be uncensored too. Code on Github at github.com/gordonmurray...

I took the steps from this great AWS Community guide community.aws/content/2sEu...
GitHub - gordonmurray/terraform_aws_deepseek: A terraform project to create an EC2 instance running Deepseek R1
A terraform project to create an EC2 instance running Deepseek R1 - gordonmurray/terraform_aws_deepseek
github.com
January 29, 2025 at 10:36 PM
Updated my BentoML project to include Milvus for storing and searching vectorized images. More to learn about L2 Euclidean distance for better search results
github.com/gordonmurray...
GitHub - gordonmurray/bentoml-image-vectorization: This project offers an API for vectorizing images using OpenAI's CLIP model via BentoML and integrates Milvus for efficient vector storage and search
This project offers an API for vectorizing images using OpenAI's CLIP model via BentoML and integrates Milvus for efficient vector storage and search - gordonmurray/bentoml-image-vectorization
github.com
December 16, 2024 at 10:54 PM
Just finished a small project on deploying BentoML on ECS using Terraform. Perfect for creating vectors from images with a CLIP model I’ve been learning. If you’re getting in to machine learning I'd recommend BentoML, hopefully this is a useful starting point!
github.com/gordonmurray...
December 14, 2024 at 9:36 PM
Its surprising how many requests and countries try an API endpoint that has only just come online in the last few minutes, using a brand new domain name just registered too.
December 13, 2024 at 7:21 PM
Tried out BentoML this evening to vectorize images, and it works really well! I expected to need a GPU, but it runs smoothly on a CPU. It even exposes Prometheus metrics out of the box. Code on GitHub: github.com/gordonmurray...
GitHub - gordonmurray/bentoml: This project provides an API for vectorizing images using OpenAI's CLIP model, packaged and served with BentoML. It allows you to send images to the API and receive a ve...
This project provides an API for vectorizing images using OpenAI's CLIP model, packaged and served with BentoML. It allows you to send images to the API and receive a vector representation that...
github.com
December 10, 2024 at 11:01 PM
Ah this is great, I didn’t know this was available. The DDB approach was fine but this would be nicer
medium.com/@manuchandra...

One really nice change announced recently in Terraform is support for using S3 buckets to lock your state files (and store them) instead of requiring both an S3 bucket AND a DynamoDB table. Below is a step by step guide to migrate to using this new approach. (1/4)

🧵
Managing Terraform State for AWS workloads ( Experimental)
When working with Terraform in a team environment or production setup, it’s crucial to store the state file remotely and implement state…
medium.com
December 10, 2024 at 8:29 AM
Reposted by Gordon
Re:Invent hasn't even officially started yet and we've already got 22 announcements. No rest on Re:Invent Sunday, I guess. To help you keep track, I will curate a number of custom feeds. The first one: All Re:Invent 2024 Releases

Check it out below 👇
The AWS News Feed
Keep track of AWS news and blogs. See what's hot, read summaries, search, set bookmarks and ratings, and categorize articles with tags.
buff.ly
December 1, 2024 at 10:52 PM
The “show vehicle motion cues” option on iOS really works! For me at least. I can read away as a passenger and not feel queasy. Really cool 📖
November 30, 2024 at 4:34 PM
Reposted by Gordon
If you’re thinking without writing, you only think you’re thinking.—Leslie Lamport
Writing is the Thinking
Remember: [[Writing]] is the thinking. The more you write, the more notes you take, and the better of a thinker you become. You’ll read better with better questions in your mind. Without writing, ther...
www.ssp.sh
November 30, 2024 at 3:44 PM
Oh nice. I didn’t know that. Thanks! I have some repos to go and update
Show-casing JSON snippets in a README file on GitHub? Then consider marking them as json5, so you can have proper in-line comments.

json5.org
November 29, 2024 at 8:14 PM
Today is as good day to start learning Terraform
dev.to/bennyfmo_237...

Using Infrastructure as Code tools like Terraform is good practice and seeing some examples can always help.

On AWS setting up basic infrastructure can be tedious when doing it via the console or CLI but using example configs and IaC can usually speed up the process. (1/2)
🧵
Deploying Basic Infrastructure on AWS with Terraform
Introduction In today’s cloud-driven world, managing infrastructure efficiently and...
dev.to
November 28, 2024 at 9:22 AM
Reposted by Gordon
Hmm 🤔 Maybe I just created the AWS News Firehose on Bluesky. Maybe it's hosted at https://buff.ly/4g3kQsK 🦋 And maybe it's related to Bluesky's recent posts-as-comments demo... 🤷‍♀️ Who knows!
November 26, 2024 at 10:47 PM