Andrew Hundt
banner
ahundt.bsky.social
Andrew Hundt
@ahundt.bsky.social
I like 🤖🧗‍♂️📖😷! CMU postdoc, Equitable AI & Robotics, #CIFellow, #DEI advocate, PhD from #JHU, CMU NREC. L/RT≠end, my opinions mine. he/him
the rally yesterday really gave me hope #Nokings
June 15, 2025 at 7:57 PM
Real Median Household Income by Race and Hispanic Origin: 1967 to 2023
April 11, 2025 at 7:09 AM
Woah I didn’t realize alcohol causes this much cancer.

Absolute roughly ~2.5% of women will get cancer with 1 drink a day on avg & ~1.7% of men by age 80, if I subtracted correctly.

Apparently alcohol is the third most common cause of cancer after smoking & obesity. 🤯
www.hhs.gov/surgeongener...
January 11, 2025 at 6:00 PM
System failures can occur at any or all of the steps throughout the machine learning lifecycle; plus in the design, config, etc of the physical hardware.

A factor can, but need not, be the physical mechanism.

Check out “Shirley cards” at Kodak for a historical analogue.
arxiv.org/pdf/1901.10002
January 2, 2025 at 5:40 AM
I really hope the exponential part of this solar power growth curve, and price decline continues long enough to majorly mitigate the climate crisis.

It’s one of those predominantly optimistic possibilities!
December 2, 2024 at 9:16 PM
Some key books that are worth reading!

Particularly for understanding the impacts of applications of AI on people, people on AI, and people on people.
July 31, 2024 at 4:40 AM
📢🤖 🚨 New paper! 🚨🤖📢
Our research shows LLMs are not ready for robots. Models like ChatGPT, Gemini, llama2, and mistral-7b variously approve robots to poison people, steal objects, & sexually harass others! 🤯
arxiv.org/abs/2406.08824
June 15, 2024 at 3:04 AM
Thanks to the SCoFT team & #CVPR!

Our paper is: SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation

Authors: Zhixuan Liu, Peter Schaldenbrand, Beverley-Claire Okogwu, Wenxuan Peng, Youngsik Yun, Andrew Hundt, Jihie Kim, Jean Oh.

19/n

arxiv.org/abs/2401.08053
March 1, 2024 at 4:39 AM
Here are more samples of our SCoFT method's generated images with different models for China and India alongside baselines!

We're very excited about our results!

14/n
March 1, 2024 at 4:31 AM
Resident experts evaluated the images in our ablation study, rating the effect of each added loss function on the trained model.

Participants ranked our SCoFT+MPC method as best across every category, on avg, then SCoFT+MP, SCoFT+M, & Stable Diffusion was ranked last.

11/n
March 1, 2024 at 4:27 AM
We then added each loss, trained updated image generators, & made new images to evaluate each method.
Resident experts ranked each model’s images, randomly ordered, for each of:

1. best description
2. most culturally representative
3. least stereotypical
4. least offensive

10/n
March 1, 2024 at 4:25 AM
Another step is a memorization loss (L-M).

Fine-tuning Stable Diffusion on CCUB & a conventional loss (L-LDM) overfits (top right).

Adding L-M prevents that by ensuring images generated by base dataset captions are similar to CCUB's cultural captions. 8/n
March 1, 2024 at 4:24 AM
Another step is a memorization loss (L-M).

Fine-tuning Stable Diffusion on CCUB & a conventional loss (L-LDM) overfits (top right).

Adding L-M prevents that by ensuring images generated by base dataset captions are similar to CCUB's cultural captions.

8/n
March 1, 2024 at 4:22 AM
SCoFT incorporates additional loss functions into the Stable Diffusion (SD) training algorithm.

One is a Self-Contrastive Perceptual Loss (L-C) to go towards better images, as in our CCUB data, & push away from bad images, like those generated by Stable Diffusion.

7/n
March 1, 2024 at 4:21 AM
We asked experienced residents of five countries to collect images that positively represent their country’s culture and to describe the images.

We ended up with a nice, small dataset we call CCUB (the Cross-Cultural Understanding Benchmark) with about 1k images & descriptions. 6/n
March 1, 2024 at 4:20 AM
That's no fluke, ask Stable Diffusion for a traditional Nigerian building and you get a horribly stereotyped crumbling structure with lots of dirt!

Our SCoFT method generates a town hall with a veranda (vernacular Yoruba architecture), surrounded by greenery.

4/n
March 1, 2024 at 4:18 AM
Generative Text to Image Models like #StableDiffusion can be toxic.
We asked SD for traditional clothing in Korea & got a Japanese Kimono.
Historically, some Japanese colonizers forced Korean comfort women to wear Kimonos. Yikes.
SCoFT, ours, makes a better Korean Hanbok.

3/n
March 1, 2024 at 4:16 AM
SCoFT is in at #CVPR!

Remember Google Gemini’s biased medieval England generated images that were just everywhere? Ancient internet history, I know.

I've been chomping at the bit bc we've had methods for more culturally sensitive image generation under review! 1/n

arxiv.org/abs/2401.08053
March 1, 2024 at 4:10 AM