Pandora
banner
pandorai1995.bsky.social
Pandora
@pandorai1995.bsky.social
Data analytics. AI researcher (LLM/VLM). Communication and social media studies. Art historian.
Also an attempt at generating fakes Van Gogh's The Starry Night (for better or for worse) ;)
October 31, 2024 at 4:31 PM
Once again OCR misinterpretations ahead...
October 31, 2024 at 4:30 PM
Very interesting results, for example for Raphael's The School of Athens' description - especially when compared with the results obtained with Florence-2-base and Qwen2-VL-2B.
October 31, 2024 at 4:29 PM
Just posted on @huggingface.bsky.social : a VLM Visual Arts analysis with DeepSeek Janus-1.3B.
October 31, 2024 at 4:29 PM
Very interesting results once again especially when compared with those obtained with Florence-2 and Qwen2-VL
October 27, 2024 at 8:10 PM
Sequel to OCR processing and text in images analysis: this time with new recently released model DeepSeek Janus-1.3B (still on @huggingface.bsky.social of course)
October 27, 2024 at 8:09 PM
Very interesting results both for HTR and printed content.
October 27, 2024 at 8:06 PM
Spoiler alert: Qwen2-VL-2B reveals itself as a Jane Austen fan…
October 27, 2024 at 8:05 PM
Once more with Florence-2-base and Qwen-VL-2B.
October 27, 2024 at 8:04 PM
VLM analysis part 2, as always on @huggingface.bsky.social : this time it’s based on textual content in images whether handwritten, typed/printed, or even from an illuminated medieval manuscript.
October 27, 2024 at 8:02 PM
Comparative analysis of the results obtained when analyzing a large number of different artworks (from the Medieval era to Abstract Contemporary art).

It’s clear the models are more in favor of figurative art than Kandinskys - when detecting objects especially.
October 27, 2024 at 7:59 PM
First of my VLM study series (posted on @huggingface.bsky.social) :

An in-depth analysis of artworks with Microsoft Florence-2-base and Alibaba Cloud Qwen2-VL-2B
October 27, 2024 at 7:57 PM