Georg Heiler
geoheil.com
Georg Heiler
@geoheil.com
building socio-technical complex systems with data | geoheil.com
Pinned
@milicevica23.bsky.social and I recently gave a talk how we scale #data #pipelinese for Telekom georgheiler.com/event/magent...
Scaling data pipelines @Telekom | Georg Heiler
Tackling data challenges via the orchestrator.
georgheiler.com
A great video about LLMs and the data they can provide to the world - even though perhaps they should not | www.youtube.com/watch?v=O7BI... - DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh
DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh
This talk explores the hidden risks in apps leveraging modern AI systems—especially those using large language models (LLMs) and retrieval-augmented generation (RAG) workflows. We demonstrate how…
www.youtube.com
November 6, 2025 at 3:05 PM
Reposted by Georg Heiler
The real AI win isn't superhuman agents, it's scaled mediocrity.
Doing less with less at massive scale unlocks tasks that were once uneconomical.
The magic is in aggregate value, not perfect outputs. Empower teams with practical AI tools. 
🔗 https://dlthub.com/blog/the-real-ai-win-scaled-mediocrity
October 17, 2025 at 12:39 PM
#dsc-dach #data it was a. pleasure to share an introductory workshop about spark and data pipelines. Thank you Aleks for the great collaboration!

Find the workshop files here if you want to follow along github.com/l-mds/dsc-da...
GitHub - l-mds/dsc-dach-tutorial-dagster: Introduction to using and scaling dagster
Introduction to using and scaling dagster. Contribute to l-mds/dsc-dach-tutorial-dagster development by creating an account on GitHub.
github.com
October 14, 2025 at 2:00 PM
Something about super and computing in the making anyone daring out there who wants to explore? Or folks who want to exchange ideas about SLURM, HET jobs and advanced resource management? github.com/ascii-supply...
feat: build SLURM integration for dagster by HPicatto · Pull Request #19 · ascii-supply-networks/dagster-slurm
Type of Change feat: New feature fix: Bug fix docs: Documentation style: Code style refactor: Code refactor perf: Performance improvement test: Tests chore: Maintenance Description adds ...
github.com
October 7, 2025 at 2:05 PM
Simple Sovereign Scalable Data Stack georgheiler.com/event/tdwi-2... precursor: pypi.org/project/dags... github.com/dagster-io/c... if you want to see this in action join in Nürnberg or Vienna for some sovereign, scalable data talks in the coming weeks
Simple Sovereign Scalable Data Stack | Georg Heiler
Tired of cloud lock-in and surprise bills? This talk shows how to build a fast, portable analytics stack around DuckDB and Dagster. Along the way of our journey to sovereignty and scale we touch on…
georgheiler.com
October 7, 2025 at 7:02 AM
Reposted by Georg Heiler
📈 DuckDB 1.4.0 is out! This is our first LTS release which comes with *one year of community support*. It also supports database encryption, the MERGE SQL statement and Iceberg writes.

For more details, read the announcement blog post at
duckdb.org/2025/09/16/a...
September 16, 2025 at 11:55 AM
A living Elo leaderboard for analytics/OLAP engines. Public benchmarks (TPC-DS/H, SSB, vendor & community posts) becomes a “match.” Upsets + context matter. Browse the board & poke holes: rebrand.ly/ey6y7hf
Home | Data inconsistencies
Data inconsistencies, architecuture and real world stories
data-inconsistencies.datajourney.expert
September 2, 2025 at 7:02 AM
Together with www.linkedin.com/in/aaron-cul... I have created a template for making #LLMs (from different vendors and even self hosted ones) easily accessible to researchers - including advanced document RAG with #docling. github.com/complexity-s...
GitHub - complexity-science-hub/llm-in-a-box-template: Template to use genai for chatting and via api to accelerate research
Template to use genai for chatting and via api to accelerate research - complexity-science-hub/llm-in-a-box-template
github.com
July 20, 2025 at 8:21 PM
Reposted by Georg Heiler
People say “your imagination is the limit” to mean the possibilities are limitless, but I believe that for many people the phrase is more literally true: they really are limited by a lack of imagination more than anything else
July 7, 2025 at 12:38 PM
#state of #cyber for #germany www.heise.de/news/Bundesr... quite sad to read that emergency power supplies are not there for a large quantity of the data centers
Sogar Notstrom fehlt: Schlechte Sicherheitstandards in Rechenzentren des Bundes
Ein Bericht des Bundesrechnungshofs wirft kein gutes Licht auf die Sicherheit der IT des Bundes. Nur ein Bruchteil der Rechenzentren erreiche Mindeststandards.
www.heise.de
July 5, 2025 at 10:41 AM
nice to see DE tooling adapted in other verticals - github.com/michimussato... here for visualization rendering
June 24, 2025 at 7:41 AM
Reposted by Georg Heiler
I particularly appreciated principle 3: Agent actions and planning must be observable
June 15, 2025 at 5:37 AM
Reposted by Georg Heiler
Congratulations @lambda.bsky.social! Today @theguardian.com is launching a new way for whistleblowers to anonymously contact journalists, based on years-long research by Daniel and other colleagues. www.theguardian.com/gnm-press-of...
The Guardian launches Secure Messaging, a world-first from a media organisation, in collaboration with the University of Cambridge
Secure Messaging is a new innovation for confidential story-sharing and source protection, underpinning the Guardian’s commitment to investigative journalism. The Guardian has published the open sourc...
www.theguardian.com
June 9, 2025 at 12:29 PM