blef.fr
@blef.fr
I have launched Excel once
DuckDB team announced 2025 will be the year of the LakeHouse focus 🥹
January 31, 2025 at 2:17 PM
Reposted
DuckCon #6 is a few hours away, at Pakhuis De Zwijger or live on the @duckdb.org YouTube channel.

duckdb.org/events/2025/...

The speakers are impressive. Looking forward to hearing from them, and to the chance to see a bunch of GitHub/Discord handles in person and talk DuckDB a bunch.

I will be around, come say quack!
January 31, 2025 at 9:19 AM
Reposted
Extraordinarily detailed article here by @mliebreich.bsky.social - if you're interested at all in the energy impact of AI data center buildouts I recommend spending some time with this, I learned a ton from it
January 12, 2025 at 12:07 AM
Reposted
Here's my end-of-year review of things we learned about LLMs in 2024 - we learned a LOT of things simonwillison.net/2024/Dec/31/...

Table of contents:
December 31, 2024 at 6:10 PM
Hannes' keynote at the Forward Data Conference. I really enjoyed his talk, this is exactly the kind of content I organised the conf for 🔥
duckdb.org DuckDB @duckdb.org · Dec 18
📺 We added two new items to our media page. Both were recorded last month at the Forward Data Conference in Paris.

– Hannes' keynote talk “Changing Large Tables”: duckdb.org/media/changi...

– An episode of The Joe Reis Show, touching on the challenges of lakehouses: duckdb.org/media/duckdb...
December 18, 2024 at 8:17 PM
Reposted
DEW published its yearly State of Data Engineering: Key Insights and Trends. This single edition summarizes the key patterns of all DEW editions this year. Later this week, we will follow up with DEW's data engineering predictions for 2025 and beyond.

www.dataengineeringw...
The State of Data Engineering in 2024: Key Insights and Trends
A Look Back at the Year's Defining Patterns in Data Engineering
www.dataengineeringweekly.com
December 16, 2024 at 5:40 PM
Reposted
I can now run a GPT-4 class model on my laptop

(The exact same laptop that could just about run a GPT-3 class model 20 months ago)

The new Llama 3.3 70B is a striking example of the huge efficiency gains we've seen in the last two years
simonwillison.net/2024/Dec/9/l...
I can now run a GPT-4 class model on my laptop
Meta’s new Llama 3.3 70B is a genuinely GPT-4 class Large Language Model that runs on my laptop. Just 20 months ago I was amazed to see something that felt …
simonwillison.net
December 9, 2024 at 3:19 PM
Reposted
Foursquare places data is live in the hive 🐝 🍯

@hachej.bsky.social @seifert.blue
November 30, 2024 at 12:04 AM
why are we so addicted to 3 letters when it comes to naming stuff?

This week it's MCP
December 1, 2024 at 6:50 PM
joined STATION F, now I'm a founder (lol)
November 28, 2024 at 5:43 PM
god i feel empty, no conference to organise anymore 🥹

yesterday was a crazy day after 6 months of work for the Forward Data Conference, so happy with the feedback and how it went
November 26, 2024 at 11:17 AM
Reposted
Salut Paris!

Forward Data Conference is amazing.
November 25, 2024 at 11:16 AM
Awesome Keynote by @hannes.muehleisen.org about Changing Large Tables at the Forward Data Conference — Iceberg the Duck is coming for you
November 25, 2024 at 11:12 AM
More than a year ago I dreamt about organizing an in-person conference about data, for data practitioners in France.

Tomorrow 350 people will come to attend the first Forward Data in Paris with awesome guests like @hannes.muehleisen.org, @joereis.bsky.social or @jayatillake.bsky.social 🥹
November 24, 2024 at 10:10 AM
Reposted
i see the future and it's beautiful
November 22, 2024 at 3:06 AM
why did we pick #dataBS as the tag name? every time, my brain reads "data bullshit"
November 21, 2024 at 11:18 AM
As a data team, should we have a KPI for the coverage of data assets that are governed with code?
November 19, 2024 at 8:55 AM
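One way such a KPI could be computed, as a minimal sketch. The `DataAsset` type, the `governed_with_code` flag, and the `governance_coverage` function are all hypothetical names, and what counts as "governed with code" (for example, being declared in a versioned repository) is an assumption each team would need to define for itself.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class DataAsset:
    """A table, view, dashboard, or similar asset (hypothetical model)."""
    name: str
    # Assumption: True when the asset is declared in versioned code.
    governed_with_code: bool


def governance_coverage(assets: Sequence[DataAsset]) -> float:
    """Return the fraction of assets governed with code, between 0 and 1."""
    if not assets:
        # No assets to measure: report 0 rather than dividing by zero.
        return 0.0
    governed = sum(1 for asset in assets if asset.governed_with_code)
    return governed / len(assets)
```

Tracking this ratio over time, rather than as a one-off number, would show whether new assets are being created inside or outside the code-governed path.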
Oh @jakthom.bsky.social did it!

bluesky data lands on R2, with ~5m latency, partitioned per hour
I like this. I like this a lot.

tl;dr:

full @bsky.app jetstream feed

landing in @cloudflare.social R2 (and available to you!)

accessible using two lines of @duckdb.org sql
November 17, 2024 at 7:34 PM
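A rough sketch of what querying one hourly partition could look like. The post gives neither the bucket location, the folder layout, nor the file format, so `BASE_URL`, the `date=`/`hour=` layout, and the JSON format below are placeholders, not the real dataset. The code only builds the SQL string; running it would additionally need DuckDB with remote file access and credentials configured.

```python
from datetime import datetime, timezone

# Placeholder location: the real bucket is not given in the post.
BASE_URL = "s3://example-bucket/jetstream"


def hourly_partition_glob(ts: datetime, base_url: str = BASE_URL) -> str:
    """Build a glob for the files of the hour containing `ts` (UTC).

    The date=/hour= folder layout is an assumption for illustration.
    """
    ts = ts.astimezone(timezone.utc)
    return f"{base_url}/date={ts:%Y-%m-%d}/hour={ts:%H}/*.json"


def count_events_sql(ts: datetime, base_url: str = BASE_URL) -> str:
    """Return a DuckDB query counting the events in one hourly partition.

    Assumes JSON files, read with DuckDB's read_json_auto table function.
    """
    path = hourly_partition_glob(ts, base_url)
    return f"SELECT count(*) AS events FROM read_json_auto('{path}');"
```

Pass a timezone-aware `datetime` so the partition hour is unambiguous; with per-hour partitioning, a query only has to touch the files for the hours it cares about.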
Everyone is playing with the Bluesky firehose right now. Is anyone dropping it into an S3 bucket somewhere (or into a BigQuery public dataset)?

I started to work on this, but got lost in all the experiments people are doing
November 17, 2024 at 4:05 PM
Reposted
Visualization of the network: a night sky where each star is someone posting ✨
hctr.dev Hector @hctr.dev · Nov 16
I made a thing!

I was playing around with the AT protocol and as a little experiment I made a website that visualises activity around Bluesky: nightsky.hctr.dev

It listens to all new posts and shows them as little stars across a night sky 🌃

Every star is someone, somewhere, posting something
Nightsky | hctr.dev
See live conversations from all over Bluesky as a dynamic night sky
nightsky.hctr.dev
November 17, 2024 at 5:30 AM
It will be possible to orchestrate Airflow DAGs directly from BigQuery.

Orchestrating means: viewing and triggering.

Slowly but surely, Google is turning the BigQuery tab into "BigQuery Studio", an all-in-one data platform with extras.

cloud.google.com/bigquery/doc...
Orchestrate Airflow DAGs  |  BigQuery  |  Google Cloud
cloud.google.com
November 17, 2024 at 6:53 AM
How Canva closely monitors its Snowflake costs: 25PB of storage and 10k dbt models 🙃

www.canva.dev/blog/enginee...
Our journey to Snowflake monitoring mastery - Canva Engineering Blog
How I learned to stop worrying and love metadata.
www.canva.dev
November 16, 2024 at 6:43 AM
Please just stop saying "just".

Little by little, we have all added a few words to our daily discussions, like "just", "simple", "clean", etc.

I loved this blog post about the word "just", which should be avoided as much as possible.

sgringwe.com/2019/10/10/P...
Please just stop saying "just"
Do you work in Software Engineering, and have you seen messages or sentences like these before?
sgringwe.com
November 15, 2024 at 8:45 AM
Snowflake Notebooks are available in GA — still trying to bridge the feature gap with Databricks.

It's interesting to see that they released a lot of ipynb Notebooks in the open: github.com/Snowflake-La....

With Iceberg and interoperability everywhere, will fewer differences mean a single winner?
November 14, 2024 at 1:08 PM
Cursor users using it with dbt, how do you feel about it?
November 12, 2024 at 7:43 AM