Dax Kellie
banner
daxkellie.bsky.social
Dax Kellie
@daxkellie.bsky.social
Data Analyst & Science Lead at the Atlas of Living Australia | Evolutionary biologist & social psychologist (PhD) 🧪 | #rstats 📊 | Music enthusiast 🎵

www.daxkellie.com

Opinions are my own, and they do not express those of my employer
Want to see what {galaxias} can do? 🤔

Check out slides from a talk we presented on galaxias this week at the Living Data conference:
martinwestgate.com/presentation...

Or, check out this intro video about galaxias:
www.youtube.com/watch?v=kO4-...

#rstats 🌏🧪🐟
October 23, 2025 at 2:55 AM
Once your data and metadata are ready, just run `build_archive()`, which will automatically build a schema file (meta.xml), zip and save your Darwin Core Archive to the parent directory.

You are now ready to share your ecological data with the world!
October 23, 2025 at 2:41 AM
galaxias handles file conversion and file management when preparing a Darwin Core Archive

Want to use your standardised data? Run `use_data()`

Want to add your completed metadata? Run `use_metadata()`

galaxias will convert and save them in the right place
October 23, 2025 at 2:41 AM
Write metadata in markdown. Use galaxias to convert a completed markdown file to Ecological Metadata Language (EML) (also shown in the next skeet)
October 23, 2025 at 2:41 AM
Specify, edit or modify columns to match accepted standard names with `set_` functions. These are basically bespoke wrappers around `dplyr::mutate()`, but with names and arguments that make it easier to find the information you need
October 23, 2025 at 2:41 AM
It can be hard knowing where to start standardising your data. To help, we can run `suggest_workflow()` which tells us:

- Which column names match valid terms
- What to do to meet minimum requirements
- A possible workflow to fix up our data
October 23, 2025 at 2:41 AM
To share ecological data with open data infrastructures, data must be converted to a Darwin Core Archive.

Machines are excellent at reading this format, but converting data to use this format can be VERY tricky (it requires file types most people have never worked with before)
October 23, 2025 at 2:41 AM
🚨Our new package {galaxias} is released in R & Python today! 🚨

📦 galaxias makes it easy to standardise data to Darwin Core, the accepted format for sharing ecological data with infrastructures like @gbif.org and the Atlas of Living Australia

galaxias.ala.org.au

#rstats #python 🧪🌏🐟

A thread 🧵👇
October 23, 2025 at 2:41 AM
🎉Congrats to 2025's Australian Bird of the Year, the Tawny Frogmouth (everyone's favourite tree-stump-that's-not-actually-a-tree-stump)! 🦉

Here's a map of where they've been recorded. Take a photo of the next tree stump you see, it might be a bird of the year!😉

🌏🧪📊 #rstats @birdlifeoz.bsky.social
October 17, 2025 at 6:23 AM
And of course a big benefit of joining a group like SORTEE is meeting other like-minded people who care about robust and transparent science. Many are on Bluesky!

If you’re a member and want to be added, send me a DM 😀 🧪🌏

go.bsky.app/44PpngU
October 15, 2025 at 12:26 PM
Still want to make cleaning biodiversity data shrimp-ler? 🦐

Good news: We just updated our Cleaning Biodiversity Data in R book, so you still can! We've updated data for 2025, added new content & fixed lots of silly typos 😀

Live the shrimp-le life:
cleaning-data-r.ala.org.au

#rstats #ecology 🧪🌏
October 15, 2025 at 4:12 AM
🔍 Need to search for species in an area with a buffer? 🔵

Learn how to add a buffer in Python and see how to consider threatened species with obfuscated locations in a new ALA Labs post by Amanda Buyan & me 😀

labs.ala.org.au/posts/2025-0...

🧪🌏 #Python #matplotlib #geopandas #geospatial #quartopub
August 27, 2025 at 12:44 AM
Need to make a species list for an area? 📋🌱

Learn how to download a list of species, cross-reference with conservation status lists, and visualise with {ggplot2} in a new ALA Labs post by me & Amanda Buyan

🔗 labs.ala.org.au/posts/2025-0...

#rstats 🧪🌏📊
July 29, 2025 at 12:14 AM
Apparently I like this colour palette. Am I too predictable lol
#rstats 📊
July 24, 2025 at 7:14 AM
Nautiluses are so cool. They just seem so ancient! 🐙🦑

Like other cephalopods they have tentacles, but not just 8 or 10—they have 47 pairs that lack any suckers or hooks 🤨 They also use their chambered shells to control buoyancy

This post was inspired by my friend Omanyte I picked up in Tokyo
🧪🌏
July 16, 2025 at 7:25 AM
It's #WorldRainforestDay! 🌴

Australia's Gondwana Rainforests hold examples of major evolutionary & geological stages dating ~359 mya, including the largest surviving population of Araucarians (the most phylogenetically primitive species of conifer) 👴

Code & high-res 📷 👇

#rstats #dataviz 📊🧪🌏🌳
June 22, 2025 at 12:32 AM
Adding `all_fields = TRUE` to `show_values()` adds any columns from the original list to your query. This generally includes taxonomic and conservation status information.

It's still experimental, but feel free to give it a go! 😀

#rstats
June 13, 2025 at 2:27 AM
🚨galah 2.1.2 is on CRAN! 📦

This version fixes a major bug that prevented queries with more complex filters from returning the correct result. It also adds a way to get species conservation status from authoritative lists

🔗 Details: galah.ala.org.au/news/index.h...

#rstats @rowdynerd.bsky.social 🌏
June 13, 2025 at 2:27 AM
Another beautiful dataviz 🌸🤩 Seasonality of cherry blossoms in Kyoto, Japan

By Shreya Arya, longlisted for the 2024 Information Is Beautiful Awards

www.informationisbeautifulawards.com/showcase/728...

#dataviz 🌱🌏🧪📊🗻🌸
May 22, 2025 at 8:18 AM
Australia's 2019-20 bushfires burnt an area the size of Washington State 🔥 How did this impact species like Southern greater gliders?

Learn how to use {tidymodels} & {tidysdm} to explore this question in a new ALA Labs post by intern Jarod Wright & me

labs.ala.org.au/posts/2025-0...

#rstats 🧪🌏🦘🌳
April 10, 2025 at 12:24 AM
This International Day of Forests, I made a forest using Eucalyptus data 😀🌳

Eucalypt forests form three quarters of Australia's native forest area. They're also the tallest flowering plant in the world!

Code + hi-def 📸 👇

#rstats #dataviz 🧪📊🌏🌲
March 21, 2025 at 4:13 AM
This dataviz is absolutely amazing 😍 Global bird species' conservation status displayed as a massive flock!

By Andrea Garrec, longlisted for the 2024 Information Is Beautiful Awards

www.informationisbeautifulawards.com/showcase/731...

#dataviz 📊🌏🧪🐦🦉🦆🦅
March 19, 2025 at 2:40 AM
But I’m not a car, so I was happy about this 😅
March 3, 2025 at 4:33 AM
It snowed *so* much in Japan while we were there that cars were at risk of being absorbed into the earth after only 2 days of snowfall
☃️🌨️
March 3, 2025 at 4:30 AM
I’ve been in Japan the last 2 weeks! It’s beautiful and I can’t wait to go back 🇯🇵🍜🍣🏯
March 3, 2025 at 4:19 AM