Moritz Schwarz
banner
moritz-schwarz.bsky.social
Moritz Schwarz
@moritz-schwarz.bsky.social
Econ PhD Candidate @ University of Vienna
Empirical Industrial Organization | Competition Economics
📍Vienna, Austria 🇪🇺🇦🇹 🔗 https://www.moritz-schwarz.com/
Interesting, can you recommend a resource on this?
December 20, 2023 at 11:51 AM
While stata/r/python will get slower in these things the bigger the data, power BI scales very well.
Try loading a 3 GB csv with 50 variables, some string, into stata/python/r. dta file will be 500 MB, power BI file will be 50 MB. When I had 100GB once, power BI file was 100 MB.
December 20, 2023 at 10:11 AM
What takes indeed time is changing the data import settings (because they need to rerun their compression algorithm?).

But once the data set(s) are compressed it is comparatively fast to do any analysis and browse data.
December 20, 2023 at 10:07 AM
I found it useful for exploring these bigger data sets and presenting intermediate results, but maintained the power BI files separately from the main pipeline that produced the final results. That worked.
December 20, 2023 at 9:58 AM
Indeed ugly and not built for research (so many useless buttons🫨).
It‘s free, though, and compression is insane. Easily handles data sets >>10GB, e.g., transaction level data – even on a standard local machine.
December 20, 2023 at 9:46 AM
(*fiddles)

It can, although it turned out it does not like`titlesec` (github.com/quarto-dev/q...)

This was the resulting pdf:
github.com/moritz-schwa...
I only spot some differences in the kerning.

Congrats to the 4 additional cites since the last compilation :)
November 6, 2023 at 4:14 PM
What I was looking for! Probably runs in quarto too (with minimal adjustments)?
November 5, 2023 at 1:04 PM