michael bommarito
mjbommar.bsky.social
michael bommarito
@mjbommar.bsky.social
to me, it's clear that we need to revisit the web *at the protocol level* to implement preference signalling. this preference signalling then needs to carry some economic and/or statutory force. $BAT, DMCA/WIPO-like action, etc. we need to experiment and see what works.
November 28, 2024 at 4:22 PM
releasing this kind of stuff helped me academically and commercially, and arguably helped proto-open source intelligence groups. i empathize with the huggingface researchers. but it's hard to see how relying on individual ethical choices will work at scale...
November 28, 2024 at 4:19 PM
the ultimate question, of course, is whether broader access to this technology is good. ignoring whose definition of "good" we use, it's clear that the risk of a future with extremely concentrated technical power is decreasing. in that sense, even if by mistake, elon has achieve the 2017 goal...
November 25, 2024 at 3:31 PM
now, like every dual-use technology, these implications apply across not just to nation-state interactions, but also to interactions between firms, citizens and institutions, etc.
November 25, 2024 at 3:29 PM
does this mean that related resource-based conflicts (e.g., lithium or taiwan fabs) or nation-state-coordinated infrastructure projects (e.g., nuclear) are less likely? maybe. but it also means that the landscape of long-term threat actors will be very different.
November 25, 2024 at 3:27 PM
ignoring the inevitable tit-for-tat dynamics that such investment controls (or tariffs, M&A approval boards, etc.) create, the point of modded-nanogpt is that it simply doesn't matter. these (hybrid) weapon systems are here and will continue to proliferate.
November 25, 2024 at 3:24 PM
the treasury only just 10 days ago published the Final Rule on Outbound Investment in Critical Technologies, which was already at risk because Executive Orders are obviously a terrible way to make law... www.federalregister.gov/documents/20...
Provisions Pertaining to U.S. Investments in Certain National Security Technologies and Products in Countries of Concern
This final rule sets forth the regulations that implement Executive Order 14105 of August 9, 2023, "Addressing United States Investments in Certain National Security Technologies and Products in Count...
www.federalregister.gov
November 25, 2024 at 3:19 PM
...and now we return to the table from keller's modded-nanogpt repository: github.com/KellerJordan....
GitHub - KellerJordan/modded-nanogpt: NanoGPT (124M) in 5 minutes
NanoGPT (124M) in 5 minutes. Contribute to KellerJordan/modded-nanogpt development by creating an account on GitHub.
github.com
November 25, 2024 at 3:15 PM
instead, we decided to focus on regulating the hardware - most notably, high-VRAM chips like the workhorse A100/H100. because pretraining parallelization is notoriously difficult to distribute across hetereogenous cards or even across datacenters, policy-makers thought it would be simpler this way.
November 25, 2024 at 3:13 PM
another option is to regulate the flow of human capital/knowledge. this is, again, difficult to implement when we have multinational organizations (corps, open source research groups, etc.) involved. unsurprisingly, some (e.g., M$FT) have also lobbied their way out (G42, MSR Asia)
November 25, 2024 at 3:10 PM
one option is to regulate the storage and transmission of the resulting systems themselves. tensor files , like 2003 mp3s, don't really lend themselves to effective management via this policy.
November 25, 2024 at 3:07 PM
knowledge diffusion, especially in the modern world, knows no borders. no one cares when it's a meme. but when this knowledge has to do with (hybrid) weapon system production, we tend to face choices.
November 25, 2024 at 3:04 PM
during 2023, and especially after the original wave of SLM projects like phi and tinyllama, this circle expanded dramatically to ~100s of orgs. while many roads still converged on the same gpu providers, the knowledge began to diffuse across open source projects and minds.
November 25, 2024 at 3:00 PM
prior to 2022, pretraining useful models was the domain of a literal handful, singular. the cost of human capital and hardware, let alone "non-diffused" knowledge, created an extreme concentration of power. *this* is what elon et al. were really discussing when they founded oai
November 25, 2024 at 2:57 PM
yeah :/ there is a serious selection effect in terms of subgraphs that migrated while preserving scale effects...
November 16, 2024 at 2:07 PM