Alexander Shlyapin
alshlyapin.bsky.social
Alexander Shlyapin
@alshlyapin.bsky.social
NLP Engineer (LLM)
Recently, Claude 3 was released. Although it shows greater results on many benchmarks compared to GPT-4, I argue that GPT-4 is probably better overall. I have two reasons for this:

1/n
March 7, 2024 at 11:40 PM
Recently, "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" (arxiv.org/abs/2402.17764) paper was published. The results are amazing, although I have a bit of skepticism because the results seem too good to be true.

1/n
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single...
arxiv.org
March 5, 2024 at 4:12 AM
I have written instructions on my wiki detailing how to install and utilize Docker for training and inferencing LLMs using Hugging Face transformers: wiki.shlyapin.com/docker_train...
Docker
Alexander Shlyapin’s wiki
wiki.shlyapin.com
March 2, 2024 at 10:31 PM
Elon Musk is suing Sam Altman and OpenAI because they transitioned from a non-profit to a for-profit organization, despite Elon’s donation to the foundation under the premise that it was non-profit and committed to developing open-source AI. I tend to agree with Elon.

1/n
March 1, 2024 at 10:39 PM
Just a reminder: you should not trust any chatbot screenshots without a link to the conversation.
February 29, 2024 at 11:58 PM
I recently came across this: en.m.wikipedia.org/wiki/Moravec.... It claims that, contrary to popular belief, intellectual labor will be replaced by AI first, followed by manual labor. And their logic really makes sense.

1/n
February 26, 2024 at 11:05 PM
The backlash against Google continues. Earlier, Google disabled image generation on Gemini, but users continued to check for biases and inaccuracies in Gemini (using text) and other Google services.
February 25, 2024 at 6:54 PM
What's interesting about the recent release of Sora is that it revealed how much society is anti-AI. I am presenting to you three posts against AI (and particularly against Sora) that received more likes—205K, 155K, and 150K—than the official OpenAI video, which received 141K likes.
February 24, 2024 at 9:58 PM
Microsoft introduced LongRoPE, a method to increase the context window of LLMs to 2M tokens. They tested this method on the Mistral and LLaMA2 models, demonstrating that the models do not lose performance on short-context benchmarks.
arxiv.org/abs/2402.13753
February 23, 2024 at 9:55 AM
Gemini has been observed exhibiting biases when generating images related to history. This issue arises from the application of Reinforcement Learning from Human Feedback (RLHF).
(The screenshots are not mine).
February 22, 2024 at 1:20 PM
Google has released the Gemma models in 2B and 7B sizes. The 7B model surpasses the performance of Llama-2 13B. However, there is no comparison with Mistral 7B on their page. blog.google/technology/d...
February 21, 2024 at 4:16 PM
The introduction of LoraLand features 25 fine-tuned Mistral-7B models that outperform GPT-4. They are served on a single A100. The training cost is approximately $200. The downside is that you need to manually select a model for each prompt. predibase.com/blog/lora-la...
February 21, 2024 at 4:16 PM
I created my personal wiki at wiki.shlyapin.com. Currently, it contains information about git, SSH, and Markdown, but I plan to expand it with content on LLMs, AI, NLP, Deep Learning, Python, etc.
I've also launched a blog at www.shlyapin.com
Home
Alexander Shlyapin’s wiki
wiki.shlyapin.com
February 20, 2024 at 4:22 PM
Wired published an article claiming that OpenAI signed a letter of intent to purchase chips from a startup named Rain. OpenAI CEO Sam Altman previously invested personally in Rain, which I see as a conflict of interest.
www.wired.com/story/openai...
OpenAI Agreed to Buy $51 Million of AI Chips From a Startup Backed by CEO Sam Altman
Documents show that OpenAI signed a letter of intent to spend $51 million on brain-inspired chips developed by startup Rain. OpenAI CEO Sam Altman previously made a personal investment in Rain.
www.wired.com
December 4, 2023 at 6:27 AM
November 29, 2023 at 2:05 PM
November 29, 2023 at 1:35 PM
November 28, 2023 at 5:16 PM
November 27, 2023 at 7:48 PM
November 27, 2023 at 2:43 PM
November 27, 2023 at 3:27 AM
I read an interesting article about the AI debate and realized that my opinion is unique. I think that AI replacing humans is inevitable, and we can do almost nothing about it. Even if we stop AI, we would go extinct for another reason. So, we should not worry about AI
The "public debate" about AI is confusing for the general public and for policymakers because it is ...
Summary of Argument: The public debate among AI experts is confusing because there are, to a first approximation, three sides, not two sides to the d…
www.lesswrong.com
November 26, 2023 at 4:08 PM
November 26, 2023 at 12:13 PM
November 25, 2023 at 9:05 AM
November 23, 2023 at 4:43 PM
November 23, 2023 at 4:28 PM