Lightnews — Scholar-powered news

Alexander Shlyapin

@alshlyapin.bsky.social

Claude 3 was released recently. Although it scores higher than GPT-4 on many benchmarks, I argue that GPT-4 is probably better overall. I have two reasons for this:

1/n

March 7, 2024 at 11:40 PM

Alexander Shlyapin

@alshlyapin.bsky.social

The paper "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" (arxiv.org/abs/2402.17764) was published recently. The results are impressive, but I am somewhat skeptical because they seem too good to be true.

1/n

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single...

arxiv.org

March 5, 2024 at 4:12 AM
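
For readers wondering where "1.58 bits" comes from: a weight restricted to the three values -1, 0, and +1 carries log2(3) ≈ 1.58 bits of information. The snippet below is a minimal sketch of that idea, assuming the absmean scheme the paper describes (scale by the mean absolute weight, round, then clip to [-1, 1]). The function name `absmean_ternary` and the sample weights are illustrative only and are not taken from the paper's code.

```python
import math


def absmean_ternary(weights, eps=1e-8):
    """Quantize a flat list of floats to {-1, 0, +1} using absmean scaling.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    # gamma is the mean absolute value of the weights.
    gamma = sum(abs(w) for w in weights) / len(weights)
    scale = gamma + eps  # eps avoids division by zero for all-zero input
    # Round each scaled weight to the nearest integer, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, gamma


# Three possible values per weight correspond to log2(3) bits.
bits_per_weight = math.log2(3)

# Made-up sample weights, chosen only to show all three output values.
weights = [0.9, -0.05, 0.4, -1.2, 0.02, -0.6]
q, gamma = absmean_ternary(weights)

print(f"bits per ternary weight: {bits_per_weight:.3f}")
print(f"gamma: {gamma:.4f}")
print(f"quantized: {q}")
```

Weights that are small relative to the mean magnitude collapse to 0, and everything else keeps only its sign, which is why the storage and compute savings reported in the paper look so large.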

Alexander Shlyapin

@alshlyapin.bsky.social

I have written instructions on my wiki for installing Docker and using it to train LLMs and run inference with Hugging Face transformers: wiki.shlyapin.com/docker_train...

March 2, 2024 at 10:31 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Elon Musk is suing Sam Altman and OpenAI because they transitioned from a non-profit to a for-profit organization, despite Elon’s donation to the foundation under the premise that it was non-profit and committed to developing open-source AI. I tend to agree with Elon.

1/n

March 1, 2024 at 10:39 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Just a reminder: you should not trust any chatbot screenshots without a link to the conversation.

February 29, 2024 at 11:58 PM

Alexander Shlyapin

@alshlyapin.bsky.social

I recently came across this: en.m.wikipedia.org/wiki/Moravec.... It claims that, contrary to popular belief, AI will replace intellectual labor first and manual labor only later. The reasoning really makes sense.

1/n

February 26, 2024 at 11:05 PM

Alexander Shlyapin

@alshlyapin.bsky.social

The backlash against Google continues. Earlier, Google disabled image generation on Gemini, but users continued to check for biases and inaccuracies in Gemini (using text) and other Google services.

February 25, 2024 at 6:54 PM

Alexander Shlyapin

@alshlyapin.bsky.social

What's interesting about the recent release of Sora is that it revealed how anti-AI society is. Here are three posts against AI (and against Sora in particular) that received more likes (205K, 155K, and 150K) than the official OpenAI video, which received 141K likes.

February 24, 2024 at 9:58 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Microsoft introduced LongRoPE, a method to increase the context window of LLMs to 2M tokens. They tested this method on the Mistral and LLaMA2 models, demonstrating that the models do not lose performance on short-context benchmarks.
arxiv.org/abs/2402.13753

February 23, 2024 at 9:55 AM

Alexander Shlyapin

@alshlyapin.bsky.social

Gemini has been observed exhibiting biases when generating images related to history. This issue arises from the application of Reinforcement Learning from Human Feedback (RLHF).
(The screenshots are not mine).

February 22, 2024 at 1:20 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Google has released the Gemma models in 2B and 7B sizes. The 7B model surpasses the performance of Llama-2 13B. However, there is no comparison with Mistral 7B on their page. blog.google/technology/d...

February 21, 2024 at 4:16 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Predibase introduced LoRA Land: 25 fine-tuned Mistral-7B models that outperform GPT-4. They are served on a single A100. The training cost is approximately $200. The downside is that you need to manually select a model for each prompt. predibase.com/blog/lora-la...

February 21, 2024 at 4:16 PM

Alexander Shlyapin

@alshlyapin.bsky.social

I created my personal wiki at wiki.shlyapin.com. Currently, it contains information about git, SSH, and Markdown, but I plan to expand it with content on LLMs, AI, NLP, Deep Learning, Python, etc.
I've also launched a blog at www.shlyapin.com

February 20, 2024 at 4:22 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Wired published an article claiming that OpenAI signed a letter of intent to purchase chips from a startup named Rain. OpenAI CEO Sam Altman previously invested personally in Rain, which I see as a conflict of interest.
www.wired.com/story/openai...

OpenAI Agreed to Buy $51 Million of AI Chips From a Startup Backed by CEO Sam Altman

Documents show that OpenAI signed a letter of intent to spend $51 million on brain-inspired chips developed by startup Rain. OpenAI CEO Sam Altman previously made a personal investment in Rain.

www.wired.com

December 4, 2023 at 6:27 AM

Alexander Shlyapin

@alshlyapin.bsky.social

I read an interesting article about the AI debate and realized that my opinion is unique. I think that AI replacing humans is inevitable, and we can do almost nothing about it. Even if we stopped AI, we would go extinct for some other reason. So, we should not worry about AI.