Lightnews — Scholar-powered news

Reposted

Ben Burtenshaw

@benburtenshaw.bsky.social

who's fine-tuning LLMs for reasoning? This dataset has been trending for a few weeks and there's a list of models trained on it.

- It has SFT formatted reasoning sequences, like those in o1.
- You could incorporate these into post training to boost reasoning abilities.

O1-OPEN/OpenO1-SFT · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

buff.ly

December 12, 2024 at 11:00 AM

Reposted

Ben Burtenshaw

@benburtenshaw.bsky.social

came across this example in agent-as-a-judge from Meta. It uses agent-as-a-judge to evaluate the effectiveness of a DevAI app.

- It's based on an open dataset.
- It's more accurate than LLM as a judge
- It explains its evaluation based on preferences, and requirements.

https://buff.ly/49tN6CQ

December 11, 2024 at 11:00 AM

Reposted

David Berenstein

@davidberenstein.bsky.social

Learn local and private LLMs with Hugging Face 🤗 : Participation is free so join now!

Even better, there are minimal GPU requirements and no paid services.

Start with, Instruction Tuning, Preference Alignment, Parameter-efficient Fine-tuning.

GitHub: https://buff.ly/3ZCMKX2

December 11, 2024 at 1:00 PM

frascuchon.bsky.social

@frascuchon.bsky.social

I've just contributed 10 examples to this dataset:

data-is-better-together-fineweb-c.hf.space/share-your-p...

spa - español - Spanish

Join and contribute to the dataset spa - español - Spanish

data-is-better-together-fineweb-c.hf.space

December 11, 2024 at 1:49 PM

Reposted

José Francisco Calvo

@jfcalvo.hf.co

The great @benburtenshaw.bsky.social is running an open course on fine-tuning smol LLMs, and it’s seriously worth checking out.

If you’re into AI or just curious about how these small language models work, this could be right up your alley. Don’t miss it—it’s super interesting!

#AI #LLMs #Learning

Ben Burtenshaw @benburtenshaw.bsky.social · Dec 3

For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol.

🧵>>

December 4, 2024 at 10:31 AM

Reposted

David Berenstein

@davidberenstein.bsky.social

👐 Open Image Preferences is an Apache 2.0 licensed dataset for text-to-image generation by the @hf.co community. This dataset contains 10K text-to-image preference pairs across image generation categories, using different model families and prompt complexities.

Blog: huggingface.co/blog/image-p...

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

December 9, 2024 at 3:30 PM

frascuchon.bsky.social

@frascuchon.bsky.social

✨ Argilla 2.5.0 is live and it comes with webhook listener support to supercharge your workflows! 🚀

#AI #MachineLearning #Webhooks #TechUpdate

December 3, 2024 at 10:46 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news