Lightnews — Scholar-powered news

eify.bsky.social

@eify.bsky.social

I just noticed this: did Meta AI's Few-Shot Learner, with its use of policy descriptions, inspire Anthropic's Constitutional AI a year later?
ai.meta.com/blog/harmful...
www.anthropic.com/research/con...

Harmful content can evolve quickly. Our new AI system adapts to tackle it.

We’ve built and deployed a new AI technology called Few-Shot Learner that can take faster action on new or evolving types of harmful content.

ai.meta.com

January 1, 2025 at 12:56 AM

eify.bsky.social

@eify.bsky.social

According to Appendix B, AdamW and MARS (arxiv.org/abs/2411.10438) seem to reach effectively the same val loss on GPT-2 small when each uses its optimal LR, in contrast to Figure 1? @quanquangu.bsky.social

If MARS is less sensitive to LR, that's also an advantage, but a different kind.

Figure 6: Validation loss with respect to training tokens for AdamW with learning rates 6 × 10^-4, 3 × 10^-3 and MARS with learning rate 3 × 10^-3 on GPT-2 small model (125M). The model trained with MARS achieves a validation loss of 2.852.

Figure 1: The training and validation loss curves, plotted against both training tokens and wall-clock time on GPT-2 small model (125M).

December 4, 2024 at 9:31 PM

eify.bsky.social

@eify.bsky.social

A critical TensorFlow module (TensorFlow Text) still doesn't support Python 3.12. I had to switch back to 3.11 😬
github.com/tensorflow/t...

GitHub - tensorflow/text: Making text a first-class citizen in TensorFlow.

Making text a first-class citizen in TensorFlow. Contribute to tensorflow/text development by creating an account on GitHub.

github.com

November 28, 2024 at 7:44 AM

eify.bsky.social

@eify.bsky.social

That training run failed to converge anyway, but TIL: if you use multiprocessing to spawn new processes, you can't edit the .py file while the code is running, because each new Python interpreter has to compile the source again!

November 25, 2024 at 8:41 PM
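
A rough sketch of the effect described in the multiprocessing post above, under some assumptions: the module name `hypothetical_settings` and its `VALUE` attribute are made up for illustration, and a fresh interpreter started with `subprocess` stands in for a worker created by the `spawn` start method (both begin with a new Python process that imports source from disk). It is not the original training code.

```python
import importlib
import os
import subprocess
import sys
import tempfile
from pathlib import Path

MODULE_NAME = "hypothetical_settings"  # illustrative name, not from the post


def fresh_interpreter_value(module_dir: str) -> str:
    """Import the module in a brand-new interpreter and return its VALUE."""
    env = {**os.environ, "PYTHONPATH": module_dir}
    result = subprocess.run(
        # -B: do not write .pyc files from the child interpreter.
        [sys.executable, "-B", "-c", f"import {MODULE_NAME} as m; print(m.VALUE)"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def demo() -> tuple:
    """Return (child value before edit, child value after edit, parent value)."""
    with tempfile.TemporaryDirectory() as tmp:
        source = Path(tmp) / f"{MODULE_NAME}.py"
        source.write_text("VALUE = 'original'\n")

        # The parent imports once and keeps the compiled module in memory.
        sys.path.insert(0, tmp)
        try:
            importlib.invalidate_caches()
            parent_module = importlib.import_module(MODULE_NAME)
        finally:
            sys.path.remove(tmp)

        before = fresh_interpreter_value(tmp)

        # Edit the file "while the program is running". The new text has a
        # different length, so any cached bytecode is treated as stale.
        source.write_text("VALUE = 'edited while running'\n")
        after = fresh_interpreter_value(tmp)

        parent_value = parent_module.VALUE
        sys.modules.pop(MODULE_NAME, None)  # leave the parent process clean
        return before, after, parent_value


if __name__ == "__main__":
    print(demo())
```

The parent keeps the value it imported at startup, while the second fresh interpreter picks up the edited file, which is why workers started at different times can end up running different versions of the code.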
