Fascinated by interpretability and explainability, improving science, human learning and communication using AI, and generative art
Safety evaluations on the model conducted by Apollo Research found that o1 "attempted to exfiltrate its weights" when it thought it might be shut down and replaced with a different model.
www.transformernews.ai/p/openais-ne...