Lightnews — Scholar-powered news

kasidkhan

@kasidkhan.bsky.social

9 followers 11 following 12 posts

AI enthusiast and commentator. Follow me if you are interested in Artificial Intelligence.


kasidkhan

@kasidkhan.bsky.social

Is it 2028 yet?

March 6, 2025 at 5:00 AM

kasidkhan

@kasidkhan.bsky.social

What truly makes us "living beings"?

January 20, 2025 at 8:03 AM

kasidkhan

@kasidkhan.bsky.social

December 25, 2024 at 5:40 PM

kasidkhan

@kasidkhan.bsky.social

Look at the numbers for the o3 model's operating cost:
"approximately 1,785 kWh of energy, about the same amount of electricity an average U.S. household uses in two months" - Boris Gamazaychikov

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub

OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.

lnkd.in

December 23, 2024 at 9:49 AM

kasidkhan

@kasidkhan.bsky.social

ARC-AGI-1. This is the test that decides if an AI is AGI.

Waiiittt, sorry, let me rephrase it.

"It is the test that decides if someone is human or not."

Good luck and all the best, I guess. 😐

#DefiningHumanity #AGIThreshold #ManOrMachine

December 22, 2024 at 6:06 AM

kasidkhan

@kasidkhan.bsky.social

Agricultural Revolution > Industrial Revolution > Internet > AI Revolution > what next? Space Revolution?

December 12, 2024 at 4:10 PM

kasidkhan

@kasidkhan.bsky.social

🤔🧐💭🗨️🔍 Did you know?
1️⃣ TruthfulQA assesses the truthfulness of LLMs in their responses.
2️⃣ RealToxicityPrompts and ToxiGen track the extent of toxic output produced by language models.
3️⃣ BOLD and BBQ evaluate the bias present in LLM generations.

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models

Pretrained neural language models (LMs) are prone to generating racist, sexist, or otherwise toxic language which hinders their safe deployment. We investigate the extent to which pretrained LMs can b...

arxiv.org

December 11, 2024 at 5:00 AM

kasidkhan

@kasidkhan.bsky.social

AGI yet?

December 8, 2024 at 7:18 PM

kasidkhan

@kasidkhan.bsky.social

Will AI take our jobs?

Well, the SWE-bench results provide some evidence. In just one year, the percentage of coding problems solved on its GitHub-based dataset (complex problems) has increased from 4.8% to 55%. Impressive? Indeed. Source: www.swebench.com/viewer.html

December 7, 2024 at 10:17 PM
