kasidkhan
banner
kasidkhan.bsky.social
kasidkhan
@kasidkhan.bsky.social
AI enthusiast and commentators. Follow me if you are interested in Artificial Intelligence.
It is 2028 yet ?
March 6, 2025 at 5:00 AM
What truly makes us "living beings"?
January 20, 2025 at 8:03 AM
December 25, 2024 at 5:40 PM
Look at the numbers for o3 model operation cost,
"approximately 1,785 kWh of energy, about the same amount of electricity an average U.S. household uses in two months" - Boris Gamazaychikov
OpenAI o3 Breakthrough High Score on ARC-AGI-Pub
OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.
lnkd.in
December 23, 2024 at 9:49 AM
ARC-AGI-1. This is the test that decides if an AI is AGI.

Waiiittt, sorry, let me rephrase it.

"It is the test that decides if someone is human or not."

Good luck and all the best , I guess. 😐

hashtag#DefiningHumanity hashtag#AGIThreshold hashtag#ManOrMachine
December 22, 2024 at 6:06 AM
Agricultural revolution > Industrial Revolution > Internet > AI Resolution > what next? Space revolution ?
December 12, 2024 at 4:10 PM
🤔🧐💭🗨️🔍 Did you know ?.
1️⃣ TruthfulQA assesses the truthfulness of LLMs in their responses.
2️⃣ RealToxicityPrompts and ToxiGen tracks the extent of toxic output produced by language models.
3️⃣ BOLD and BBQ evaluate the bias present in LLM generations.
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Pretrained neural language models (LMs) are prone to generating racist, sexist, or otherwise toxic language which hinders their safe deployment. We investigate the extent to which pretrained LMs can b...
arxiv.org
December 11, 2024 at 5:00 AM
AGI yet ?
December 8, 2024 at 7:18 PM
Will AI take our jobs ?

Well, the SWE-bench results provide some evidence. In just one year, the percentage of coding problems solved on the GitHub dataset (complex problem) has increased from 4.8% to 55%. impressive ? Indeed. source: www.swebench.com/viewer.html
December 7, 2024 at 10:17 PM