Lino Uruñuela
errioxa.bsky.social
Lino Uruñuela
@errioxa.bsky.social
Technical SEO Specialist & Data Lover ❤️

Blog: http://Mecagoenlos.com
Linkedin http://linkedin.com/in/errioxa
SEO Tool: http://MyDomain.dev
Encrypted in hex and braille ....🤔
March 27, 2025 at 7:41 AM
El contenido "fake" generado por CloudFlare NO es contenido inventado sino que es contenido verídico, basado en artículos científicos que no tienen ninguna relevancia para no fomentar la desinformación.. para quitarse el sombrero 🎩
blog.cloudflare.com/ai-labyrinth/
Trapping misbehaving bots in an AI Labyrinth
How Cloudflare uses generative AI to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives.
blog.cloudflare.com
March 21, 2025 at 6:24 AM
Además mostrar contenido "fake" también generará enlaces a otras URLs fake, que solo un bot verá y seguirá. De esta manera, además de impedir que estos bots vean el contenido real les hace perder el tiempo (y recursos) en rastrear URLs con contenido que no les vale para nada.
March 21, 2025 at 6:24 AM
This graph shows how many robots.txt files mention each User Agent.

GPTBot has been showing up in more and more robots.txt files over time

Post: www.mecagoenlos.com/Posicionamie...

How did I do? www.mecagoenlos.com/Posicionamie...
March 4, 2025 at 10:40 AM
AI-related bots aren’t in the top 10 yet, but they’re slowly becoming more common in robots.txt

The number of robots.txt files that mention AI-related bots has been increasing over time
March 4, 2025 at 10:40 AM
Now that I think about it, if it didn’t have an A record, it wouldn’t resolve the robots.txt URL either....
February 26, 2025 at 5:36 PM
Maybe by checking if the hostname has an A record?

dig mydomain.dev A

If there are no A or AAAA records, that would be a strong indication that the hostname doesn’t point directly to a web server. ¯\_(ツ)_/¯
February 26, 2025 at 5:15 PM
Regarding robots.txt... I just published an analysis of over 400 million robots.txt files to see how UA bots are being restricted 😅

www.mecagoenlos.com/Posicionamie...

and how I did it (tech post)
www.mecagoenlos.com/Posicionamie...

I'll look for yours to see if I can "listen" to it. 🤣
¿Se está impidiendo a los bots de Inteligencia Artificial acceder al contenido?
cómo ha ido incrementando el número de robots.txt en los que aparecen rastreadores asociados a la Inteligencia Artificial.
www.mecagoenlos.com
February 26, 2025 at 12:48 PM
From chrome://on-device-internals/ you can load .bin file and use the model, but I don't know how to run it from the command line
February 7, 2025 at 1:56 PM
Is it possible to run an LLM if you have a .bin weights file?
e..g. the weights of the LLM that Chrome uses for Built-in AI are located at this path (in Linux):
~/.config/google-chrome-unstable/OptGuideOnDeviceModel/2024.9.25.2033/weights.bin (3GB)

Is there a way to run it from the command line?
February 7, 2025 at 1:41 PM
🤦‍♂️
January 30, 2025 at 9:54 PM