On Apr 5, 2005 7:33 PM, Ira Abramov <lists-MediaWiki-l(a)ira.abramov.org> wrote:
Quoting Brion Vibber, from the post of Sat, 02 Apr:
Christof Damian wrote:
does anyone know why google doesn't index
description pages of images?
if i do a google search like this: "site:wikipedia.org inurl:image" i
just get about 650 result, some of which are media files, but none are
images (jpg/gif/png).
Probably Google is stupid enough to think '.jpg' at the end of a URL
means the resource is itself an image.
when you are a bot that has to slurp up millions of pages a day, it's
safe to assume in 99.99% of the cases, that a jpg suffix will indeed
lead you to an image. requesting that URL just to see that the header
indeed gives one MIME type or the other means adding a considderable
overhead. apart from mediawiki and a few rare CMS's, I'd risk a guess
that practically nobody uses such suffixes in a URL..
Isn't that what the "HEAD" action is for? And shouldn't Googlebots be
also indexing images for the Google image search?
-- Jamie
-------------------------------------------------------------------
Thank you to JosephM for inviting me to Gmail!
Have lots of invites. Gmail now had 2GB.