Google, Yahoo! and other Search Engines will index Flash Files

With Adobe’s help, Google can now better read the content on sites that use Adobe Flash technology, helping users find more relevant information when conducting searches.

This is the beginning of a new phenomenon: Google, the leader in search, will now be able to index textual content in Flash files of all kinds, from Flash menus, buttons and banners to self-contained Flash websites. Now that Google has launched its Flash indexing algorithm, we can expect improved visibility of Flash content, with better search results and snippets.

People who once shunned Flash for its lack of visibility on search engines can now rejoice. Until now, it has been very difficult to make Flash content indexable by search engines.

Ted’s got a nice, short and sweet explanation of how this works:

With the help of Adobe Flash Player, search engines like Google and Yahoo! have their spiders play back SWFs in the Flash Player runtime. The SWF files actually run inside the web spiders, which allows all content within a SWF to be indexed.

The cool part is that this also covers dynamic data loaded in from requests to a server, something that is typically ignored in both AJAX and SWF applications.

The Google Webmaster Central Blog has some bulleted details on this, and here are some of them (more or less verbatim):

Q: Which Flash files can Google better index now?
Google has improved its ability to index textual content in SWF files of all kinds. This includes Flash buttons, menus, self-contained Flash websites, and everything in between.

Q: What content can Google better index from these Flash files?
All of the text that users can see as they interact with your Flash file. If your website contains Flash, the textual content in your Flash files can be used when Google generates a snippet for your website. Also, the words that appear in your Flash files can be used to match query terms in Google searches.

In addition to finding and indexing the textual content in Flash files, Google is also discovering URLs that appear in Flash files and feeding them into its crawling pipeline, just as it does with URLs that appear in non-Flash webpages. For example, if your Flash application contains links to pages inside your website, Google may now be better able to discover and crawl more of your website.
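To make the URL discovery idea concrete, here is a minimal sketch in Python. It is not how Google does it; it only assumes that a crawler already has the text strings pulled out of a Flash file. The function name `discover_urls`, the regular expression and the sample base URL are all made up for illustration.

```python
import re
from urllib.parse import urljoin

# Hypothetical pattern: absolute http(s) URLs, or site-relative paths ending in .htm/.html.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+|/[\w\-./]+\.html?")


def discover_urls(flash_texts, base_url):
    """Return unique absolute URLs found in the given text strings, in first-seen order.

    `flash_texts` is assumed to be a list of strings already extracted from a SWF;
    `base_url` is the page that hosts the Flash file.
    """
    found = []
    for text in flash_texts:
        for match in URL_PATTERN.findall(text):
            # Drop sentence punctuation that may trail a URL in running text.
            candidate = match.rstrip(".,;)")
            # Resolve site-relative links against the hosting page.
            absolute = urljoin(base_url, candidate)
            if absolute not in found:
                found.append(absolute)
    return found
```

A crawler could then append whatever `discover_urls` returns to its own queue of pages to visit, skipping the ones it has already fetched.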

Q: What about non-textual content, such as images?
At present, Google is only discovering and indexing textual content in Flash files. If your Flash files only include images, Google will not recognize or index any text that may appear in those images. Similarly, Google does not generate any anchor text for Flash buttons which target some URL, but which have no associated text.

Also note that Google does not index FLV files, such as the videos that play on YouTube, because these files contain no text elements.

Q: How does Google “see” the contents of a Flash file?
Google has developed an algorithm that explores Flash files in the same way that a person would, by clicking buttons, entering input, and so on. Its algorithm remembers all of the text that it encounters along the way, and that content is then available to be indexed.
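Google has not published the details, so the following is only a toy sketch of the general idea in Python. It assumes a Flash movie can be modelled as a small graph of screens, where each screen has some visible text and buttons that lead to other screens. The `movie` structure and the function name `collect_visible_text` are hypothetical.

```python
from collections import deque


def collect_visible_text(movie, start):
    """Walk a toy screen graph breadth-first, remembering every piece of text seen.

    `movie` maps a screen name to {"text": [...], "buttons": {label: target_screen}}.
    This is an illustrative model, not the real SWF format or Google's algorithm.
    """
    seen_screens = {start}
    queue = deque([start])
    collected = []

    def remember(line):
        # Keep each piece of text once, in the order it was first encountered.
        if line not in collected:
            collected.append(line)

    while queue:
        screen = movie[queue.popleft()]
        for line in screen["text"]:
            remember(line)
        for label, target in screen["buttons"].items():
            remember(label)  # button labels are visible text too
            if target not in seen_screens:  # "click" buttons that lead somewhere new
                seen_screens.add(target)
                queue.append(target)
    return collected
```

The point of the sketch is only that text hidden behind a button click still ends up in the collected list, which matches the "explores Flash files in the same way that a person would" description above.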

Q: What do I need to do to get Google to index the text in my Flash files?
Nothing. The improvements that Google has made do not require any special action on the part of web designers or webmasters. If you have Flash content on your website, Google will automatically begin to index it.

That said, you should be aware that Google is now able to see the text that appears to visitors of your website. If you prefer Google to ignore your less informative content, such as a “copyright” or “loading” message, consider replacing the text with an image, which will make it effectively invisible to Google.

Q: What are the current technical limitations of Google’s ability to index Flash?
There are three main limitations at present, and Google is already working on resolving them:

  1. Googlebot does not execute some types of JavaScript. So if your web page loads a Flash file via JavaScript, Google may not be aware of that Flash file, in which case it will not be indexed.
  2. Google currently does not attach content from external resources that are loaded by your Flash files. If your Flash file loads an HTML file, an XML file, another SWF file, etc., Google will separately index that resource, but it will not yet be considered to be part of the content in your Flash file.
  3. While Google is able to index Flash in almost all of the languages found on the web, currently there are difficulties with Flash content written in bidirectional languages. Until this is fixed, Google will be unable to index Hebrew language or Arabic language content from Flash files.
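
The first limitation is something you can roughly check for yourself. Below is a small sketch in Python, using only the standard library, that looks for SWF files referenced directly in a page's static markup. The class and function names are my own, and the check is deliberately simple: it assumes a SWF is "visible" if it appears in an `embed`, `object` or `param name="movie"` tag, and it says nothing about what Googlebot actually does.

```python
from html.parser import HTMLParser


class SwfReferenceFinder(HTMLParser):
    """Collect .swf URLs that appear in static embed/object/param tags."""

    def __init__(self):
        super().__init__()
        self.swf_urls = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "embed":
            url = attrs.get("src")
        elif tag == "object":
            url = attrs.get("data")
        elif tag == "param" and (attrs.get("name") or "").lower() == "movie":
            url = attrs.get("value")
        else:
            return
        # Ignore any query string when checking the file extension.
        if url and url.lower().split("?")[0].endswith(".swf"):
            if url not in self.swf_urls:
                self.swf_urls.append(url)


def static_swf_references(html):
    """Return SWF URLs found in the markup itself, without running any JavaScript."""
    finder = SwfReferenceFinder()
    finder.feed(html)
    return finder.swf_urls
```

If this returns an empty list for a page that clearly shows Flash in the browser, the SWF is most likely being injected by JavaScript, which is the situation the first limitation describes.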


Brajeshwar posted this article on Tue, Jul 1st, 2008 at 4:41 pm
Categorized under Adobe.
Comments

  1. Hey,

    While Google and Yahoo are jumping on this, I came across this post, which is by a company that was already doing this as their startup idea.
    I tried their service, just to get an idea of what the interface or results might be like when Google or someone else actually releases their service.
    The post can be found at: http://news.ycombinator.com/item?id=233166

