{"id":4,"date":"2004-07-25T04:28:48","date_gmt":"2004-07-25T04:28:48","guid":{"rendered":"http:\/\/lachy.id.au\/log\/2004\/07\/applicationxhtmlxml-google-file-format-unrecognized"},"modified":"2006-04-30T23:55:00","modified_gmt":"2006-04-30T23:55:00","slug":"applicationxhtmlxml-google-file-format","status":"publish","type":"post","link":"https:\/\/lachy.id.au\/log\/2004\/07\/applicationxhtmlxml-google-file-format","title":{"rendered":"application\/xhtml+xml+google = File Format: Unrecognized"},"content":{"rendered":"<p>Yesterday afternoon, while checking where I was ranked by searching for my name, <a href=\"http:\/\/www.google.com.au\/search?q=Lachlan+Hunt&amp;ie=UTF-8&amp;hl=en&amp;btnG=Google+Search&amp;meta=\"><kbd>Lachlan Hunt<\/kbd><\/a> in <a href=\"http:\/\/www.google.com\/\">Google<\/a> to see where my site was ranked, I was surprised to see not only was my site ranked 6th, only beaten by my <a href=\"http:\/\/www.blogger.com\/profile\/3043458\" title=\"Blogger: User Profile: Lachlan Hunt\">blogger profile<\/a>, two pages on <a href=\"http:\/\/www.msdn.com\/\" title=\"Microsoft Developer Network\">MSDN\u2019s<\/a> <a href=\"http:\/\/channel9.msdn.com\/wiki\/\">Channel9 Wiki<\/a> that I&#8217;ve edited, and two <a href=\"http:\/\/bobby.watchfire.com\/bobby\/html\/en\/index.jsp\">Bobby Watchfire<\/a> accessibility checks of my homepage <em>(I don&#8217;t know why!  Who&#8217;d be linking to those for Google to find?)<\/em>, but the description for my site turned out to be:<\/p>\r\n\r\n<pre><samp>File Format: Unrecognized - View as <abbr title=\"HyperText Markup Language\">HTML<\/abbr><\/samp><\/pre>\r\n\r\n<p>This is because currently, my homepage is only being served as <code>application\/xhtml+xml<\/code>, and it was surprising for 2 reasons.  Firstly, I thought that Google would have at least been designed to be able to parse <abbr title=\"Extensible HyperText Markup Language\">XHTML<\/abbr>, even if it were only doing it as <em>tag-soup<\/em> like everything else it searches.  And secondly, the <q>View as <abbr title=\"HyperText Markup Language\">HTML<\/abbr><\/q> link was still included, even though google had no idea what format it was, nor how to parse it. If you actually follow that link now, the page contains nothing except for the google branding and diclaimer, that it invalidly puts at the top of every <em>cached<\/em> and <em>view as <abbr title=\"HyperText Markup Language\">HTML<\/abbr><\/em> page it generates. <strong>above<\/strong> any <code>&lt;html&gt;<\/code> element and\/or <code>&lt;!DOCTYPE&gt;<\/code> in the file.<\/p>\r\n\r\n<p>When will Google learn to start writing valid <abbr title=\"HyperText Markup Language\">HTML<\/abbr> for <strong>all<\/strong> their pages, and when will they support industry standards?  I thought that only <strong title=\"aka. Internet Explorer\">Internet Exploder<\/strong> was the only user agent lagging behind with standards!<\/p>\r\n","protected":false},"excerpt":{"rendered":"Yesterday afternoon, while checking where I was ranked by searching for my name, Lachlan Hunt in Google to see where my site was ranked, I was surprised to see not only was my site ranked 6th, only beaten by my blogger profile, two pages on MSDN\u2019s Channel9 Wiki that I&#8217;ve edited, and two Bobby Watchfire &hellip; <a href=\"https:\/\/lachy.id.au\/log\/2004\/07\/applicationxhtmlxml-google-file-format\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">application\/xhtml+xml+google = File Format: Unrecognized<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[10,2,7],"tags":[],"_links":{"self":[{"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/posts\/4"}],"collection":[{"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/comments?post=4"}],"version-history":[{"count":0,"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/posts\/4\/revisions"}],"wp:attachment":[{"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/media?parent=4"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/categories?post=4"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lachy.id.au\/log\/wp-json\/wp\/v2\/tags?post=4"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}