草庐IT

php - 迁移到 "Sitemap Protocol 0.9"

coder 2024-04-20 原文

我有一个创建站点地图的 PHP 脚本。但它基于“Sitemap Protocol 0.84”。

我想更改代码以生成有效的“Sitemap Protocol 0.9”输出。

0.84 和 0.9 之间有哪些变化?

最佳答案

这是 0.8 和 0.9 站点地图和站点索引版本之间差异的完整输出。

$diff -u sitemap0.8.xml sitemal0.9.xml

--- sitemap0.8.xsd  2015-12-19 12:40:36.558949286 -0300
+++ sitemap.xsd 2015-12-19 12:40:55.958939049 -0300
@@ -1,85 +1,86 @@
 <?xml version="1.0" encoding="UTF-8"?>
-<xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema" targetNamespace="http://www.google.com/schemas/sitemap/0.84" xmlns="http://www.google.com/schemas/sitemap/0.84">
-<xsd:annotation>
-  <xsd:documentation>
-    XML Schema for Sitemap files.
-    Last Modifed 2005-05-24
-  </xsd:documentation>
-</xsd:annotation>
-
-<xsd:element name="urlset">
-  <xsd:annotation>
-    <xsd:documentation>
-      Container for a set of up to 50,000 document elements.
-      This is the root element of the XML file.
-    </xsd:documentation>
-  </xsd:annotation>
- <xsd:complexType>
-   <xsd:sequence>
-     <xsd:element ref="url" maxOccurs="unbounded"/>
-   </xsd:sequence>
- </xsd:complexType>
-</xsd:element>
-
-<xsd:element name="url">
+<xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema" targetNamespace="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" elementFormDefault="qualified">
   <xsd:annotation>
     <xsd:documentation>
-      Container for the data needed to describe a document to crawl.
+      XML Schema for Sitemap files.
+      Last Modifed 2008-03-26
     </xsd:documentation>
   </xsd:annotation>
- <xsd:complexType>
-   <xsd:all>
-     <xsd:element ref="loc"/>
-     <xsd:element ref="lastmod" minOccurs="0"/>
-     <xsd:element ref="changefreq" minOccurs="0"/>
-     <xsd:element ref="priority" minOccurs="0"/>
-   </xsd:all>
- </xsd:complexType>
-</xsd:element>

-<xsd:element name="loc">
-  <xsd:annotation>
-    <xsd:documentation>
-      REQUIRED: The location URI of a document.
-      The URI must conform to RFC 2396 (http://www.ietf.org/rfc/rfc2396.txt).
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
+  <xsd:element name="urlset">
+    <xsd:annotation>
+      <xsd:documentation>
+        Container for a set of up to 50,000 document elements.
+        This is the root element of the XML file.
+      </xsd:documentation>
+    </xsd:annotation>
+    <xsd:complexType>
+      <xsd:sequence>
+        <xsd:any namespace="##other" minOccurs="0" maxOccurs="unbounded" processContents="strict"/>
+        <xsd:element name="url" type="tUrl" maxOccurs="unbounded"/>
+      </xsd:sequence>
+    </xsd:complexType>
+  </xsd:element>
+
+  <xsd:complexType name="tUrl">
+    <xsd:annotation>
+      <xsd:documentation>
+        Container for the data needed to describe a document to crawl.
+      </xsd:documentation>
+    </xsd:annotation>
+    <xsd:sequence>
+      <xsd:element name="loc" type="tLoc"/>
+      <xsd:element name="lastmod" type="tLastmod" minOccurs="0"/>
+      <xsd:element name="changefreq" type="tChangeFreq" minOccurs="0"/>
+      <xsd:element name="priority" type="tPriority" minOccurs="0"/>
+      <xsd:any namespace="##other" minOccurs="0" maxOccurs="unbounded" processContents="strict"/>
+    </xsd:sequence>
+  </xsd:complexType>
+
+  <xsd:simpleType name="tLoc">
+    <xsd:annotation>
+      <xsd:documentation>
+        REQUIRED: The location URI of a document.
+        The URI must conform to RFC 2396 (http://www.ietf.org/rfc/rfc2396.txt).
+      </xsd:documentation>
+    </xsd:annotation>
     <xsd:restriction base="xsd:anyURI">
       <xsd:minLength value="12"/>
       <xsd:maxLength value="2048"/>
     </xsd:restriction>
   </xsd:simpleType>
-</xsd:element> 

-<xsd:element name="lastmod">
-  <xsd:annotation>
-    <xsd:documentation>
-      OPTIONAL: The date the document was last modified. The date must conform
-      to ISO 8601 (http://www.w3.org/TR/NOTE-datetime). Example: 2005-05-10
-      Lastmod may also contain a timestamp. Example: 2005-05-10T17:33:30+08:00
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
-    <xsd:restriction base="xsd:string">
-      <xsd:minLength value="10"/>
-      <xsd:maxLength value="25"/>
-    </xsd:restriction>
+  <xsd:simpleType name="tLastmod">
+    <xsd:annotation>
+      <xsd:documentation>
+        OPTIONAL: The date the document was last modified. The date must conform
+        to the W3C DATETIME format (http://www.w3.org/TR/NOTE-datetime).
+        Example: 2005-05-10
+        Lastmod may also contain a timestamp.
+        Example: 2005-05-10T17:33:30+08:00
+      </xsd:documentation>
+    </xsd:annotation>
+    <xsd:union>
+      <xsd:simpleType>
+        <xsd:restriction base="xsd:date"/>
+      </xsd:simpleType>
+      <xsd:simpleType>
+        <xsd:restriction base="xsd:dateTime"/>
+      </xsd:simpleType>
+    </xsd:union>
   </xsd:simpleType>
-</xsd:element> 

-<xsd:element name="changefreq">
-  <xsd:annotation>
-    <xsd:documentation>
-      OPTIONAL: Indicates how frequently the content at a particular URL is
-      likely to change. The value "always" should be used to describe
-      documents that change each time they are accessed. The value "never"
-      should be used to describe archived URLs. Please note that web
-      crawlers may not necessarily crawl pages marked "always" more often.
-      Consider this element as a friendly suggestion and not a command.
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
+  <xsd:simpleType name="tChangeFreq">
+    <xsd:annotation>
+      <xsd:documentation>
+        OPTIONAL: Indicates how frequently the content at a particular URL is
+        likely to change. The value "always" should be used to describe
+        documents that change each time they are accessed. The value "never"
+        should be used to describe archived URLs. Please note that web
+        crawlers may not necessarily crawl pages marked "always" more often.
+        Consider this element as a friendly suggestion and not a command.
+      </xsd:documentation>
+    </xsd:annotation>
     <xsd:restriction base="xsd:string">
       <xsd:enumeration value="always"/>
       <xsd:enumeration value="hourly"/>
@@ -90,26 +91,23 @@
       <xsd:enumeration value="never"/>
     </xsd:restriction>
   </xsd:simpleType>
-</xsd:element> 

-<xsd:element name="priority">
-  <xsd:annotation>
-    <xsd:documentation>
-      OPTIONAL: The priority of a particular URL relative to other pages
-      on the same site. The value for this element is a number between
-      0.0 and 1.0 where 0.0 identifies the lowest priority page(s).
-      The default priority of a page is 0.5. Priority is used to select
-      between pages on your site. Setting a priority of 1.0 for all URLs
-      will not help you, as the relative priority of pages on your site
-      is what will be considered.
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
+  <xsd:simpleType name="tPriority">
+    <xsd:annotation>
+      <xsd:documentation>
+        OPTIONAL: The priority of a particular URL relative to other pages
+        on the same site. The value for this element is a number between
+        0.0 and 1.0 where 0.0 identifies the lowest priority page(s).
+        The default priority of a page is 0.5. Priority is used to select
+        between pages on your site. Setting a priority of 1.0 for all URLs
+        will not help you, as the relative priority of pages on your site
+        is what will be considered.
+      </xsd:documentation>
+    </xsd:annotation>
     <xsd:restriction base="xsd:decimal">
       <xsd:minInclusive value="0.0"/>
       <xsd:maxInclusive value="1.0"/>
     </xsd:restriction>
   </xsd:simpleType>
-</xsd:element> 

 </xsd:schema>
\ No newline at end of file

$diff -u siteindex0.8.xml siteindex.xml

--- siteindex0.8.xsd    2015-12-19 12:40:44.782945495 -0300
+++ siteindex.xsd   2015-12-19 12:40:51.318941905 -0300
@@ -1,69 +1,72 @@
 <?xml version="1.0" encoding="UTF-8"?>
-<xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema" targetNamespace="http://www.google.com/schemas/sitemap/0.84" xmlns="http://www.google.com/schemas/sitemap/0.84">
-<xsd:annotation>
-  <xsd:documentation>
-    XML Schema for Sitemap index files.
-    Last Modifed 2005-05-24
-  </xsd:documentation>
-</xsd:annotation>
-
-<xsd:element name="sitemapindex">
+<xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema" targetNamespace="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" elementFormDefault="qualified">
   <xsd:annotation>
     <xsd:documentation>
-      Container for a set of up to 1,000 sitemap URLs.
-      This is the root element of the XML file.
+      XML Schema for Sitemap index files.
+      Last Modifed 2009-04-08
     </xsd:documentation>
   </xsd:annotation>
-  <xsd:complexType>
+
+  <xsd:element name="sitemapindex">
+    <xsd:annotation>
+      <xsd:documentation>
+        Container for a set of up to 50,000 sitemap URLs.
+        This is the root element of the XML file.
+      </xsd:documentation>
+    </xsd:annotation>
+    <xsd:complexType>
+      <xsd:sequence>
+        <xsd:any namespace="##other" minOccurs="0" maxOccurs="unbounded" processContents="strict"/>
+        <xsd:element name="sitemap" type="tSitemap" maxOccurs="unbounded"/>
+      </xsd:sequence>
+    </xsd:complexType>
+  </xsd:element>
+
+  <xsd:complexType name="tSitemap">
+    <xsd:annotation>
+      <xsd:documentation>
+        Container for the data needed to describe a sitemap.
+      </xsd:documentation>
+    </xsd:annotation>
     <xsd:sequence>
-      <xsd:element ref="sitemap" maxOccurs="1000"/>
+      <xsd:element name="loc" type="tLocSitemap"/>
+      <xsd:element name="lastmod" type="tLastmodSitemap" minOccurs="0"/>
+      <xsd:any namespace="##other" minOccurs="0" maxOccurs="unbounded" processContents="strict"/>
     </xsd:sequence>
   </xsd:complexType>
-</xsd:element>
-
-<xsd:element name="sitemap">
-  <xsd:annotation>
-    <xsd:documentation>
-      Container for the data needed to describe a sitemap.
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:complexType>
-    <xsd:all>
-      <xsd:element ref="loc"/>
-      <xsd:element ref="lastmod" minOccurs="0"/>
-    </xsd:all>
-  </xsd:complexType>
-</xsd:element>

-<xsd:element name="loc">
-  <xsd:annotation>
-    <xsd:documentation>
-      REQUIRED: The location URI of a sitemap.
-      The URI must conform to RFC 2396 (http://www.ietf.org/rfc/rfc2396.txt).
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
+  <xsd:simpleType name="tLocSitemap">
+    <xsd:annotation>
+      <xsd:documentation>
+        REQUIRED: The location URI of a sitemap.
+        The URI must conform to RFC 2396 (http://www.ietf.org/rfc/rfc2396.txt).
+      </xsd:documentation>
+    </xsd:annotation>
     <xsd:restriction base="xsd:anyURI">
       <xsd:minLength value="12"/>
       <xsd:maxLength value="2048"/>
     </xsd:restriction>
   </xsd:simpleType>
-</xsd:element> 

-<xsd:element name="lastmod">
-  <xsd:annotation>
-    <xsd:documentation>
-      OPTIONAL: The date the sitemap was last modified. The date must conform
-      to ISO 8601 (http://www.w3.org/TR/NOTE-datetime). Example: 2005-05-10
-      Lastmod may also contain a timestamp. Example: 2005-05-10T17:33:30+08:00
-    </xsd:documentation>
-  </xsd:annotation>
-  <xsd:simpleType>
-    <xsd:restriction base="xsd:string">
-      <xsd:minLength value="10"/>
-      <xsd:maxLength value="25"/>
-    </xsd:restriction>
+  <xsd:simpleType name="tLastmodSitemap">
+    <xsd:annotation>
+      <xsd:documentation>
+        OPTIONAL: The date the document was last modified. The date must conform
+        to the W3C DATETIME format (http://www.w3.org/TR/NOTE-datetime).
+        Example: 2005-05-10
+        Lastmod may also contain a timestamp.
+        Example: 2005-05-10T17:33:30+08:00
+      </xsd:documentation>
+    </xsd:annotation>
+    <xsd:union>
+      <xsd:simpleType>
+        <xsd:restriction base="xsd:date"/>
+      </xsd:simpleType>
+      <xsd:simpleType>
+        <xsd:restriction base="xsd:dateTime"/>
+      </xsd:simpleType>
+    </xsd:union>
   </xsd:simpleType>
-</xsd:element> 
+

 </xsd:schema>
\ No newline at end of file

关于php - 迁移到 "Sitemap Protocol 0.9",我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/13624030/

有关php - 迁移到 "Sitemap Protocol 0.9"的更多相关文章

  1. ruby-on-rails - rails : "missing partial" when calling 'render' in RSpec test - 2

    我正在尝试测试是否存在表单。我是Rails新手。我的new.html.erb_spec.rb文件的内容是:require'spec_helper'describe"messages/new.html.erb"doit"shouldrendertheform"dorender'/messages/new.html.erb'reponse.shouldhave_form_putting_to(@message)with_submit_buttonendendView本身,new.html.erb,有代码:当我运行rspec时,它失败了:1)messages/new.html.erbshou

  2. ruby-on-rails - 由于 "wkhtmltopdf",PDFKIT 显然无法正常工作 - 2

    我在从html页面生成PDF时遇到问题。我正在使用PDFkit。在安装它的过程中,我注意到我需要wkhtmltopdf。所以我也安装了它。我做了PDFkit的文档所说的一切......现在我在尝试加载PDF时遇到了这个错误。这里是错误:commandfailed:"/usr/local/bin/wkhtmltopdf""--margin-right""0.75in""--page-size""Letter""--margin-top""0.75in""--margin-bottom""0.75in""--encoding""UTF-8""--margin-left""0.75in""-

  3. ruby - 检查 "command"的输出应该包含 NilClass 的意外崩溃 - 2

    为了将Cucumber用于命令行脚本,我按照提供的说明安装了arubagem。它在我的Gemfile中,我可以验证是否安装了正确的版本并且我已经包含了require'aruba/cucumber'在'features/env.rb'中为了确保它能正常工作,我写了以下场景:@announceScenario:Testingcucumber/arubaGivenablankslateThentheoutputfrom"ls-la"shouldcontain"drw"假设事情应该失败。它确实失败了,但失败的原因是错误的:@announceScenario:Testingcucumber/ar

  4. ruby-on-rails - 迷你测试错误 : "NameError: uninitialized constant" - 2

    我遵循MichaelHartl的“RubyonRails教程:学习Web开发”,并创建了检查用户名和电子邮件长度有效性的测试(名称最多50个字符,电子邮件最多255个字符)。test/helpers/application_helper_test.rb的内容是:require'test_helper'classApplicationHelperTest在运行bundleexecraketest时,所有测试都通过了,但我看到以下消息在最后被标记为错误:ERROR["test_full_title_helper",ApplicationHelperTest,1.820016791]test

  5. ruby-on-rails - 相关表上的范围为 "WHERE ... LIKE" - 2

    我正在尝试从Postgresql表(table1)中获取数据,该表由另一个相关表(property)的字段(table2)过滤。在纯SQL中,我会这样编写查询:SELECT*FROMtable1JOINtable2USING(table2_id)WHEREtable2.propertyLIKE'query%'这工作正常:scope:my_scope,->(query){includes(:table2).where("table2.property":query)}但我真正需要的是使用LIKE运算符进行过滤,而不是严格相等。然而,这是行不通的:scope:my_scope,->(que

  6. 使用 ACL 调用 upload_file 时出现 Ruby S3 "Access Denied"错误 - 2

    我正在尝试编写一个将文件上传到AWS并公开该文件的Ruby脚本。我做了以下事情:s3=Aws::S3::Resource.new(credentials:Aws::Credentials.new(KEY,SECRET),region:'us-west-2')obj=s3.bucket('stg-db').object('key')obj.upload_file(filename)这似乎工作正常,除了该文件不是公开可用的,而且我无法获得它的公共(public)URL。但是当我登录到S3时,我可以正常查看我的文件。为了使其公开可用,我将最后一行更改为obj.upload_file(file

  7. ruby - 安装 Ruby 时遇到问题(无法下载资源 "readline--patch") - 2

    当我尝试安装Ruby时遇到此错误。我试过查看this和this但无济于事➜~brewinstallrubyWarning:YouareusingOSX10.12.Wedonotprovidesupportforthispre-releaseversion.Youmayencounterbuildfailuresorotherbreakages.Pleasecreatepull-requestsinsteadoffilingissues.==>Installingdependenciesforruby:readline,libyaml,makedepend==>Installingrub

  8. ruby - RVM "ERROR: Unable to checkout branch ."单用户 - 2

    我在新的Debian6VirtualBoxVM上安装RVM时遇到问题。我已经安装了所有需要的包并使用下载了安装脚本(curl-shttps://rvm.beginrescueend.com/install/rvm)>rvm,但以单个用户身份运行时bashrvm我收到以下错误消息:ERROR:Unabletocheckoutbranch.安装在这里停止,并且(据我所知)没有安装RVM的任何文件。如果我以root身份运行脚本(对于多用户安装),我会收到另一条消息:Successfullycheckedoutbranch''安装程序继续并指示成功,但未添加.rvm目录,甚至在修改我的.bas

  9. ruby - 如何关闭 ruby​​ gem "Spreadsheet?"中的文件 - 2

    下面的代码在我第一次运行它时就可以正常工作:require'rubygems'require'spreadsheet'book=Spreadsheet.open'/Users/me/myruby/Mywks.xls'sheet=book.worksheet0row=sheet.row(1)putsrow[1]book.write'/Users/me/myruby/Mywks.xls'当我再次运行它时,我会收到更多消息,例如:/Library/Ruby/Gems/1.8/gems/spreadsheet-0.6.5.9/lib/spreadsheet/excel/reader.rb:11

  10. ruby - Ruby 语法糖有 "rules"吗? - 2

    我正在学习Ruby的基础知识(刚刚开始),我遇到了Hash.[]method.它被引入a=["foo",1,"bar",2]=>["foo",1,"bar",2]Hash[*a]=>{"foo"=>1,"bar"=>2}稍加思索,我发现Hash[*a]等同于Hash.[](*a)或Hash.[]*一个。我的问题是为什么会这样。是什么让您将*a放在方括号内,是否有某种规则可以在何时何地使用“it”?编辑:我的措辞似乎造成了一些困惑。我不是在问数组扩展。我明白了。我的问题基本上是:如果[]是方法名称,为什么可以将参数放在括号内?这看起来几乎——但不完全是——就像说如果你有一个方法Foo.d

随机推荐