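# Using OAI-PMH with Google Sitemaps

If your site uses version 2.0 of the
[Open Archives Initiative Protocol for Metadata Harvesting](https://www.openarchives.org/OAI/openarchivesprotocol.html)
(OAI-PMH), an application-independent interoperability framework based on metadata
harvesting, you can use your OAI repository
[as your Sitemap](/search/docs/crawling-indexing/sitemaps/overview).

Submit the baseURL of your OAI repository (for instance,
https://www.example.com/oaiserver). When we query the baseURL, we automatically add query
parameters (such as `?verb=Identify` or `?verb=ListRecords`), so submit only the
`baseURL` itself, without a query string. When we extract the URLs for your site, we expect
the records in the repository to be formatted using Dublin Core, with the URLs embedded in
`<dc:identifier>` tags. Below is a sample record that includes a
`<dc:identifier>` tag. The URL listed in that tag is what we extract.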
```
<oai_dc:dc
 xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
 xmlns:dc="http://purl.org/dc/elements/1.1/"
 xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
 xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/
 http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:title xml:lang="en">A title of extraordinary things</dc:title>
<dc:creator>McCormack, Michael</dc:creator>
<dc:subject>LCSH:Ausdehnungslehre; LCCN QA205.H99; Greatness:Amanda</dc:subject>
<dc:publisher>J. Wiley &amp; Sons</dc:publisher>
<dc:date>Created: 1906; Available: 1991</dc:date>
<dc:type>text</dc:type>
<dc:identifier>https://example.com/physics/1796949</dc:identifier>
<dc:language>english</dc:language>
<dc:rights xml:lang="en">Public Domain</dc:rights>
</oai_dc:dc>
```

As with other Sitemaps, the URLs must be within the same site and at the same directory location
or lower than the baseURL. For instance, if you submit
`https://www.example.com/oaiserver` as the `baseURL`, the following URLs
would be valid:

```
https://www.example.com/
https://www.example.com/samples.html
https://www.example.com/images/
```

However, if you submit `https://www.example.com/dataprovider/oaiserver`, then none of
those URLs would be valid.

Posted by [Vanessa Fox](https://www.vanessafox.com/), September 20, 2005
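To illustrate the query parameters that get added to a submitted baseURL, here is a minimal sketch of how a harvester might build its request URLs. The helper name `build_request_url` is hypothetical, and the `metadataPrefix=oai_dc` argument is taken from the OAI-PMH 2.0 specification (which requires a metadata prefix for `ListRecords`), not from this post. It is not a description of the actual crawler.

```python
from urllib.parse import urlencode


def build_request_url(base_url: str, verb: str, **arguments: str) -> str:
    """Append an OAI-PMH verb and its arguments to a bare baseURL (hypothetical helper)."""
    if "?" in base_url:
        # The baseURL is expected without a query string; the harvester adds one.
        raise ValueError("submit the baseURL without query parameters")
    query = urlencode({"verb": verb, **arguments})
    return f"{base_url}?{query}"


base_url = "https://www.example.com/oaiserver"
identify_url = build_request_url(base_url, "Identify")
# Assumption: records are requested as Dublin Core via metadataPrefix=oai_dc.
list_records_url = build_request_url(base_url, "ListRecords", metadataPrefix="oai_dc")

print(identify_url)
print(list_records_url)
```

Rejecting a baseURL that already carries a query string is one possible design choice here; it mirrors the advice to submit only the baseURL itself.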
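The extraction of URLs from `<dc:identifier>` tags can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `extract_identifier_urls` is hypothetical, the record is an abbreviated version of the sample in this post, and the code assumes the standard Dublin Core namespace URI `http://purl.org/dc/elements/1.1/`.

```python
import xml.etree.ElementTree as ET

# Dublin Core element namespace, as declared by xmlns:dc in the record.
DC_NS = "http://purl.org/dc/elements/1.1/"

# Abbreviated record modeled on the sample in this post.
RECORD = """\
<oai_dc:dc
 xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
 xmlns:dc="http://purl.org/dc/elements/1.1/">
<dc:title xml:lang="en">A title of extraordinary things</dc:title>
<dc:identifier>https://example.com/physics/1796949</dc:identifier>
</oai_dc:dc>
"""


def extract_identifier_urls(record_xml: str) -> list[str]:
    """Return the text of every <dc:identifier> that looks like an http(s) URL."""
    root = ET.fromstring(record_xml)
    urls = []
    for element in root.iter(f"{{{DC_NS}}}identifier"):
        text = (element.text or "").strip()
        # Identifiers can also hold non-URL values (for example ISBNs); skip those.
        if text.startswith(("http://", "https://")):
            urls.append(text)
    return urls


print(extract_identifier_urls(RECORD))
```

Matching on the namespace URI rather than the `dc:` prefix means the sketch still works if a repository binds the Dublin Core namespace to a different prefix.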
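The location rule (same site, same directory as the baseURL or lower) can be expressed as a small check. The sketch below is one possible reading of that rule, not the actual validation logic: the function name `is_within_scope` is hypothetical, and "same site" is interpreted as the same scheme and host.

```python
import posixpath
from urllib.parse import urlsplit


def is_within_scope(base_url: str, url: str) -> bool:
    """Check that url is on the same site as base_url and at or below its directory."""
    base, candidate = urlsplit(base_url), urlsplit(url)
    # Assumption: "same site" means the same scheme and host.
    if (base.scheme, base.netloc.lower()) != (candidate.scheme, candidate.netloc.lower()):
        return False
    # Directory that contains the baseURL, e.g. "/oaiserver" -> "/".
    base_dir = posixpath.dirname(base.path)
    if not base_dir.endswith("/"):
        base_dir += "/"
    return (candidate.path or "/").startswith(base_dir)


urls = [
    "https://www.example.com/",
    "https://www.example.com/samples.html",
    "https://www.example.com/images/",
]
root_results = [is_within_scope("https://www.example.com/oaiserver", u) for u in urls]
nested_results = [
    is_within_scope("https://www.example.com/dataprovider/oaiserver", u) for u in urls
]
print(root_results)
print(nested_results)
```

With the baseURL at the site root, all three example URLs pass the check; with the baseURL under `/dataprovider/`, none of them do, which matches the examples in this post.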