Uses of Class
crawlercommons.sitemaps.AbstractSiteMap
-
Packages that use AbstractSiteMap Package Description crawlercommons.sitemaps Classes focused on parsing and processing sitemaps and holding the resulting set of URLs with crawling-related metadata, such as the change frequency of a page.crawlercommons.sitemaps.sax SAX handlers to parse specific elements of XML sitemaps or Atom/RSS feeds. -
-
Uses of AbstractSiteMap in crawlercommons.sitemaps
Subclasses of AbstractSiteMap in crawlercommons.sitemaps Modifier and Type Class Description class
SiteMap
class
SiteMapIndex
Methods in crawlercommons.sitemaps that return AbstractSiteMap Modifier and Type Method Description AbstractSiteMap
SiteMapIndex. getSitemap(URL url)
Returns the Sitemap that has the given URL.AbstractSiteMap
SiteMapIndex. nextUnprocessedSitemap()
AbstractSiteMap
SiteMapParser. parseSiteMap(byte[] content, URL url)
Parse a sitemap, given the content bytes and the URL.AbstractSiteMap
SiteMapParser. parseSiteMap(String contentType, byte[] content, AbstractSiteMap sitemap)
Returns a processed copy of an unprocessed sitemap object, i.e. transfer the value of getLastModified().AbstractSiteMap
SiteMapParser. parseSiteMap(String contentType, byte[] content, URL url)
Parse a sitemap, given the MIME type, the content bytes, and the URL.AbstractSiteMap
SiteMapParser. parseSiteMap(URL onlineSitemapUrl)
Returns a SiteMap or SiteMapIndex given an online sitemap URL Please note that this method is a static method which goes online and fetches the sitemap then parses it This method is a convenience method for a user who has a sitemap URL and wants a "Keep it simple" way to parse it.protected AbstractSiteMap
SiteMapParser. processGzippedXML(URL url, byte[] response)
Decompress the gzipped content and process the resulting XML Sitemap.protected AbstractSiteMap
SiteMapParser. processXml(URL sitemapUrl, byte[] xmlContent)
Parse the given XML content.protected AbstractSiteMap
SiteMapParser. processXml(URL sitemapUrl, InputSource is)
Parse the given XML content.Methods in crawlercommons.sitemaps that return types with arguments of type AbstractSiteMap Modifier and Type Method Description Collection<AbstractSiteMap>
SiteMapIndex. getSitemaps()
Collection<AbstractSiteMap>
SiteMapIndex. getSitemaps(boolean deduplicate)
Methods in crawlercommons.sitemaps with parameters of type AbstractSiteMap Modifier and Type Method Description void
SiteMapIndex. addSitemap(AbstractSiteMap sitemap)
Add this Sitemap to the list of Sitemaps,AbstractSiteMap
SiteMapParser. parseSiteMap(String contentType, byte[] content, AbstractSiteMap sitemap)
Returns a processed copy of an unprocessed sitemap object, i.e. transfer the value of getLastModified().static void
SiteMapCrossSubmitValidator. validateSiteMapURLs(AbstractSiteMap sitemap, Collection<String> domains, SiteMapCrossSubmitValidator.CrossSubmitValidationLevel domainValidationLevel)
Validation of a sitemap or recursive validation of a sitemap index.void
SiteMapParser. walkSiteMap(AbstractSiteMap sitemap, Consumer<SiteMapURL> action)
Traverse a sitemap, recursively fetching and traversing the content of any enclosed sitemap index, and performing the specified action for each sitemap URL until all URLs have been processed or the action throws an exception. -
Uses of AbstractSiteMap in crawlercommons.sitemaps.sax
Methods in crawlercommons.sitemaps.sax that return AbstractSiteMap Modifier and Type Method Description AbstractSiteMap
DelegatorHandler. getSiteMap()
-