Uses of Class
crawlercommons.sitemaps.AbstractSiteMap
-
Packages that use AbstractSiteMap Package Description crawlercommons.sitemaps Classes focused on parsing and processing sitemaps and holding the resulting set of URLs with crawling-related metadata, such as the change frequency of a page.crawlercommons.sitemaps.sax SAX handlers to parse specific elements of XML sitemaps or Atom/RSS feeds. -
-
Uses of AbstractSiteMap in crawlercommons.sitemaps
Subclasses of AbstractSiteMap in crawlercommons.sitemaps Modifier and Type Class Description classSiteMapclassSiteMapIndexMethods in crawlercommons.sitemaps that return AbstractSiteMap Modifier and Type Method Description AbstractSiteMapSiteMapIndex. getSitemap(URL url)Returns the Sitemap that has the given URL.AbstractSiteMapSiteMapIndex. nextUnprocessedSitemap()AbstractSiteMapSiteMapParser. parseSiteMap(byte[] content, URL url)Parse a sitemap, given the content bytes and the URL.AbstractSiteMapSiteMapParser. parseSiteMap(String contentType, byte[] content, AbstractSiteMap sitemap)Returns a processed copy of an unprocessed sitemap object, i.e. transfer the value of getLastModified().AbstractSiteMapSiteMapParser. parseSiteMap(String contentType, byte[] content, URL url)Parse a sitemap, given the MIME type, the content bytes, and the URL.AbstractSiteMapSiteMapParser. parseSiteMap(URL onlineSitemapUrl)Returns a SiteMap or SiteMapIndex given an online sitemap URL Please note that this method is a static method which goes online and fetches the sitemap then parses it This method is a convenience method for a user who has a sitemap URL and wants a "Keep it simple" way to parse it.protected AbstractSiteMapSiteMapParser. processGzippedXML(URL url, byte[] response)Decompress the gzipped content and process the resulting XML Sitemap.protected AbstractSiteMapSiteMapParser. processXml(URL sitemapUrl, byte[] xmlContent)Parse the given XML content.protected AbstractSiteMapSiteMapParser. processXml(URL sitemapUrl, InputSource is)Parse the given XML content.Methods in crawlercommons.sitemaps that return types with arguments of type AbstractSiteMap Modifier and Type Method Description Collection<AbstractSiteMap>SiteMapIndex. getSitemaps()Collection<AbstractSiteMap>SiteMapIndex. getSitemaps(boolean deduplicate)Methods in crawlercommons.sitemaps with parameters of type AbstractSiteMap Modifier and Type Method Description voidSiteMapIndex. addSitemap(AbstractSiteMap sitemap)Add this Sitemap to the list of Sitemaps,AbstractSiteMapSiteMapParser. parseSiteMap(String contentType, byte[] content, AbstractSiteMap sitemap)Returns a processed copy of an unprocessed sitemap object, i.e. transfer the value of getLastModified().static voidSiteMapCrossSubmitValidator. validateSiteMapURLs(AbstractSiteMap sitemap, Collection<String> domains, SiteMapCrossSubmitValidator.CrossSubmitValidationLevel domainValidationLevel)Validation of a sitemap or recursive validation of a sitemap index.voidSiteMapParser. walkSiteMap(AbstractSiteMap sitemap, Consumer<SiteMapURL> action)Traverse a sitemap, recursively fetching and traversing the content of any enclosed sitemap index, and performing the specified action for each sitemap URL until all URLs have been processed or the action throws an exception. -
Uses of AbstractSiteMap in crawlercommons.sitemaps.sax
Methods in crawlercommons.sitemaps.sax that return AbstractSiteMap Modifier and Type Method Description AbstractSiteMapDelegatorHandler. getSiteMap()
-