Package | Description |
---|---|
crawlercommons.sitemaps | Sitemaps package provides all classes relevant to focused sitemap parsing, URL definition and processing. |
Modifier and Type | Method and Description |
---|---|
`static void` | `SiteMapTester.main(String[] args)` |
`AbstractSiteMap` | `SiteMapParser.parseSiteMap(byte[] content, URL url)` Parse a sitemap, given the content bytes and the URL. |
`AbstractSiteMap` | `SiteMapParser.parseSiteMap(String contentType, byte[] content, AbstractSiteMap sitemap)` Returns a processed copy of an unprocessed sitemap object, i.e. |
`AbstractSiteMap` | `SiteMapParser.parseSiteMap(String contentType, byte[] content, URL url)` Parse a sitemap, given the MIME type, the content bytes, and the URL. |
`AbstractSiteMap` | `SiteMapParser.parseSiteMap(URL onlineSitemapUrl)` Returns a SiteMap or SiteMapIndex given an online sitemap URL. |
`protected SiteMap` | `SiteMapParser.parseSyndicationFormat(URL sitemapUrl, Document doc)` Parse the XML document, looking for a `feed` element to determine whether it is an Atom document, or an `rss` element to determine whether it is an RSS document. |
`protected AbstractSiteMap` | `SiteMapParser.processGzip(URL url, byte[] response)` Decompress the gzipped content and process the resulting XML Sitemap. |
`protected AbstractSiteMap` | `SiteMapParser.processXml(URL sitemapUrl, byte[] xmlContent)` Parse the given XML content. |
`protected AbstractSiteMap` | `SiteMapParser.processXml(URL sitemapUrl, InputSource is)` Parse the given XML content. |
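
Two of the steps described in the table above, decompressing gzipped content and inspecting the XML root element to tell a sitemap from an Atom or RSS feed, can be sketched with the JDK alone. This is an illustrative sketch and not the crawler-commons implementation: `SitemapSniffer`, `gunzip`, `gzip`, and `classify` are hypothetical names, and the `urlset` and `sitemapindex` root element names are taken from the sitemaps.org protocol rather than from this page.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

// Hypothetical helper for illustration; not part of crawler-commons.
final class SitemapSniffer {

    /** Decompress gzipped bytes into the raw (XML) bytes. */
    static byte[] gunzip(byte[] gzipped) throws IOException {
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(gzipped));
             ByteArrayOutputStream out = new ByteArrayOutputStream()) {
            byte[] buf = new byte[4096];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        }
    }

    /** Compress bytes with gzip; only used here to build demo input. */
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(raw);
        }
        return out.toByteArray();
    }

    /** Classify an XML document by the local name of its root element. */
    static String classify(byte[] xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Refuse DOCTYPE declarations so external entities cannot be expanded.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        Document doc = factory.newDocumentBuilder().parse(new ByteArrayInputStream(xml));
        String root = doc.getDocumentElement().getLocalName();
        switch (root) {
            case "urlset":       return "sitemap";
            case "sitemapindex": return "sitemap index";
            case "feed":         return "atom";
            case "rss":          return "rss";
            default:             return "unknown";
        }
    }

    public static void main(String[] args) throws Exception {
        String xml = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
                + "<urlset xmlns=\"http://www.sitemaps.org/schemas/sitemap/0.9\">"
                + "<url><loc>http://www.example.com/</loc></url></urlset>";
        byte[] gz = gzip(xml.getBytes(StandardCharsets.UTF_8));
        System.out.println(classify(gunzip(gz))); // prints "sitemap"
    }
}
```

The sketch only reports the detected format. A real parser would go on to extract the URLs, and the library methods listed above return `AbstractSiteMap` or `SiteMap` objects rather than strings.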
Copyright © 2009–2016 Crawler-Commons. All rights reserved.