Package | Description |
---|---|
crawlercommons.sitemaps | The Sitemaps package provides all classes relevant to focused sitemap parsing, URL definition and processing. |

Methods in crawlercommons.sitemaps that return SiteMap

Modifier and Type | Method and Description |
---|---|
protected SiteMap | SiteMapParser.parseSyndicationFormat(URL sitemapUrl, Document doc): Parse the XML document, looking for a feed element to determine if it's an Atom doc, or an rss element to determine if it's an RSS doc. |
protected SiteMap | SiteMapParser.parseXmlSitemap(URL sitemapUrl, Document doc): Parse XML that contains a valid Sitemap. |
protected SiteMap | SiteMapParser.processText(String sitemapUrl, byte[] content): Process a text-based Sitemap. |
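
The element check described for parseSyndicationFormat can be illustrated with a small JDK-only sketch. It is not the crawler-commons implementation: the class FeedKindSketch, the enum Kind, and the methods parse and detect are hypothetical names, and the sketch assumes that inspecting only the document's root element is enough to tell the two formats apart.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper for illustration only; not part of crawler-commons.
final class FeedKindSketch {

    // Hypothetical labels for the detected syndication format.
    enum Kind { ATOM, RSS, UNKNOWN }

    // Parses an XML string into a DOM Document using only JDK classes.
    static Document parse(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            // Reject DOCTYPE declarations as a precaution with untrusted input.
            factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
            return factory.newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    // Assumption: the root element name alone identifies the format
    // ("feed" for Atom, "rss" for RSS).
    static Kind detect(Document doc) {
        Element root = doc.getDocumentElement();
        String name = root.getLocalName() != null ? root.getLocalName() : root.getNodeName();
        if ("feed".equals(name)) {
            return Kind.ATOM;
        }
        if ("rss".equals(name)) {
            return Kind.RSS;
        }
        return Kind.UNKNOWN;
    }

    public static void main(String[] args) {
        String atom = "<feed xmlns=\"http://www.w3.org/2005/Atom\"><title>t</title></feed>";
        String rss = "<rss version=\"2.0\"><channel><title>t</title></channel></rss>";
        System.out.println(detect(parse(atom)));
        System.out.println(detect(parse(rss)));
    }
}
```

A document whose root is neither feed nor rss maps to UNKNOWN here; how SiteMapParser itself reacts to such input is not stated on this page.
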

Methods in crawlercommons.sitemaps with parameters of type SiteMap

Modifier and Type | Method and Description |
---|---|
protected void | SiteMapParser.addUrlIntoSitemap(String urlStr, SiteMap siteMap, String lastMod, String changeFreq, String priority, int urlIndex): Adds the given URL to the given sitemap and logs the relevant messages. |
protected void | SiteMapParser.parseAtom(SiteMap sitemap, Element elem, Document doc): Parse the XML document, which is assumed to be in Atom format. |
protected void | SiteMapParser.parseRSS(SiteMap sitemap, Document doc): Parse the XML document, which is assumed to be in RSS format. |
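
To make the difference between the Atom and RSS cases handled by parseAtom and parseRSS concrete, the sketch below pulls page URLs out of each format using only the JDK DOM API. It is an illustration under stated assumptions, not the library's code: FeedLinkSketch, parse, atomLinks and rssLinks are hypothetical names, and the sketch assumes that the first usable link of each Atom entry or RSS item is the one of interest.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper for illustration only; not part of crawler-commons.
final class FeedLinkSketch {

    // Parses an XML string into a namespace-aware DOM Document.
    static Document parse(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            return factory.newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    // Atom: each <entry> carries its URL in the href attribute of a <link> element.
    static List<String> atomLinks(Document doc) {
        List<String> urls = new ArrayList<>();
        NodeList entries = doc.getElementsByTagNameNS("*", "entry");
        for (int i = 0; i < entries.getLength(); i++) {
            NodeList links = ((Element) entries.item(i)).getElementsByTagNameNS("*", "link");
            for (int j = 0; j < links.getLength(); j++) {
                String href = ((Element) links.item(j)).getAttribute("href");
                if (!href.isEmpty()) {
                    urls.add(href); // assumption: keep only the first link per entry
                    break;
                }
            }
        }
        return urls;
    }

    // RSS: each <item> carries its URL as the text content of a <link> element.
    static List<String> rssLinks(Document doc) {
        List<String> urls = new ArrayList<>();
        NodeList items = doc.getElementsByTagNameNS("*", "item");
        for (int i = 0; i < items.getLength(); i++) {
            NodeList links = ((Element) items.item(i)).getElementsByTagNameNS("*", "link");
            for (int j = 0; j < links.getLength(); j++) {
                String text = links.item(j).getTextContent().trim();
                if (!text.isEmpty()) {
                    urls.add(text); // assumption: keep only the first non-empty link per item
                    break;
                }
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        String atom = "<feed xmlns=\"http://www.w3.org/2005/Atom\">"
                + "<entry><link href=\"http://example.com/a\"/></entry></feed>";
        String rss = "<rss version=\"2.0\"><channel>"
                + "<item><link>http://example.com/b</link></item></channel></rss>";
        System.out.println(atomLinks(parse(atom)));
        System.out.println(rssLinks(parse(rss)));
    }
}
```

The sketch only collects URL strings. The real methods receive a SiteMap to populate, and this page does not describe how they handle other fields such as lastMod, changeFreq, or priority.
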
Copyright © 2009–2016 Crawler-Commons. All rights reserved.