0.01
No commit activity in last 3 years
No release in over 3 years
MassiveSitemap - build huge sitemaps painfree. Differential updates keeps generation time short and reduces load on DB. It's heavealy inspired by BigSitemaps and offers compatiable API
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

>= 0
>= 0
 Project Readme

MassiveSitemap massive_sitemap Documentation

Build painfree sitemaps for websites with millions of pages

MassiveSitemap is a successor project of BigSitemap, a Sitemap generator for websites with millions of pages. It implements various generation stategies, e.g. to split large Sitemaps into multiple files, gzip files to minimize bandwidth usage, or incremental updates. Its API is very similar to BigSitemap, can be set up with just a few lines of code and is compatible with just about any framework.

Usage

A simple usecase which fits most of the standard scenarios. This example adds http://test.com/about to the sitemap.

require 'massive_sitemap'

index_url = MassiveSitemap.generate(:url => 'test.com') do
  add "/about"
end
MassiveSitemap.ping(index_url)

Using Rails (ActiveRecord)

This example itterates of the User resource and adds each with a change_frequency, last_modified and priority to the sitemap. In case there are more than 50.000 users, the sitemap will be auto-split in multiple files.

require 'massive_sitemap'

index_url = MassiveSitemap.generate(:url => 'test.com') do
  User.all do |user|
    add "/users/#{user.id}", :change_frequency => 'weekly', :last_modified => user.updated_at, :priority => 0.9
  end
end
MassiveSitemap.ping(index_url)

Structure

MassiveSitemap is structured in two major parts: Builder and Writer. Both offer an abstract interface which is tailored to the specific needs.

Builder

Builder keeps all the sitemap structure related logic to build the XML data. Builder::Index does the similar for the index structure. Builder::Rotation is an extension to make sure no more than 50k urls are written per files, according to sitemap specs.

Writer

The Writer takes care of the storage. At top level, that's just a string (Writer::String), however Writer::File stores to files, Writer::GzipFile gzips it as well. Writer keeps the state of the files and implements various strategies how to update the files.

Further extension and customization can easily be done, e.g. a Writer::S3 extenstion stores the sitemap files to Amazon S3 .

Contributing

We'll check out your contribution if you:

  • Provide a comprehensive suite of tests for your fork.
  • Have a clear and documented rationale for your changes.
  • Package these up in a pull request.

We'll do our best to help you out with any contribution issues you may have.

License

The license is included as LICENSE in this directory.