Agents

Agents run on client web servers to locally collate change information from a variety of disparate sources. One or more source-specific agents respond to changes made to the source document or dependent data. A master agent - the SUNA Master - is responsible for collating information from all other local agents and transmits the information to the internet hosted SUNS server.

Read more about Site Update Notification Agents.

db suna

sunadb monitors changes made in database tables that provide data to be displayed on a web page – such as in a Wiki or Blog system. sunadb is fully configurable to support proprietary systems, or one of the pre-configured solutions for popular systems such as Wordpress can be used with minimal customisation.

web suna

sunaweb uses proprietary technology and a custom web crawler to monitor changes made to external websites that provide data to be displayed, for example in a mashup system.

fs suna

sunafs uses a file system change notification mechanism to monitor for any changes that are made to files in the web server’s document root directory and sub directories therein. These directories contain the source files used to produce the website, whether HTML, a client-side script such as JavaScript, or a server-side script repository such as PHP or ASP. sunafs will detect changes made to the source files and determine the external URLs which will be affected by the changes.

man suna

sunaman provides a web-based user interface for webmasters to submit manual notifications of content changes and works in conjunction with sunaweb to identify the scope of the update. The sunaman function is similar to the URL notification form at Yahoo! [8], with the added value of integration with the site update technology of sunaweb and SUSS.

 

 

Benefits

Webmasters and content publishers will improve search result ranking by propagating updates to system subscribers immediately when content is published. Site traffic will be reduced as system subscribers no longer need to indiscriminately crawl the sites downloading unnecessary content, as change notifications are only produced where referenceable information on the site has changed. Site response time will therefore be improved for customers, while reducing bandwidth costs. The maintenance overhead of keeping an accurate and up-to-date sitemap XML file for robots can ultimately be saved.

Read More...

Products for Search Engine Companies and others that Crawler the Web

Products for Web Hosting Companies