Internet technology hosted by Berkman Center
RSS 2.0 Specification
Tuesday, July 15, 2003
Contents
- What is RSS?
- Sample files
- About this document
- Required channel elements
- Optional channel elements
- Elements of
- Comments
- Extending RSS
- Roadmap
- License and authorship
RSS is a Web content syndication format.
Its name is an acronym for Really Simple Syndication.
RSS is a dialect of XML. All RSS files must conform to the XML 1.0 specification, as published on the World Wide Web Consortium (W3C) website.
A summary of RSS version history.
At the top level, a RSS document is a element, with a mandatory attribute called version, that specifies the version of RSS that the document conforms to. If it conforms to this specification, the version attribute must be 2.0.
Subordinate to the element is a single element, which contains information about the channel (metadata) and its contents.
Sample files
Here are sample files for: RSS 0.91, 0.92 and 2.0.
Note that the sample files may point to documents and services that no longer exist. The 0.91 sample was created when the 0.91 docs were written. Maintaining a trail of samples seems like a good idea.
About this document
This document represents the status of RSS as of the Fall of 2002, version 2.0.1.
It incorporates all changes and additions, starting with the basic spec for RSS 0.91 (June 2000) and includes new features introduced in RSS 0.92 (December 2000) and RSS 0.94 (August 2002).
RSS is a dialect of XML. All RSS files must conform to the XML 1.0 specification, as published on the World Wide Web Consortium (W3C) website.
A summary of RSS version history.
At the top level, a RSS document is a
Subordinate to the
Sample files
Here are sample files for: RSS 0.91, 0.92 and 2.0.
Note that the sample files may point to documents and services that no longer exist. The 0.91 sample was created when the 0.91 docs were written. Maintaining a trail of samples seems like a good idea.
About this document
This document represents the status of RSS as of the Fall of 2002, version 2.0.1.
It incorporates all changes and additions, starting with the basic spec for RSS 0.91 (June 2000) and includes new features introduced in RSS 0.92 (December 2000) and RSS 0.94 (August 2002).
Change notes are here.
First we document the required and optional sub-elements of ; and then document the sub-elements of - . The final sections answer frequently asked questions, and provide a roadmap for future evolution, and guidelines for extending RSS.
Required channel elements
Here's a list of the required channel elements, each with a brief description, an example, and where available, a pointer to a more complete description.
Required channel elements
Here's a list of the required channel elements, each with a brief description, an example, and where available, a pointer to a more complete description.
Element | Description | Example |
title | The name of the channel. It's how people refer to your service. If you have an HTML website that contains the same information as your RSS file, the title of your channel should be the same as the title of your website. | GoUpstate.com News Headlines |
link | The URL to the HTML website corresponding to the channel. | http://www.goupstate.com/ |
description | Phrase or sentence describing the channel. | The latest news from GoUpstate.com, a Spartanburg Herald-Journal Web site. |
Optional channel elements
Here's a list of optional channel elements.
Element | Description | Example |
language | The language the channel is written in. This allows aggregators to group all Italian language sites, for example, on a single page. A list of allowable values for this element, as provided by Netscape, is here. You may also use values defined by the W3C. | en-us |
copyright | Copyright notice for content in the channel. | Copyright 2002, Spartanburg Herald-Journal |
managingEditor | Email address for person responsible for editorial content. | geo@herald.com (George Matesky) |
webMaster | Email address for person responsible for technical issues relating to channel. | betty@herald.com (Betty Guernsey) |
pubDate | The publication date for the content in the channel. For example, the New York Times publishes on a daily basis, the publication date flips once every 24 hours. That's when the pubDate of the channel changes. All date-times in RSS conform to the Date and Time Specification of RFC 822, with the exception that the year may be expressed with two characters or four characters (four preferred). | Sat, 07 Sep 2002 00:00:01 GMT |
lastBuildDate | The last time the content of the channel changed. | Sat, 07 Sep 2002 09:42:31 GMT |
category | Specify one or more categories that the channel belongs to. Follows the same rules as the | |
generator | A string indicating the program used to generate the channel. | MightyInHouse Content System v2.3 |
docs | A URL that points to the documentation for the format used in the RSS file. It's probably a pointer to this page. It's for people who might stumble across an RSS file on a Web server 25 years from now and wonder what it is. | http://blogs.law.harvard.edu/tech/rss |
cloud | Allows processes to register with a cloud to be notified of updates to the channel, implementing a lightweight publish-subscribe protocol for RSS feeds. More info here. | |
ttl | ttl stands for time to live. It's a number of minutes that indicates how long a channel can be cached before refreshing from the source. More info here. | |
image | Specifies a GIF, JPEG or PNG image that can be displayed with the channel. More info here. |
|
rating | The PICS rating for the channel. |
|
textInput | Specifies a text input box that can be displayed with the channel. More info here. |
|
skipHours | A hint for aggregators telling them which hours they can skip. More info here. |
|
skipDays | A hint for aggregators telling them which days they can skip. More info here. |
|
sub-element of
is an optional sub-element of , which contains three required and three optional sub-elements.
is the URL of a GIF, JPEG or PNG image that represents the channel.
describes the image, it's used in the ALT attribute of the HTML tag when the channel is rendered in HTML.
is the URL of the site, when the channel is rendered, the image is a link to the site. (Note, in practice the image and should have the same value as the channel's and .
Optional elements include and , numbers, indicating the width and height of the image in pixels. contains text that is included in the TITLE attribute of the link formed around the image in the HTML rendering.
Maximum value for width is 144, default value is 88.
Maximum value for height is 400, default value is 31.
sub-element of sub-element of ' in archive.">
is an optional sub-element of .
It specifies a web service that supports the rssCloud interface which can be implemented in HTTP-POST, XML-RPC or SOAP 1.1.
Its purpose is to allow processes to register with a cloud to be notified of updates to the channel, implementing a lightweight publish-subscribe protocol for RSS feeds.
In this example, to request notification on the channel it appears in, you would send an XML-RPC message to rpc.sys.com on port 80, with a path of /RPC2. The procedure to call is myCloud.rssPleaseNotify.
A full explanation of this element and the rssCloud interface is here.
sub-element of sub-element of ' in archive.">
is an optional sub-element of .
ttl stands for time to live. It's a number of minutes that indicates how long a channel can be cached before refreshing from the source. This makes it possible for RSS sources to be managed by a file-sharing network such as Gnutella.
Example:60
sub-element of sub-element of ' in archive.">
A channel may optionally contain a sub-element, which contains four required sub-elements.
-- The label of the Submit button in the text input area.
-- Explains the text input area.
-- The name of the text object in the text input area.
-- The URL of the CGI script that processes text input requests.
The purpose of the element is something of a mystery. You can use it to specify a search engine box. Or to allow a reader to provide feedback. Most aggregators ignore it.
Elements of- Elements of
- ' in archive.">
A channel may contain any number of- s. An item may represent a "story" -- much like a story in a newspaper or magazine; if so its description is a synopsis of the story, and the link points to the full story. An item may also be complete in itself, if so, the description contains the text (entity-encoded HTML is allowed; see examples), and the link and title may be omitted. All elements of an item are optional, however at least one of title or description must be present.
is the URL of the site, when the channel is rendered, the image is a link to the site. (Note, in practice the image
Optional elements include
Maximum value for width is 144, default value is 88.
Maximum value for height is 400, default value is 31.
It specifies a web service that supports the rssCloud interface which can be implemented in HTTP-POST, XML-RPC or SOAP 1.1.
Its purpose is to allow processes to register with a cloud to be notified of updates to the channel, implementing a lightweight publish-subscribe protocol for RSS feeds.
In this example, to request notification on the channel it appears in, you would send an XML-RPC message to rpc.sys.com on port 80, with a path of /RPC2. The procedure to call is myCloud.rssPleaseNotify.
A full explanation of this element and the rssCloud interface is here.
ttl stands for time to live. It's a number of minutes that indicates how long a channel can be cached before refreshing from the source. This makes it possible for RSS sources to be managed by a file-sharing network such as Gnutella.
Example:
A channel may optionally contain a
-- The URL of the CGI script that processes text input requests.
The purpose of the
Elements of
A channel may contain any number of
Element | Description | Example |
title | The title of the item. | Venice Film Festival Tries to Quit Sinking |
link | The URL of the item. | http://nytimes.com/2004/12/07FEST.html |
description | The item synopsis. | Some of the most heated chatter at the Venice Film Festival this week was about the way that the arrival of the stars at the Palazzo del Cinema was being staged. |
author | Email address of the author of the item. More. | oprah\@oxygen.net |
category | Includes the item in one or more categories. More. |
|
comments | URL of a page for comments relating to the item. More. | http://www.myblog.org/cgi-local/mt/mt-comments.cgi?entry_id=290 |
enclosure | Describes a media object that is attached to the item. More. |
|
guid | A string that uniquely identifies the item. More. | http://inessential.com/2002/09/01.php#a2 |
pubDate | Indicates when the item was published. More. | Sun, 19 May 2002 15:21:36 GMT |
source | The RSS channel that the item came from. More. |
|
|
Unless otherwise labeled by its originating author, the content found on this site is made available under the terms of an Attribution/Share Alike Creative Commons license, with the exception that no rights are granted -- since they are not ours to grant -- in any logo, graphic design, trademarks or trade names, including the Harvard name. Last update: 4/29/07; 7:33:52 AM.
Internet technology hosted by Berkman Center
RSS History
Tuesday, April 6, 2004
There are a lot of folk legends about the evolution of RSS. Here's the scoop, the sequence of events in the life of RSS, as told by the designer of most of the formats.
- scriptingNews format, designed by DW at UserLand. 12/27/97.
- RSS 0.90, designed by Netscape, for use with my.netscape.com, which also supported scriptingNews format. The only thing about it that was RDF was the header, otherwise it was plain garden-variety XML. 3/15/99.
- scriptingNews 2.0b1, designed by DW at UserLand, enhanced to include all the features in RSS 0.90. Privately DW urged Netscape to adopt the features in this format that weren't present in RSS 0.90. 6/15/99.
- RSS 0.91, designed by Netscape, spec written by Dan Libby, includes most features from scriptingNews 2.0b1. "We're trying to move towards a more standard format, and to this end we have included several tags from the popular
format." The RDF header is gone. 7/10/99.
- UserLand adopts RSS 0.91, deprecates scriptingNews formats. 7/28/99.
- The RSS team at Netscape evaporates.
- UserLand's RSS 0.91 specification. 6/4/00.
- RSS 1.0 published as a proposal, worked on in private by a group led by Rael Dornfest at O'Reilly. Based on RDF and uses namespaces. Most elements of previous formats moved into modules. Like 0.90 it has an RDF header, but otherwise is a brand-new format, not related to any previous format. 8/14/00.
- RSS 0.92, which is 0.91 with optional elements, designed by DW at UserLand. 12/25/00.
- RSS 0.93 discussed but never deployed. 4/20/01.
- MetaWeblog API merges RSS 0.92 with XML-RPC to provide a powerful blogging API. 3/14/02.
- RSS 2.0, which is 0.92 with optional elements, designed by DW, after leaving UserLand. MetaWeblog API updated for RSS 2.0. While in development, this format was called 0.94. 9/18/02.
- RSS 2.0 spec released through Harvard under a Creative Commons license. 7/15/03.
|
Unless otherwise labeled by its originating author, the content found on this site is made available under the terms of an Attribution/Share Alike Creative Commons license, with the exception that no rights are granted -- since they are not ours to grant -- in any logo, graphic design, trademarks or trade names, including the Harvard name. Last update: 4/14/07; 2:41:35 PM.
No hay comentarios:
Publicar un comentario