skip over left navigation links (press 4)

Bayesian Classification Project

Content-Based Data Services

A number of content-based data services are currently being explored at the GES DISC. These are illustrated in the image below and discussed in the following paragraphs.

Diagram showing content-based data services

Content-Based Subscriptions

Currently, the best a user can hope for is to qualify subscriptions based on characteristics of the whole scene. However, a scene that is 75% clear overall may still have the user's area of interest completely covered in clouds (top right). Content-based subscriptions will allow a user to specify that a certain area within the scene must be clear in order for it to fill the subscription.

Content-Based Subsetting

One technique is to deliver to the user only the pixels meeting certain content criteria, such as clear-sky. In this case, we use the content assessment to "zero out" cloudy pixels, and then apply a compression algorithm to the data (thus preserving the data structure). As a result, data files can be reduced by one-half to two-thirds. This is currently implemented for MODIS real-time data at the GES DAAC using on-the-fly subsetting. In order to invoke it, one replaces the ".hdf" extension with the ".clr" extension when doing an ftp "get".

Content-Based Cache Management

Online disk space, while much more available today than previously, is nevertheless limited. In order to make maximum use of this area, we are developing a mechanism to set the expiration time on disk based on how "interesting" we expect the data to be to the user community at large. Thus, relatively clear scenes will stay on the disk longer than mostly cloudy scenes, and scence where particular areas of interest are clear (e.g. validation sites) will stay on disk even longer.



NASA Logo - nasa.gov

  • Last updated: February 15, 2006 01:29:10 GMT