Abstract - Future NASA Earth science missions, including the Earth Observing
System (EOS), will generate vast amounts of data that must be processed and
stored at various locations around the world. This article presents a
stepwise-refinement of the Intelligent Database Management (IDM) system of the
Distributed Active Archive Center (DAAC - one of seven regionally-located
EOSDIS archive sites) architecture to showcase the telecommunications issues
involved. This architecture is developed into a general overall design. It is shown
that the current evolution of protocols is sufficient to support IDM at Gbps rates
over large distances. It is also shown that the network design can accommodate a
flexible data ingestion storage pipeline and a user extraction and visualization
engine, without interference between the two.
This data is processed prior to storage to facilitate access, and is retrieved and converted into a useful form. These functions comprise the EOS Data Management Software System (DMSS), which is an example of a more general concept called Intelligent Database Management (IDM). This article presents an overview of the telecommunications issues of IDM, which involves data ingestion, storage, fusion, and rendering. Data ingestion is the processing of data prior to storage. Data fusion is the combining of various streams of stored data to form a composite information base suitable for direct rendering. The components of IDM for EOS are distributed globally over large distances (over 2000 miles) and bandwidths (1 Gigabit/second). Thus, telecommunications issues, including latency reduction, high bandwidth protocols, and distributed resource allocation are a fundamental component of IDM.
This article presents a stepwise-refinement of the Distributed Active Archive Center (DAAC - one of seven regionally-located EOSDIS archive sites) architecture. Discussion also focuses on how current protocols are sufficient to support the IDM DAAC. A network design is described that accommodates both a flexible data ingestion storage pipeline and a user extraction and visualization engine.
The most abstract description of DAAC is a set of continuous satellite data input streams (between 16 Mbps and 26 Kbps, totaling 25 Mbps on average and 164 Mbps peak), and a 200-500 Mbps sporadic, user visualization stream, with low BW user commands. Internally, the input and output are related only by storage, i.e., the input stream archiving and output stream generation are independent. The continuous input archive stream (ingestion) is partitionedfrom the user command and visualization streams (extract), both of which operate on the data store. There also may be multiple ingestion and extraction streams per DAAC.
The general design proposed uses separate subnetworks of heterogeneous processors - one for ingestion and the other for extraction. The processors and subnetworks form a dynamically-configurable dataflow engine, where subnetwork partitioning inhibits interference and provides reconfigurability. It is shown that the telecommunications aspects of IDM can be managed by this physical resource partitioning.
More importantly, it is shown that existing protocols, or existing proposals to evolve these protocols, are sufficient to support IDM. There is a growing controversy in protocol research involving the use of existing protocols for high speed (Gbps), wide area (2,000+ mile) environments. There are several protocol issues involved, including soft real-time delivery (i.e., jitter control), guaranteed bandwidth (reservation), and accommodation of high bandwidth-delay product links. Issues under investigation (without evolutionary solutions) include hard real-time delivery (scheduled delivery constraints) and methods for latency reduction. IDM data processing (both ingestion and visualization) requires isochronous data transfer, i.e., controlled jitter in transmission and processing. Fortunately, the data collection is automated and uses loosely-coupled feedback from ground control, rather than from user visualization. Latency reduction affects only the user-visualization control loop. The user-perceived latency is likely to be affected more by the extraction processing latency than by the propagation latency (typically 100 ms). Existing evolutionary modifications to existing data transport protocols accommodate soft real-time transfer (RTP - the Real Time Protocol), bandwidth reservation (ST-II, RSVP - Resource reSerVation Protocol), and high bandwidth-delay product links given continuous data streams (TCP - Transmission Control Protocol - extensions for `long, fat networks').
Some of the initial documents of the IDM project have described various aspects of the project, but none has considered the specific telecommunications aspects within this project, nor the impact of those issues on the other design considerations. This study attempts to augment those discussions sufficiently to suggest a design of the telecommunications system, from which other design criteria implications will be readily evident.
One document describes the scaling issues of the database and processing components of the project, but admits that the processing load cannot be accurately determined a priori (Wharton, Chang, & Krupp, 1991). This indicates that a scalable processing solution is required, one in which dynamic load configuration is possible. Other documents describe the static issues of database and visualization access to the EOSDIS (McDonald & Blake, 1991), or the data distribution and archiving requirements (Quirk & Thompson, 1991). There are dynamic corollaries to these static issues that relate to reconfiguring of the system in a flexible way.
Management considerations mandate a relatively centralized facility or small set of facilities (the DAACs) (Ramapriyan & McConaughy, 1991). Relieving a centralized load requires a distributed facility, provided that the data distribution is not orthogonal to the geographic configuration. Because the DAAC facilities are geographically distributed, processing within the DAAC should occur at a MAN (Metropolitan-Area Network) or LAN (Local-Area Network) scale. The partitioning of data into functional and operational sets among the DAACs indicates that inter-DAAC access, i.e., processing requiring the participation of more than a single DAAC, would be unlikely at first.
Data modeling is described using spatial, spectral, temporal, etc. characteristics (Campbell & Short, 1991). This includes a description of the effects of the variation in access method on the storage organization. These descriptions can be easily augmented to include communications and dynamic post-processing costs, so as to describe the telecommunications effects on data organization as well. The only difficulty is that the telecommunications costs are distributed, whereas the access method frequencies proposed are local to a particular DAAC component.
Finally, the high-performance processing of satellite data before initial archiving requires the use of specialized equipment and systems (Chesney, Speciale, Horner, & Sabia, 1990; Shi & Grebowsky, 1990). The individual board design of these components and the functional decomposition into processing elements is a scalable solution (Shi & Grebowsky, 1990). The board integration currently relies on existing technologies for system design (VME/VSB), rather than on true networking of components. The resulting pipeline permits chaining of processing elements within a single processing node (i.e., VME/VSB backplane), but not among different backplanes. Further, components of a single system can be used to process at most a single data stream, thus prohibiting an ultimately flexible design.
The front-end user system involves expert systems (Short, 1991), connectionism (Short & Shastri, 1990) or neural networks (Cromp, 1991), and support for user visualization tools. All of these are user-level front-end issues, and can require sharing of scheduling information at the user-access level. This indicates a network of user-support system (with loose coupling of state information on the availability of back-end resources) and other competing front-end sessions.
The back-end satellite processing system involves the use of specialized hardware (Chesney et al., 1990; Shi & Grebowsky, 1990). Each of these systems can be independent, as it processes a specific data stream from a given satellite individually.
The data processing for archiving can also be independent because of the partitioning of the databases among the DAAC sites (Ramapriyan and McConaughy, 1991). Data reorganization is presumed to occur within operational units of the DAACs (Campbell & Short, 1991).
The extraction processing, however, has not been thoroughly considered in relation to the system design (Wharton et al., 1991). Current distributed systems design indicates a back-end network of dynamically-allocated processing elements, which can be configured according to the extraction processing needs of the users. Such a system feature is exhibited by a networked version of the back-end satellite processing system, with a few modifications (see below). The goal is a back-end network for the servicing of the user-level requests, according to the decompositions suggested by the expert systems at the front-end.
There are other services that are of use, especially in the software portion of the system. Current protocol technology is sufficient to support the data rates and characteristics of the large linear streams of information arising from satellite measurements. These include TCP/IP, for streaming data transfer. Other options include remote evaluation (late-binding RPC), and the conventional remote procedure call (RPC). Conventional RPC requires sending the data to a remote site and retrieving the results, a mechanism that describes the dynamic allocation of processing components but requires a central controller to scatter and gather the data, creating a communications bottleneck that existing networks cannot support. Late-binding RPC permits the processing components to transmit the results along to subsequent RPC's, rather than requiring collection of the results at the originator of the first RPC. This permits a dynamic pipeline to be created within an existing telecommunications paradigm.
The bandwidth requirements of this network (1 terabit/day) average out to a continuous 11.6 Megabits/second (Wharton et al., 1991). Conventional LAN technology (i.e., Ethernet) supports 10 Mbps, but only to a maximum of 80% load, i.e., 8 Mbps (Tanenbaum, 1988). This theoretical maximum assumes a single source station on a network; competition among multiple sources decreases this to 60% (6 Mbps) (Tanenbaum, 1988). Even a lightly loaded Ethernet is therefore unsuitable for even a single hop in the data path from satellite to disk storage.
New fiber optic LAN technology (FDDI) supports rates of 100-200 Mbps, large enough to support several simultaneous hops of the data stream if that stream is buffered and averaged over a 24-hour period (requiring 1 Terabit of tape delay). Burst data characteristics are not understood at this time, either within the EOSDIS system or in telecommunications in general, and so are not factored into any proposed solutions.
Rate control methods provide processing adjustment to reduce queuing requirements, and reduce resulting jitter in the packet flow. `Stop-and-go queuing', `Leaky bucket', and `Virtual Clock' are all similar methods for rate adjustment. However, none are currently included in stable transport protocol implementations. Another protocol of interest, especially in the data ingestion operation, is the XTP protocol. XTP is a lightweight protocol that is designed to be implemented in VLSI hardware.
Other methods of achieving high performance protocols are not required here. These include other lightweight protocols, such as protocols that support fast RPC, or low latency transactions, or methods to reduce protocol complexity. The frequency of transactions in the DMSS system is not high enough to warrant the consideration of these new protocols.
One suggestion for possible research in this area would be the development of a VME/VSB virtual backplane, one that would permit the arbitrary interconnection of board-level components but that would exhibit a conventional VME/VSB interface. This crossbar-backplane would permit multiple FDDI interfaces per system aggregate, and thus multiple LAN loops among systems, permitting a more flexible implementation. This latter solution could be implemented incrementally after the initial implementation phase of the design.
Data ingestion occurs on the way into the inner data storage LAN but does not use the outer security/scheduling LAN for access, since satellite-originated paths are presumed secured at the source. Further, the process of ingestion need not alter the meta-data storage until archived inside the inner LAN.
The best way to understand these observations is to see their evolution and extraction from the existing characteristics of the DMSS system. Here we present a step-wise refinement of the telecommunications structure. We also present a description of the data flow and meta-data flows of the system, all to finally define the characteristics of the system sufficient to indicate a design.
The input satellite data stream of 1 Terabit/day averages out to 11.5 Mbps (Figure 1). The user commands require negligible bandwidth, both because of their small textual content and their sporadic nature. The user visualization estimate is based on a 1000x1000 pixel display, changing at a rate of 24 frames/second (movie-quality video), at a depth of between 8 and 24 bits/pixel. This results in a session bandwidth of between approximately 200 and 600 Mbps, or 8-24 Mbits per frame.
The input satellite stream thus requires a T-3 signal line (45 Mbps), assuming the 1 Terabit/day rate can be smoothed to per-second equivalent. The user commands can be accepted over conventional modem/dialup lines. The raw visualization stream requires STS-12 (622 Mbps) rates, which are unlikely to be available for user deployment in the timeframe of this project. A lossless intraframe compression at these rates may be available, and would result in a 20-40 Mbps stream, which could be supported by FDDI LAN technology (100 Mbps). Lossy compression, such as JPEG, can further reduce this requirement to the Ethernet LAN realm at approximately 4-8 Mbps.
The processing requirements of this diagram are described in (Wharton et al., 1991). At this point, the processing and communications requirements are not sufficiently specified to determine the design, as was noted in the NASA analysis (Wharton et al., 1991). At this point, it is evident that this partitioning is not optimal, because the satellite data input stream and user visualization streams are largely independent, yet are processed in a single entity.
User commands, therefore, interact with the extraction process, but not the ingestion, which can be relegated to a separate component. Further, because the database is responding largely to command information (versus data) from the command and extraction interfaces, the database might benefit from a partitioned internal structure, so that data input into an unorganized archive component can be isolated from the extraction access bandwidth requirements.

We do not replicate the database component of the DAAC because we consider user requests within a single DAAC only. This is reasonable, because the data of the DAACs is partitioned based on expected use and semantic content. Merging of data streams may occur but is expected to be managed by the merging of independently delivered visualization streams from a number of independent DAAC sources.
Figure 5 denotes the interaction between the command processing elements operating in a scheduling capacity. The extraction processes are shown as independent, because determining overlapping computation is intractable and of little benefit given independent user control. The satellite processing streams are independent, but the number of pipeline stages is flexible and may be allocated from an aggregate rather than within dedicated system sets.
The decomposition shown here indicates the components of interaction and the bandwidth characteristics between them. It is also useful to view the data streams by semantic partitioning in data flow and meta-data flow diagrams within the same structure.

The satellite data flow consists of 1 Tbps streams pipelined through archival processing systems (Figure 6). If these systems are statically specified, the existing design of fixed-pipeline configuration will suffice (Chesney et al., 1990). If the satellite processing components are dynamically allocated, a network must be established among the elements. We assume here that these processing stages are largely static because of the data that would be lost during any reconfiguration. Thus, the satellite data flows represent fixed interconnections beyond the underlying dynamic network design. Further, scalability is provided by the addition of separate processing systems for additional satellite data streams in an independent fashion with linear cost. The streams can be compressed from the satellite to the pipeline processor, but the interprocess bandwidth requirements do not necessitate intermediate compression prior to storage.
It is noted that compression of archived data may have unanticipated effects on the communication load of the user visualization processing, as well as affecting retrieval. Extraction of particular information is complicated by stream encoding because it may require the decoding of a large section of data to locate a particular item, especially if the encoding destroys the key information. Effort should be made to avoid this if random access is required.
Further, a variable bit-rate encoding may cause fluctuating loads on the storage and extraction processes. While the storage process may be able to accommodate this fluctuation, the output data fluctuation will generate variable bit-rate streams to the extraction processors, which will require jitter control to permit stream merging. Recent research has also indicated that variable bit-rate streams can cause interference effects in networks, even when the streams do not directly compete for resources.
The user visualization may require an arbitrary amount of pipelining, merging, and interleaving of extracted archive information (Figure 7). Whereas the individual streams are independent to permit independence in user control, the allocation of resources to the extraction processes is necessarily highly dynamic. The resources of extraction processes should be part of a dynamically reconfigurable network to allow additional resources to be added for additional functionality or scale of service.
Processing the streams usually occurs in the uncompressed domain, so any compression should occur at the final stage before user output. As a result, compression cannot be effectively used to reduce internal network load. The resulting interaction requires a very high bandwidth, very high connectivity network, such as BISDN (i.e., ATM).
If the user visualizations are restricted to conventional resolutions (500 x 500 at 1-8 bits, rather than 1000 x 1000 at 8-24 bits depth), the data streams are reduced from 200-600 Mbps to 6-50 Mbps, at full-motion 24 frames/second. While these streams cannot be accommodated in even a single Ethernet hop, a modified FDDI ring can be used.
Consider the dual-ring FDDI. Each level of the ring can accommodate 100 Mbps. There are some recent protocol systems that permit the utilization of multiple segments of the ring simultaneously; this would permit sequences of processors on the ring to be configured as a pipeline, and the output would be collected on the other ring. The result would permit redistribution and configuration of extraction processing resources within the ring.
If the visualization stream is not full-motion or full-color, the bandwidth required would be reduced even further. Also, it is not clear at this time whether the full bit-rate is required during extraction or is the result of data stream merging costs, the latter of which could be transmitted to the user in a repeating loop.
These streams include interaction among the pipeline elements, monitoring of archive access for dynamic reorganization, user commands, extraction commands, database retrieval commands, and communication between the command processors for distributed resource allocation.
Indicated design
The following are the recommendations for the design of a network for a DAAC system that is flexible, scalable, and secure. There is a multiple ring structure. The rings comprise the data input, query transformation, and data processing and output components of operation. By separating the structure in this way, the satellite processing is partitioned from the user-level operations, and the query processing is partitioned from the internal extraction operations. The former provides a robust isolation between data input and output, and the latter provides a similar isolation of user and data processing. The result is a robust and secure system.
The distribution of satellite processing resources in a high-speed ring (FDDI II) or BISDN network (ATM) provides enhanced pipelining capability, scalability, and dynamic reorganization of resources not afforded by the current, fixed interconnection within individual backplanes (Chesney et al., 1990). This requires the use of emerging FDDI II protocols supporting the simultaneous use of multiple ring segments, sometimes called multiple tokens. This feature of the network design was emphasized by the meta-data description of the system.

The distribution of user query processing components among a low speed ring or bus (Ethernet for small distances or token ring for larger distances) provides links among the command processors to support distributed resource allocation at scheduling time (Short, 1991; Short & Shastri, 1990). This interaction was indicated by the stepwise refinement method.
The dynamic allocation of computation elements for extraction processing is similar to the satellite processing ring, i.e., both indicate an FDDI II modified multi-token ring or an ATM switching system (supporting full interconnections via a SONET-rate crossbar or multistage interconnection network). In the case of the extraction processing, the status of processors must be monitored by the query processing network for resource allocation. The idea is that the resources of the high-speed extraction network are allocated "out-of-band," at scheduling-time, in the query processing network.
The monitoring of the database usage and structure can also occur within the query processing network, because it is a resource reallocation function. The result is a system that is composed of three networks: one isolated multitoken FDDI II network or ATM switching system for satellite data processing and a slow query processing network linked to another fast extraction processing network. Security is enforced in the slow query processing network by nature of its physical partitioning from the other two networks.
The general structure is visualized and instantiated with canonical networks in Figure 9. The basic description is as follows. A control console and host computer to each network is assumed. The satellites are connected to SatNet with T-3 (45 Mbps) lines. The Pipe Processors are as described in the Ingestion portion, modified to provide a network interface, rather than a VME/VSB interface (Short &Shastri, 1990). SatNet is either an FDDI II multi-token ring, or (optimally) an ATM BISDN network, providing full crosspoint interconnection with rates of STS-3 (155 Mbps) to STS-12 (620 Mbps). Until such technology is commercially available, a conventional analog crossbar can be used because the connections within this network are not frequently modified.

The same Pipe Processors can be used as extraction engines with downloadable programs. They can also be used as remote processors by users at other workstations, if available. The design of ExtractNet supports heterogeneous systems, including supercomputers, workstations, and special-purpose Pipe Processors (as in SatNet).
The extraction processors can be connected to an ATM BISDN switch to implement the ExtractNet component. The ExtractNet should not be implemented with FDDI II or a crossbar, due to the highly dynamic reconfiguration that needs to occur to support varying user-specified extraction processes. The ExtractNet has a high bandwidth link to the database and another to each of the user-host processors. This latter link supports individual visualization streams.
The user access hosts are connected via a relatively conventional token ring, such as FDDI, or even Ethernet. Commands and resource allocation are processed on this network; these are low-bandwidth activities. It is assumed that one user-host will support each user connection because of the bandwidth required per user connection for high quality full-motion video. If still video is used, multiple users can be supported per station.
The ControlNet is used for distributed resource allocation among the user hosts and out-of-band resource allocation of the components of ExtractNet. SatNet allocation can occur off-line because the network is reconfigured only periodically.
A separate monitor host performs low bandwidth computations, such as database restructuring for performance (Campbell & Short, 1991). The design of the database to support dual high-bandwidth ports, or possibly multiple high-bandwidth retrieval ports to ExtractNet, is beyond the scope of this section.
User access is restricted to the ControlNet, where queries are processed within the hosts, or possibly off-loaded into the Extraction Processors of ExtractNet, or a separate high-performance engine connected to ControlNet (like the monitor). The access control is both physical and logical, so that the user commands are prohibited from utilizing the ExtractNet or SatNet. The interaction is similar to that of RPC, where user commands are decomposed into fixed, preexisting procedures that are pipelined together. User access is as secure as in RPC.
The DMSS architecture presented is scalable, partitions the DMSS via gateway access servers, and includes internally replicated processing components. A design in which control is distinct from data streams, both logically and topologically, has been shown.The architecture shown permits various implementations:
Chesney, J., Speciale, N., Horner, W., & Sabia, S. (1990, September). High Performance VLSI telemetry data systems. Second International Symposium on Space Information Systems, Pasadena, CA Amer.Inst.Aeronautics and Astronautics.
Cromp, R. (1991). Automated extraction of metadata from remotely sensed satellite imagery (Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 111-120.
Cromp, R., Campbell, W., & Short, Jr., N. (1992, February). An intelligent information fusion system for handling the archiving and querying of terabyte-sized spatial databases. International Space Year Conference on Earth and Space Science Information Systems, Pasadena, CA.
McDonald, K., & Blake, D. (1991). Information management challenges of the EOS data and information system (Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 258-267.
Quirk, B., & Thompson, R. (1991). Early-EOS actinitios at the land processes distributed active archive center (DAAC) (Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 339-351.
Ramapriyan, H., & McConaughy, G. (1991). Version 0 EOSDIS-An overview (Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 352-362.
Short, Jr., N., & Shastri, L. (1990). The application of connectionism to query planning/scheduling in intelligent user interfaces. Telematics and Informatics, 7(3/4), 209-220.
Shi, J., & Grebowsky, G. (1990). A performance model for realtime packet processing (Mission Operations and Data Systems Directorate paper). Greenbelt, MD: NASA Goddard Space Flight Center.
Short, Jr., N. (1991). A real-time expert system and neural network for the classification of remotely sensed data ( Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 406-418.
Tanenbaum, A.S. (1988). Computer Networks (2nd ed. <h>. New Jersey: Prentice-Hall.
Wharton, S., Chang, H., & Krupp, B. (1991). Sizing the science data processing requirements for EOS (Technical Papers ACSM-ASPRS Annual Convention). Remote Sensing, 3, 478-487.
Acknowledgements- We would like to thank the members of the NASA GSFC Information Science and Technology Office (ISTO), Code 930.1, notably Nick Short, as well as Bob Cromp and Bill Campbell, for their input into this work. We would also like to thank Jim Chesney of NASA for providing detailed information on the DMSS ingestion hardware. We also thank the members of USC/ISI HPCC Division for their constructive comments on this document.