Network Working Group                                        W. Chimiak
Request for Comments: 1453                                         BGSM
                                                             April 1993

         A Comment on Packet Video Remote Conferencing and the
                        Transport/Network Layers

Status of this Memo

   This memo provides information for the Internet community.  It does
   not specify an Internet standard.  Distribution of this memo is
   unlimited.

Abstract

   The new generation of multimedia applications demands new features
   and new mechanisms for proper performance.  ATM technology has moved
   from concept to reality, delivering very high bandwidths and new
   capabilities to the data link layer user.  In an effort to anticipate
   the high bandwidth-delay data link layer, Delta-t [Delta-t], NETBLT
   [RFC 988], and VMTP [RFC 1045] were developed.  The excellent
   insights and mechanisms pioneered by the creators of these
   experimental Internet protocols were used in the design of Xpress
   Transfer Protocol (XTP) [XTP92] with the goal of eventually
   delivering ATM bandwidths to a user process.  This RFC is a vehicle
   to inform the Internet community about XTP as it benefits from past
   Internet activity and targets general-purpose applications and
   multimedia applications with the emerging ATM networks in mind.

1.  Introduction

   Networking is no longer synonymous with analog telephony.  High-
   performance lower-layer networks have made possible exciting new
   applications: collaboratory environments, distributed client/server
   computing, remote conferencing, teleclassrooms, and distributed
   life-sciences imaging.  These applications normally demand a great
   deal of bandwidth and often create operating system bottlenecks.
   Enabling these new multimedia applications entails delivering
   bandwidth to the applications, not just having bandwidth available on
   the network.  This statement may appear obvious, but often solutions
   at the transport layer are satisfied by having bandwidth at that
   layer without sufficient sensitivity to higher-layer access to the
   bandwidth.  The unavailability of bandwidth at upper layers is
   becoming the real issue as the networks are becoming a high-
   performance virtual backplane without concomitant high-performance
   control schemes.  It appears that new services are needed that
   require communication with all layers.  The ATM architecture calls

Chimiak                                                         [Page 1]

RFC 1453             Comments on Video Conferencing           April 1993

   for such an integrated control scheme.

2.  Remote Conferencing

   The challenges of remote conferencing is an application whose
   challenges may be met at the data link layer by the emerging
   broadband networks.  If so, important medical applications such as
   medical imaging for diagnosis and treatment planning would be
   possible [CHIM92].  Remote conferencing would permit imaging
   applications for life sciences through the use of national resource
   centers.  Collaboratory conferences in molecular modeling, design
   efforts, and visualization of data in numerous disciplines could
   become possible.

   At the Second Packet Video Workshop, held December, 1992, at MCNC in
   the Research Triangle Park, North Carolina, a recurrent theme was the
   use of multimedia in remote conferencing.  Its applications included
   the use of interactive, synchronized voice and video transmission,
   multicast transmission, data transfer, graphics transmission,
   noninteractive video and audio transmission, and data base query
   within a virtually shared workspace.  A few participants doubted the
   ability of current computer networks to handle these multimedia
   applications and preferred only connection-oriented, circuit-switched
   services.  Most participants, however, looked forward to using an
   integrated network approach.

2.1.  Remote Conferencing Functions and Requirements

   Remote conferencing as seen at the workshop requires a set of
   functions.  It must provide session scheduling that deals with
   initiating a session, joining in-progress sessions, leaving a session
   without tearing it down if there are multiple participants, and
   terminating a session.

   The remote-conferencing session needs a control subsystem that is
   either tightly controlled for an n-to-n connection for two to 15
   participants, or loosely controlled for a 1-to-n connection for any
   number of participants.  The Multipeer-Multicast Consortium is
   working on defining the control requirements and the mechanisms for
   control.  At the Packet Video Workshop, one participant presented a
   conference control protocol (CCP) shown in Figure 1 [CCP92].  In this
   architecture the CCP controls the Network Voice Protocol (NVP)
   [RFC741] and the Packet Video Protocol (PVP) [PVP81] over the
   experimental Internet Stream Protocol, Version 2 (ST-II) [RFC1190]
   rather than IP.

   Latency and intramedia synchronization and intermedia synchronization
   (lip-sync) are critical for the interactive voice and video streams

Chimiak                                                         [Page 2]

RFC 1453             Comments on Video Conferencing           April 1993

   of remote conferencing.  Client/server applications including data
   base operations are equally important.  The ability to display
   noninteractive video and high-resolution graphics is necessary.

   As with most applications, security will eventually be an issue.  At
   the very least, there must be a mechanism to determine who can find
   out what about conference and who can join a conference.  This
   determination will be part of an upper-layer protocol.

      +--------------+ +--------+ +-----+ +------------+
      |Teleconference| |  File  | |Email| |   Domain   |
      |   (CCP)      | |Transfer| |     | |Name Service|
      +----+-------+-+ +-----+--+ +-+---+ +-----+------+
           |       |         |__  __|           |
           |       |            ||              |
     +-----+--+ +--+-----+    +-++-+       +----+---+
     |Network | | Packet |    | T  |       |    U   |
     | Voice  | | Video  |    | C  |       |    D   |
     |Protocol| |Protocol|    | P  |       |    P   |
     +---+----+ +--+-----+    +-+--+       +--+-----+
         |__     __|            |__         __|
            |   |                  |       |
          +-+---+--+             +-+-------+-+
          | Stream |             |     I     |
          |Protocol|             |     P     |
          +---+----+             +---+-------+
              |                      |
        +-----+----------------------+----+
        |IEEE_802.X,FDDI,DARTnet,ATOMIC...|
        +---------------------------------+

          Figure 1: The Connection Control Protocol Architecture

   The solutions must range in geography from single machines through
   LAN, CAN, MAN, WAN conferences, as well as in data content from PCs
   to high-end workstations.  Implicit in the scaling is the notion of
   product and application interoperability.

   Remote conferencing applications should take advantage of future
   network enhancements, as well as the functions provided by ATM, FDDI,
   and SMDS.  Doing so should provide function versus resource trade-
   offs such as cost versus error control and cost versus rate control.
   As a result, the transport layer should be able to provide the
   services offered by the data link layer.

   Most of the presentation on remote conferencing indicated a need for
   some degree of multicast functionality, ranging from the 1-to-n, with
   conference membership completely known, to conferences for which

Chimiak                                                         [Page 3]

RFC 1453             Comments on Video Conferencing           April 1993

   existence of a group is known, but exact membership is not.

   In remote conferencing, it is preferable to use one network for all
   information traffic.  This network should have an offered quality of
   service (QOS) criteria to accommodate tradeoff metrics, which would
   include guaranteed throughput, connection reliability, high call-
   completion rate, few dropped calls, tolerable error rate, varying
   degrees of compression on the video and audio streams, and tolerable
   motion artifacts, flow control, and latency.  The QOS should be an
   input to the transport layer from the application or transport
   service user.

   The compression/coding function should provide time-stamping and
   packetizing information, as well as real-time echo cancellation.
   These functions are usually at the presentation and session layer of
   the Open System Interconnection (OSI) model or the at the application
   in some Internet models, but not the transport layer.

3.  Potential Solutions

   RFC 1193 deals with the requirements of real-time communications,
   which include some of the same capabilities [RFC1193].  But the
   requirements articulated create the necessity for new
   transport/network protocols.  The new protocols under development by
   the Audio Visual Transport [SCHU92] (RTC, RTCP), and other protocols
   in this area (ST-II) suggest an architecture like that shown in
   Figure 2.

   These approaches may work.  However, they encourage a discipline that
   creates a protocol for each new class of application.  Another
   approach might be to unify the protocols.  It is felt that this is
   one of the main goals of XTP (see Figure 3).

   Other design considerations of XTP include the following:

Chimiak                                                         [Page 4]

RFC 1453             Comments on Video Conferencing           April 1993

             +----------------------+
             |          media       |
             |       application    |
             +--------+----+-+------+
             |        |RTCP| |      |
             |        +----+ |   T  |
             |         RTP   |   C  |
             +-----+-----+   |   P  |
             |ST-II| UDP |   |      |
             +     +-----+---+------|
             |     |       IP       |
             +-----+-------+--------+
             |    Data Link Layer   |
             +----------------------+

              Figure 2: One emerging multimedia architecture

     +--------------+  +--------+ +-----+ +------------++-----------+
     |Teleconference|  |  File  | |Email| |   Domain   ||   media   |
     |              |  |Transfer| |     | |Name Service||application|
     +------+-------+  +----+---+ +--+--+ +-----+------++-----+-----+
            |               |        |          |             |
            +---------------+--------+----------+-------------+
                                     |
                             +-------+--------+
                             |Unified Protocol|
                             +----------------+
                             |Data Link Layer |
                             +----------------+

           Figure 3: Another integrated multimedia architecture

   (1)  Developing a protocol based on the work and experience of
        the protocol groups such as IETF, which produced NETBLT,
        VMTP, TCP, IP, and UDP.

   (2)  Incorporating lessons from Delta-t.

   (3)  Observing the design paradigm shift of ATM, SONET, and  VMTP
        in the header, trailer, and information segment construction.

   (4)  Targeting the real-time requirements articulated by the
        Department of Defense SAFENET committee and the French
        Ministry of Defense military real-time specification [GAM-T-103].

   Mechanisms in XTP allow an application to approach the performance
   desired.  A session-scheduling mechanism for joining and leaving a

Chimiak                                                         [Page 5]

RFC 1453             Comments on Video Conferencing           April 1993

   multicast conference exists.  The XTP mechanism for multicast
   satisfies the loosely controlled session requirements.  The tightly
   controlled session would require the use of multiple XTP
   associations.

   The priority mechanism that uses the 32-bit SORT field permits an
   application to prioritize data.  Because XTP is a transport layer,
   this priority mechanism follows through every node tranversed.  There
   is also an out-of-band delivery mechanism.  However, XTP does not
   offer latency control by itself; it just provides a priority
   mechanism.

   The selective acknowledgement, fast negative acknowledgement, and
   selective retransmission permit an application to choose an
   appropriate level of error control.  The combination of the priority
   mechanism and these error-control mechanisms is likely to approach
   the latency and synchronization requirements of remote conferencing.

   Noninteractive audio and video, as well as graphics presentation, can
   be accommodated in many ways by the application.  It is important
   that the transport layer provides adequate mechanisms to deliver the
   appropriate data streams in a manner compatible with the
   applications.  These applications can probably be accomplished by
   means of extant protocols, as well as XTP.

   The scalability of the solution will be a function of the standards
   used.  At the Packet Video Workshop, some of the applications
   sacrificed computer network standards in favor of desired
   performance.  This approach usually impedes scalability.  From the
   explanation of the applications taking this approach, it appeared
   that using XTP would have provided an adequate transport service for
   the applications.

   XTP was designed to provide mechanisms to accommodate application
   requirements, that is, the ability to respond to QOS requests.  For
   example, guaranteed throughput may be accomplished by using XTP's
   rate and burst control together with flow control or no flow control.
   Tolerable error rate can be accomplished with partially error
   controlled connections (PECC), a service which can be placed in the
   application or just above the transport layer [PECC92].  Motion
   artifacts and varying degrees of compression should be done above the
   transport layer in coordination with the transport layer or possibly
   in a network management function.

3.1.  Synthesize the Hardware Fabrication Process into the Design

   To produce an affordable solution, the hardware fabrication process
   should be a design consideration.  Technologies are evolving too

Chimiak                                                         [Page 6]

RFC 1453             Comments on Video Conferencing           April 1993

   rapidly to assume that a generic protocol design will anticipate all
   fabrication advances, but this fact should not impede use of the
   features of advanced hardware fabrication processes.

   System interface problems and VLSI techniques should be considered in
   the specification of the protocol.  An examination of the ATM and
   SONET standards appears to support this philosophy.  Similarly,
   NETBLT and VMTP design efforts seem to support this approach.  XTP
   does use it.

   It is very helpful to break down the protocol into parallel-state
   machines for execution on more inexpensive hardware.  This procedure
   reduces the context switching and interrupt handling requirements of
   the hardware, thereby decreasing production costs while producing a
   scalable protocol machine.

4.  Multimedia Applications over XTP

   In parallel with the IETF efforts to enable multimedia applications
   such as remote conferencing, the XTP forum members have been
   experimenting with major elements of these applications.

   (1)  At the University of Virginia, more than 100 simulated voice
        channels were run on an FDDI network [UVAVOICE92].  The
        purpose of this experiment was to test the limits of FDDI
        and a software version of XTP in a simulated interactive
        voice environment.  Multicasted, noninteractive video has
        been supported there for several years.

        UVa also has a video-mail demo over XTP/FDDI that uses
        Fluent multimedia interface and standard JPEG compression.
        This PC-based demo delivers full frame, full color, 30
        frames/sec video from any network disk to a remote VGA
        screen.  It is important that users could not discern any
        difference  in  playback  between  a local disk and a remote
        disk.  An Xpress File System (XFS) is used on server and
        client systems.

   (2)  The Technical University of Berlin, Germany, reports that
        the coordination, implementation, and operation of
        multimedia services (CIO) of the R&D in Advanced
        Communications Technologies in Europe (RACE) is using XTP as
        a starting point for design [XTPRACE].

   (3)  At the Naval Command, Control, and Ocean Surveillance Center
        Research, Development, Test and Evaluation Division NRaD
        (formerly the Naval Ocean Systems Command (NOSC)), voice is

Chimiak                                                         [Page 7]

RFC 1453             Comments on Video Conferencing           April 1993

        multicasted over XTP/FDDI.  A simple multicast is
        distributed to a group with a latency of around 25 ms, where
        the latency represents delay from the voice signal from the
        microphone to the audio signal to the speaker.  This group
        is currently doing research on an n-party multicast of voice
        (telephone conference-call paradigm [n x n multicast]).

   (4)  Commercially, Starlight Networks Inc., migrated a subset of
        XTP into the transport layer of its video application
        server.  By using XTP rate control, full-motion, full-screen
        compressed video is delivered at a constant 1.2 Mbps, over
        switched-hub Ethernet to viewstations.  This network
        delivers at least 10 simultaneous video streams.

   Therefore, XTP has been used in applications that were previously
   placed over IP or even a data link layer.

5.  Policy versus Mechanism

   Separating protocol policies and mechanisms [XTPbk] permits adoption
   of new policies without altering offered mechanisms.  An excellent
   example is UVa's Partially Error Controlled Connections (PECC).  This
   control system maximizes error control in such a way that receiving
   FIFOs are never starved i.e., the application, driver, or operating
   system buffer control, and not the transport layer becomes the
   bottleneck.

   Because XTP is mechanism-rich and policy-tolerant, this very dynamic
   error control policy mechanism is possible.  Separating policy and
   mechanism permits an error-control or flow-control policy to adapt to
   the data link layer conditions without shutting down a connection and
   rebuilding (or multiplexing) a new one on a different protocol stack.
   This may also provide an easier way for a network management
   subsystem to maintain a desired QOS.

6.  Summary

   Remote conferencing presents new opportunities for research,
   business, and administration.  Although some are proposing that only
   classical circuit-switched mechanisms be used, most network engineers
   are searching for ways to use the new features of FDDI, SMDS, and ATM
   in multimedia applications such as remote conferencing.  Some new
   applications have been placed directly on a data link layer.  New
   Transport/Network layer combinations have been proposed and are being
   tested.  It is believed that consideration should be given to XTP as
   a possible solution because its forum membership includes commercial,
   government, and research institutions, some of which have implemented
   various applications that contribute to an overall remote-

Chimiak                                                         [Page 8]

RFC 1453             Comments on Video Conferencing           April 1993

   conferencing application.

7.  References

   [CCP92]     Schooler, E., "An Architecture for Multimedia Connection
               Management", in Proceedings of the 4th IEEE ComSoc
               International Workshop on Multimedia Communications,
               Monterey, CA, April 1992.

   [CHIM92]    Chimiak, W., "The Digital Radiology Environment", IEEE
               JSAC, Vol. 10, No. 7, pp. 1133-1144, September 1992.

   [Delta-t]   Watson, R. W., "Delta-t Protocols Specification",
               Lawrence Livermore Laboratory, April 15, 1983.

   [GAM-T-103] French Ministry of Defense, "GAM-T-103 Military
               Real-Time Local Area Network Reference Model
               (Transfer Layer)", February 7, 1987.

   [PECC92]    Dempsey, B., Strayer, T.  and Weaver A., "Adaptive Error
               Control for Multimedia Data Transfer", in Proceedings
               of the IWACA 92, Munich, Germany, pp. 279-288, March
               1992.

   [PVP81]     Cole, R., "PVP - A Packet Video Protocol", W-Note 28,
               Information Sciences institute, University of
               California, Los Angeles, CA, August 1981.

   [RFC1045]   Cheriton, D., "VMTP: Versatile Message Transaction
               Protocol Specification", RFC 1045, Stanford
               University, February 1988.

   [RFC998]    Clark, D., Lambert, M., and L. Zhang, "NETBLT: A Bulk
               Data Transfer Protocol", RFC 998, MIT, March 1987.

   [RFC1193]   Ferrari, D., "Client Requirements For Real-Time
               Communication Services", RFC 1193, UC Berkeley,
               November 1990.

   [RFC1190]   Topolcic, C., Editor, "Experimental Internet Stream
               Protocol: Version 2 (ST-II)", RFC 1190, CIP Working
               Group, October 1990.

   [SCHU92]    Schulzrinne, H., "A Transport Protocol for Audio and
               Video Conferences and other Multiparticipant
               Real-Time Applications", Internet Engineering Task
               Force, Internet-Draft, October 1992.

Chimiak                                                         [Page 9]

RFC 1453             Comments on Video Conferencing           April 1993

   [UVAVOICE92] Weaver, A. C. and McNabb, J.F., "Digitized Voice
                Distribution Using XTP and FDDI", Transfer, Vol. 5,
                No.  6, pp. 2-7 (November/December 1992).

   [XTP92]     Xpress Transfer Protocol, version 3.6, XTP Forum,
               1900 State Street, Suite D, Santa Barbara, California
               93101 USA, January 11, 1992.

   [XTPbk]     Strayer, W.T., Dempsey, B.J., and Weaver, A.C., "XTP:
               The Xpress Transfer Protocol", Addison-Wesley
               Publishing Company, Inc., 1992.

   [XTPRACE]   Rebensburg, K. and Miloucheva, I., "The Use of XTP in a
               Large European Communication Project", XTP Forum
               Research Affiliate Annual Report, Document 92-183,
               pp. 105-112, 1992.

Security Considerations

   Security issues are discussed in section 2.1.

Author's Address

   William J. Chimiak
   Department of Radiology
   Bowman Gray School of Medicine
   Medical Center Boulevard
   Winston-Salem, NC 27157-1022

   Phone: 919-716-2815
   EMail: chim@relito.medeng.wfu.edu

Chimiak                                                        [Page 10]