14.2 Creating Next-generation Network Services and Infrastructure 279After 35 years of increasing Internet success, communications standards organiza-tions have now almost universally en
Trang 1Routing Decision Node (RDN) is assumed The first three schemes basic standalonemechanisms, and the last scheme attempts to find the best combination of the abovetwo schemes The primary route from node I to E is depicted as a thick black lineand the link 2→ 3 is assumed to be broken during the operation.
When the headend 2 (or tailend 3) of link 2→ 3 detects the link failure, it informsthe RDN via the control plane The RDN conducts the routing recomputation andupdates the forwarding tables for all nodes, and new bursts will subsequently followthe new routes For example, new bursts will follow the route I→ 4 → 5 → 6 E Thissolution is optimal and the existing routing protocol can handle it well However,global routing table updating is a slow process (in seconds or even minutes) due
to the long round-trip time for the signal transmission and processing between theOBS nodes and the routing entity As a result, a large amount of bursts will be lostbefore the forwarding tables are updated
This is similar to the traditional deflection routing usually seen in a congestionresolution scheme When the headend 2 detects the link failure, it will automaticallypick up the alternative next hop in the forwarding table for every new burst whosenext hop on its primary route passes the faulty link In the example, new burstsfrom I to E will follow the alternative route I→ 1 → 2 → 5 → 6 → E This would bethe fastest restoration scheme since new bursts will be deflected to an alternativegood link right after the link failure is detected locally Therefore, it will incur thesmallest restoration loss However, because all the affected bursts are deflected toone alternative path, this scheme would increase the congestion loss
In this scheme, the headend 2 will also send a different fault notification message
to all its adjacent nodes in addition to the one to the RDN This fault notificationmessage contains the destination information for all the primary routes passing thefaulty link After receiving this message, each of the adjacent nodes will pick up
an alternative next hop for the affected bursts that are heading to the faulty linkaccording to their primary route In the example, bursts from I to E will take the newroute I→ 1 → 4 → 5 → 6 → E Compared with the local deflection scheme, neighbordeflection has the potential to make the rerouted traffic more distributed instead
of being totally deflected to one alternative path In this way, less congestion andtherefore less burst loss may occur However, this scheme requires extra one-hopfault notification One possible problem is that, if the network traffic load is alreadyvery heavy, distributed deflection may have a negative impact as it may deterioratethe congestion condition all over the network
While the restoration based on the neighbor deflection may make the deflected loadmore balanced, it suffers from an extra one-hop fault notification delay that will result
Trang 213.6 Recovery and Restoration 271
in greater overall burst loss because of bigger restoration loss Therefore, a more
efficient algorithm is a combination of local deflection and neighbor deflection, i.e.,
the affected bursts are deflected locally until the adjacent nodes receive the fault
notification At that time the affected bursts will be deflected in a distributive way
We name this scheme distributed deflection
One interesting observation from scheme 2 is that the capacity of the links between
the headend (node 2) of the faulty link and its adjacent nodes (node 1) will not
be utilized if affected bursts are deflected at adjacent nodes Therefore, we define a
distribution ratio, , to determine the portion of affected bursts that will be deflected
at the adjacent nodes That is, after the adjacent nodes receive the fault notification,
affected bursts will be deflected distributively and 1– affected bursts will be
forwarded to the headend node of the faulty link to be deflected locally
With a different value of ∈ 01, we have a different variance of the distributed
restoration scheme When= 0, it is equivalent to scheme 1, local deflection-based
restoration When = 1, it becomes scheme 3, the distributed deflection-based
restoration We use distributed deflection to denote the generalized distributed
deflection mechanism We also note that using introduces only a tiny amount
of management complexity in the adjacent nodes We expect that there exists an
optimal value of that makes the affected bursts deflected in a most balanced way
such that the minimum burst loss can be achieved
Here are defined three restoration classes with different priority values for all bursts
When a burst is generated at the edge node, it will be assigned a priority value in its
control packet according to its restoration QoS requirement Bursts in the highest
class will pick up best from the local and neighbor deflection restoration schemes
during different restoration periods The local deflection will be chosen during the
fault notification period because of its shorter fault notification time And the local
and neighbor deflection scheme with shorter alternative route length (number of
hops in this paper) during the deflection period will be chosen because of its lower
backup capacity requirement (and possible lower average burst loss probability)
Bursts in the middle class will be restored via neighbor deflection, and the bursts in
the lowest class will be dropped until the global forwarding table update is finished
In this way, the lower class bursts do not compete for the bandwidth with the higher
class bursts in the deflected alternative routes
Figure 13.6 depicts the simulation results on a NSF topology The y-axis represents
the overall burst loss probability under medium traffic during the three network
operational phases around a link failure In the x-axis, 1 represents the period before
the link failure, 2 represents the period before a global forwarding table update and
only the proposed fast restoration schemes are in place, and 3 represents the period
after the global forwarding table update We observe that the burst loss probability
is very low for both phases 1 and 3, though it is actually a little bit higher in phase
3 due to the reduction in the network capacity
Trang 30.1 0.11 0.12 0.13 0.14 0.15 0.16
Global update Local deflection Neighbour deflection Distributed deflection
Restoration phase
Figure 13.6. Restoration performance
However, the loss probability could increase significantly in phase 2 Relying only
on the forwarding table update would incur very high burst loss in this phase.Nonetheless, the loss probability increases only moderately when the proposed fastrestoration schemes are used in this phase Among the four restoration schemes,distributed deflection shows the best performance (almost no extra burst loss)followed by local deflection and neighbor deflection Global routing update incursthe highest burst loss Specifically, the improvements from using the three fastrestoration schemes over the global forwarding table update are 24.3%, 20.1%, and10.5%, respectively
The distributed deflection and class-based QoS restoration schemes are built
based on the performance differentiation of above four basic restoration schemes,therefore they can be directly built into the fault management module to providedifferentiated restoration service We note that the above schemes are studied for OBSnetworks, but they could be directly extended to other packet-switched networks
13.7 INTEGRATED FAULT MANAGEMENT
Except for the unreliable fault detector, the current Globus toolkit does not provideother fault tolerant mechanisms There exist a number of different fault handlingmechanisms for Grid applications to respond to system/component failures [37]:
• fail-stop: stop the application;
• ignore the failure: continue the application execution;
• fail-over: assign the application to new resources and restart;
• migration:replicationandreliablegroupcommunicationtocontinuetheexecution
Trang 4References 273
Fail-over and migration are the two choices for fault-tolerant computing and can
be achieved either in a deterministic way, in which the backup compute resourceand network route are precomputed along with the primary, or in a dynamic way,
in which the backup is dynamically discovered after the failure occurrence anddetection We note that this could be coupled with the connection protection andrestoration mechanism in the network context discussed in Section 13.6
At the system level, performance monitoring, data analysis and fault detection,fault recovery will form a feedback loop to guarantee the system performance andavailability The complexity comes from the fact that performance/reliability data can
be collected in different timescales from different layers of the Grid networks (layer1/2/3 and application) Therefore, a Grid platform needs to provide different levels
of fault tolerance capability in terms of timescale and granularity A large portion
of the fault management activities are conducted within the Grid infrastructure andtransparent to the end-user applications
From the point view of applications, we identify the following failure scenariosand fault handling mechanisms to explicitly integrate the Grid service migration withthe network protection or restoration:
• Fail-over or migrate within the same host(s) The application requires dedicated or
shared backup connections for the primary connections (either unicast or cast) The applications can also specify the network restoration time requirement
multi-• Fail-over or mitigate to different host(s) This mechanism is more generic in the
sense that the application requires redundant Grid resource/service In the casethat the primary Grid resource fails, the Grid middleware can reset the connec-tions to the backup Grid resource/service and jobs can be migrated directly fromthe failed primary resource to the backups
Further studies are under way for the aforementioned cases in terms of dynamicconnections setup and network resource allocation
REFERENCES
[1] F Cappello (2005) “Fault Tolerance in Grid and Grid 5000,” IFIP Workshop on GridComputing and Dependability, Hakone, Japan, July 2005
[2] K Birman, “Like it or Not, Web Services are Distributed Objects!”, white paper
[3] V Paxson, G Almes, J Mahdavi, and M Mathis (1998) “Framework for IP PerformanceMetrics.” RFC 2330, May 1998
[4] R Hughes-Jones, P Clarke, and S Dallison (2005) “Performance of Gigabit and 10 GigabitEthernet NICs with Server Quality Motherboards,” grid edition of Future Generation Computer Systems, 21, 469–488.
[5] M Mathis and M Allman (2001) “A Framework for Defining Empirical Bulk TransferCapacity Metrics,” RFC 3148, July 2001
[6] Source code and documentation of iperf is available at http://dast.nlanr.net/Projects/ Iperf/.[7] G Almes, S Kalidindi, and M Zekauskas (1999) “A One-way Delay Metric for IPPM.” RFC
2679, September 1999
[8] G Almes, S Kalidindi, and M Zekauskas (1999) “A Round-trip Delay Metric for IPPM.”RFC2681, September 1999
Trang 5[9] T Ferrari and F Giacomini (2004) “Network Monitoring for Grid Performance tion.”Computer Communications, 27, 1357–1363.
Optimiza-[10] S Andreozzi, D Antoniades, A Ciuffoletti, A Ghiselli, E.P Markatos, M Polychronakis,and P Trimintzios (2005) “Issues about the Integration of Passive and Active Moni-toring for Grid Networks,” Proceedings of the CoreGRID Integration Workshop,November 2005
[11] Source code and documentation of cricket is available at http://people.ee.ethz.ch/∼oetiker/webtools/mrtg/
[12] Multi Router Traffic Grapher software and documentation, http://people.ee.ethz.ch/∼oetiker/webtools/mrtg/
[13] “Grid network Monitoring: Demonstration of Enhanced Monitoring Tools,” DeliverableD7.2, EU Datagrid Document: WP7-D7.2–0110-4-1, January 2002
[14] “Final Report On Network Infrastructure And Services”, Deliverable D7.4, EU DatagridDocument Datagrid-07-D7-4-0206-2.0.doc, January 26, 2004
[15] EGGE JRA4: “Development of Network Services, Network Performance Monitoring –e2emonit,” http://marianne.in2p3.fr/egee/network/download.shtml
[16] Internet2 end-to-end performance initiative http://e2epi.internet2.edu/web traceroute.Home page for web based traceroute http://www.traceroute.org/
[17] A large collection of traceroute, looking glass, route servers and BGP links traceroutemay be found at http://www.traceroute.org/
[18] J Boote, R Carlson, and I Kim NDT source code and documentation available athttp://sourceforge.net/projects/ndt
[19] R.L Cottrell and Connie Logg (2002) “A new high performance network and applicationmonitoring infrastructure.” Technical Report SLAC-PUB-9202, SLAC
[20] IEPM monitoring page, http://www.slac.stanford.edu/comp/net/iepm-bw.slac ford.edu/slac_wan_bw_tests.html
stan-[21] Source code and documentation of thrulay is available at http://people.internet2.edu/∼shalunov/thrulay/
[22] C Dovrolis, P Ramanathan, and D Moore (2001) “What do Packet Dispersion TechniquesMeasure?,” IEEE INFOCOM
[23] Source code and documentation of pathload is available at http://www.cc.gatech.edu/fac/Constantinos.Dovrolis/pathload.html
[24] Source code and documentation of pathchirp is available at http://www.spin.rice.edu/Software/pathChirp/
[25] Source code and documentation of bbftp is available at http://doc.in2p3.fr/bbftp/.[26] A Hanushevsky, source code and documentation of bbcp is available at http://www.slac.stanford.edu/∼abh/bbcp/
[27] L Cottrell (2002) “IEPM-BW a New Network/Application Throughput PerformanceMeasurement Infrastructure,” Network Measurements Working Group GGF Toronto,February 2002, www.slac.stanford.edu/grp/scs/net/talk/ggf-feb02.html
[28] R Hughes-Jones, source code and documentation of UDPmon, a tool for investigatingnetwork performance, is available at www.hep.man.ac.uk/∼rich/net
[29] Endace Measurement Systems, http://www.endace.com
[30] J Hall, I Pratt, and I Leslie (2001) “Non-Intrusive Estimation of Web Server Delays,”Proceedings of IEEE LCN2001, pp 215–224
[31] Web100 Project team Web100 project, http://www.web100.org
[32] B Tierney, NetLogger source code and documentation available at didc.lbl.gov/NetLogger/
http://www-[33] “Telecommunications: Glossary of Telecommunication Terms”, Federal Standard 1037C,August 7, 1996
Trang 6Fault-[36] H Jin, D Zou, H Chen, J Sun, and S Wu (2003) “Fault-tolerant Grid Architecture andPractice,”Journal of Computing Science and Technology, 18, 423–433.
[37] P Stelling, C DeMatteis, I Foster, C Kesselman, C Lee, and G von Laszewski (1999)
“A Fault Detection Service for Wide Area Distributed Computations,”Cluster Computing,
2(2), 117–128
[38] O Bonaventure, C Filsfils, and P Francois (2005) “Achieving Sub-50 onds Recovery upon BGP Peering Link Failures,” Co-Next 2005, Toulouse, France,October 2005
Millisec-[39] J Lang, “Link Management Protocol (LMP),” IETF RFC 4204
[40] M Médard and S Lumetta (2003) “Network Reliability and Fault Tolerance”,Wiley clopedia of Engineering (ed by J.G Proakis), John & Wiley & Sons Ltd.
Ency-[41] L Valcarenghi (2004) “On the Advantages of Integrating Service Migration and GMPLSPath Restoration for Recovering Grid Service Connectivity”, 1st International Workshop
on Networks for Grid Applications (gridNets 2004), San Jose, CA, October 29, 2004.[42] D Papadimitriou (ed.) “Analysis of Generalized Multi-Protocol Label Switching (GMPLS)based Recovery Mechanisms (including Protection and Restoration),” Draft-ietf-ccamp-gmpls-recovery-analysis-05
[43] Y Xin and G.N Rouskas (2004) “A Study of Path Protection in Large-Scale OpticalNetworks,”Photonic Network Communications, 7(3), 267–278.
[44] J L Marzo, E Calle, C Scoglio, and T Anjali (2003) “Adding QoS Protection in Order toEnhance MPLS QoS Routing,” Proceedings of IEEE ICC ’03
[45] S Rai and B Mukherjee (2005) “IP Resilience within an Autonomous System: CurrentApproaches, Challenges, and Future Directions”, IEEE Communications Magazine,
Manage-[48] Y Xin, J Teng, G Karmous-Edwards, G.N Rouskas, and D Stevenson (2004) “A NovelFast Restoration Mechanism for Optical Burst Switched Networks”, Proceedings of theThird Workshop on Optical Burst Switching, San Jose, CA, USA, October 2004
Trang 8Currently, many communities are examining requirements and designs fornext-generation networks, including research communities, standards associations,technology developers, and international networking consortia Some of these newdesigns are implemented in early prototypes, including in next-generation opencommunication exchanges This chapter describes the general concept of such anexchange, and presents the services and functions of one that would be based on afoundation of optical services.
Grid Networks: Enabling Grids with Advanced Communication Technology Franco Travostino, Joe Mambretti,
Trang 914.2 CREATING NEXT-GENERATION NETWORK SERVICES AND
in Internet architecture For example, it would be highly distributed
Even without the Grid as a driving force to motivate the creation of communicationsinfrastructure with these characteristics, architectural designs based on these princi-ples are motivated by other dynamics Major changes are required in the methods
by which communications services and infrastructure are designed and provisioned.These changes are motivated by the explosive growth in new data services, by theadditional communities adopting data services, and by the number and type ofcommunication-enabled devices Soon services will be required for 3 billion mobilegeneral communication devices
Today, there is a growing need to provide seamless, ubiquitous access, at anytime from any location and any device In addition to general communicationdevices, many more billions of special-purpose communications-enabled devices areanticipated: sensors, Radio Frequency Identification (RFID) tags, smart dust, nano-embedded materials, and others All of these requirements are motivating a transfor-mation of the design of core network infrastructure The legacy architectural modelsfor services design and provisioning cannot meet the needs of these new require-ments A fundamentally new approach is required The guiding principles for thatnew approach will be based on those that informed Internet architecture The nextsection presents some of these principles, which are key enablers for next-generationcommunication services and facilities
In previous chapters, the important of the end-to-end principle in distributed tructure design was discussed Its basic premise is that the core of such distributedinfrastructure, including networks, should be kept as simple as possible, and theintelligence of the infrastructure should be placed at the edge This principle is nowalmost universally accepted by all networking standards bodies Recently, the ITUformally endorsed the basic principles of the Internet design through its Study Group
infras-13 recommendation for a Next-Generation Network (NGN) [1] Balancing the need
to maintain the end-to-end principle and to create new types of “intelligent” corefacilities is a key challenge
Trang 1014.2 Creating Next-generation Network Services and Infrastructure 279
After 35 years of increasing Internet success, communications standards
organiza-tions have now almost universally endorsed a architecture founded on the
communi-cation of packet-based data The IETF has demonstrated the utility of this approach
The IEEE has established initiatives to create layer 2 transport enhancements to better
support based services The ITU-T NGN recommendation specifies “A
packet-based network able to provide telecommunication services and able to make use of
multiple broadband, QoS-enabled transport capabilities.” The NGN Release 1
frame-work indicated that all near-term future services are to be based on IP, regardless of
underlying transport
Earlier chapters have stressed the importance of abstracting capabilities from
infras-tructure The design of the Internet has been based on this principle Attaining this
goal is being assisted by the general recognition that legacy vertical infrastructure
stacks, each supporting a separate service, must be replaced by an infrastructure that
is capable of supporting multiple services This design direction implies a
funda-mentally new consideration of basic architectural principals The ITU-T has begun
to formally adopt these concepts One indication of this direction is inherent in
the ITU-T NGN recommendation that “service-related functions” should be
“inde-pendent from underlying transport related technologies.” These recommendations
indicates that the design should enable “unfettered access for users to networks,”
through open interfaces [1]
Also, the ITU-T is beginning to move toward a more flexible architectural model
than the one set forth in the classic 7 layer Open Systems Interconnect (OSI) basic
reference model [2] This trend is reflected in the ITU-T recommendations for the
functional models for NGN services [3] This document notes that the standard
model may not be carried forward into future designs The number of layers may
not be the same, the functions of the layers may not be the same, standard attributes
may not be the same, future protocols may not be defined by X.200, e.g., IP, and
understandings of adherence to standards may not be the same
Another important architectural design principle for any future network is
self-organization
The Internet has been described as a self-organizing network This description has
been used to define its architecture in general, because many of its individual
proto-cols are designed to adapt automatically to changing network conditions without
having to rely on external processes This term has also been used to describe
the behavior of individual protocols and functions, such as those that automatically
adjust to changing conditions Because the Internet was designed for an unreliable
infrastructure, it can, therefore, tolerate uncertain conditions better than networks
that depend on a highly reliable infrastructure
Trang 11The principle of self-organization for networks is particularly important forscalability in a ubiquitous communication infrastructure containing an extremelylarge number of edge devices, especially mobile edge devices The only method forensuring reliable services in this type of global distributed environment is through anarchitecture that enables edge devices to be self-sufficient, to be context aware, self-configuring, adjustable to fault conditions and highly tolerant of instability Theseattributes enable individual services, objects, and processes to automatically adjust
to changing circumstances Recently, progress has been made in identifying mental issues in requirements and potential solutions for autoconfiguration forcomponents in large-scale dynamic IP networks [4]
funda-Another motivation for developing architecture for self organizing networks, such
as ad hoc networks, is to enable communication services in areas with minimalinfrastructure or unstable infrastructure
Another key principle for next generation communications architecture is ensuringcapabilities for distributed service creation Traditionally, service providers have care-fully defined services, and have then designed the technology and infrastructure tosupport those services, with an understanding that they will remain in place forlong periods of time However, in an environment of rapid change, in which newservices can be conceptualized continually, this approach is no longer feasible Thearchitecture must anticipate and support continuous changes in services
Also, the architecture should be designed to provide support for distributed servicecreation It should provide the tools and functionality for services creation to broadcommunities and allows those communities to create, manage, and change theirown services
14.3 LARGE-SCALE DISTRIBUTED FACILITIES
These design principles are inherent in the basic architecture of the Internet, andthey are reflected in Grid design principles Currently, these principles are used
to design new types of large-scale distributed infrastructure to provide support forGrid services The communications infrastructure for these environments does notresemble a traditional network Within this environment, it is possible either to accept
Trang 1214.4 Designs for an Open Services Communications Exchange 281
predefined default services or to create required services through dynamic, ad hoc
processes These environments are used to develop and implement customized Grid
services and specialized communication services For example, in some Grid
envi-ronments, communications services are used as a large-scale distributed backbone
to support specialized Grid processes
This architectural approach provides highly distributed communities with direct
access to resources, toolsets, and other components, including complete resource
management environments, that allow them to create new services and to change
existing services Network services, core components, resources such as lightpaths,
layer 2 channels, cross-connections, and even individual elements can be identified
and partitioned into separate domains and provided with secure access mechanisms
that enable organizations, individuals, communities, or applications to discover,
interlink and utilize these resources Accessible resources can even include basic
optical components, which can be managed and controlled by highly distributed
edge processes
14.4 DESIGNS FOR AN OPEN SERVICES COMMUNICATIONS
EXCHANGE
The architecture described here is implemented within many Grid environments As
these environments were developed, it was recognized that an important component
within this type of large scale distributed facility is an open, neutral communications
exchange point The design of such exchanges constitutes a significant departure
from that of traditional carrier communications exchanges
Today, almost all data networks exchange information at Internet exchange
facil-ities Almost all of these facilities have an extremely limited range of services, and
limited peering capabilities – primarily they provide capabilities for provider peering
at layer 3 and, for some under restrictive policies, layer 2 Also, traditional exchanges
are oriented almost exclusively to provider-to-provider traffic exchanges
The international advanced networking research community is developing new
designs for data communication exchange points that provide for many more types
of services, including Grid services and layer 1 peering capabilities These designs are
currently being developed and implemented in prototype by research communities
An open Grid service exchange provides mechanisms for nonprovider services
exchange through subpartitioning of resources For example, it can allow segments of
resources along with selected control and management processes for those resources
to be allocated to specific external entities
A basic design objective for an open Grid services exchange is the creation of a facility
that fully supports interactions among a complete range of advanced Grid services,
within single domains and across multiple domains Because Grid services are defined
as those that can be constructed by integrating multiple resources, advertised by
Trang 13common methods, this type of exchange provides capabilities for resource discovery,assembly, use, reconfiguration, and discard.
Unlike existing exchanges, this type of exchange will provide methods for ering a complete range of services that can be utilized and manipulated as Gridresources, including options for integration with other distributed Grid resources.This capability provides far more options for fulfilling requirements of services andapplications than are possible with traditional exchanges
discov-This type of exchange fulfills the design principles described earlier, especiallythose related to decentralization Using such an exchange, there is no need torely on a carrier cloud for transport Grid collaborations will be able to managededicated communications resources and services within an environment that theycan customize directly in accordance with their requirements
The infrastructure design used for traditional exchanges is specifically oriented to
a narrow set of limited services As a result, such exchanges require major physicalprovisioning to add new services, enhance existing services, or to remove obsoleteservices The new type of exchange described here will provide a flexible infrastruc-ture with many component resources that can be dynamically combined and shapedinto an almost unlimited number of services
The abstraction layer made possible by the Grid service-oriented approach toarchitecture, combined with new types of network middleware and agile opticalcomponents, allows provisioning to be undertaken through high-level signaling,protocols, and APIs rather than through physical provisioning
These exchanges have the following characteristics
• They are based on a services-oriented network model, and they provide facilities
to support that model, incorporating capabilities that provide direct access tonetwork services Resources are advertised through standard, common servicesmodels
• They are provider and services neutral They allow any legitimate tion service entity to utilize internal resources Because they are based on aservices-oriented architecture, services provisioning, access, and control can beundertaken by multiple service providers, using distributed management andcontrol systems
communica-• They provide a significantly expanded set of services, both a wide range ofstandard, advanced, and novel communication services and other data-orientedservices, including Grid services
• They support ad hoc dynamic services and resource provisioning
• They are highly decentralized They enable multiple capabilities for autonomouspeerings, including ad hoc peerings, among client constituencies, both serviceproviders and other communities such as individual organizations
Trang 1414.5 Open Grid Optical Exchanges 283
• They enable peering to take place at any service or communications level,
including those for data transport services at layer 1
• They provide capabilities for extensive integration and service concatenation at
multiple levels, e.g., layer 1, layer 2, and layer 3, and among multiple services
and resources
• They enable such integration across multiple independent domains
• They allow for direct peer-to-peer connections from specifically designated sites,
and enable those sites to established unique shared network services
Currently, a new architecture for next-generation open exchange points based
on these concepts is being designed, developed, implemented in prototype, and
in some cases being placed into production The most advanced designs for these
exchange points incorporate innovative methods of providing a services foundation
based on dynamic lightpath provisioning Unlike traditional exchanges points, these
exchanges will provide accessible layer 1 transport services as foundation blocks
for all other services This capability will be a key foundation service within such
exchanges
14.5 OPEN GRID OPTICAL EXCHANGES
Because optical services layers within these exchanges will provide for key
foun-dation capabilities, it is appropriate to term these types of exchanges “open grid
optical exchanges.” As noted, these facilities represent a significant departure from
traditional exchanges
There are currently three types of Internet exchanges [5,6]:
(1) LAN-based Internet exchanges These are the most common exchanges, and they
typically implement layer 2 switches at their core, although some exchanges are
distributed This is the only stateless exchange Blocking is possible if multiple
peers want to send traffic to the same peer at the same time
(2) ATM-based Internet exchanges These are statefull exchanges, with Permanent
Virtual Circuits (PVCs) at their core If Variable Bit Rate (VBR) circuits are used,
there is no guaranteed congestion-free transmission The use of Constant Bit
Rate (CBR), on the other hand, results in very inefficient use of resources and
poor scalability
(3) MPLS-based Internet exchanges MPLS exchanges are both stateful and packet
based Though the service may be congestion free, it requires a lookup at the IP
routing table at the ingress edge Label Switching Router (LSR) This approach
results in a relatively expensive operation for very-high-bandwidth datastreams
Because of the properties of these types of exchanges, they cannot provide many
services required by Grids These services are not sufficient to support even a few very
Trang 15high-bandwidth flows that do not require routing Either it is technically impossible(for LAN-based Internet exchanges) or it yields unnecessary and costly overhead(for ATM- and MPLS-based Internet exchanges) Also, such exchanges provide nocapabilities for external entities to implement and customize services dynamically.
This section describes the rationale and model for an exchange that can be termed
an “open Grid optical exchange.” There are multiple reasons for developing this type
of exchange, including those that are derived from the design principles described
in earlier sections This section also describes the basic interfaces and services thatsuch an optical exchange may provide
The case for an open Grid optical exchange is derived directly from a number ofkey requirements, including the need for direct access to services at lower layers
A growing requirement has been recognized for transport services that allow datatraffic to remain at lower layers (layer 2, layer 1) and not be converted, for examplefor routing through other layers Also, there are reasons based on requirementsfor service quality, and others that relate to taking advantage of multiple newoptical innovations First, optical connection-oriented solutions can ensure highquality, e.g., they can be guaranteed congestion free This feature eliminates anyneed to impose complex QoS technologies In addition, connection-oriented linksallow users to use more efficient, but TCP-unfriendly, transport protocols Suchflows would be problematic in a common layer 3 exchange Although other tech-nologies also attempt to offer high-quality services with connectionless transport,such as Multi-Protocol Label Switching (MPLS) and Optical Burst Switching (OBS),these services do not match the performance of optical services for many requiredGrid services
Second, there is an economic argument for this type of exchange – oriented technologies will always remain an order of magnitude cheaper becausethere is no need to look into headers at the datastream itself Even as traditionalfacility components are becoming less expensive, optical components are also contin-uing to fall in price, and the relative price ratio is at least remaining constant(although there is an argument that optical component prices are falling faster).Many new technologies are being developed [7], such as Optical Add-Drop Multi-plexers (OADM)s and advanced photonic devices that will provide powerful newcapabilities to these environments Furthermore, the long-term forecast for opticalnetwork components is that they will become increasingly less expensive, includingsuch key devices as optical shift registers [8] If current research projects developoptical routers, this type of facility would be able to take immediate advantage of theircapabilities
connection-Third, it is more efficient to keep traffic that does not need any high-levelfunctionality at the lowest layer possible In particular, this will apply to relatively long(minutes or more) and high-bandwidth (gigabps or more) datastreams between alimited number of destinations Examples of these requirements are data replicationprocesses and instrumentation Grids where huge datastreams must be collected at acentral place (like the eVLBI project [9]) The packets in these streams do not need
Trang 1614.5 Open Grid Optical Exchanges 285
routing One knows the data has to go from this source to that destination In general,
this is called the service approach: apply only the required services, nothing more
Fourth, even with a limited number of destinations, it is very likely that
data-intensive traffic streams will traverse multiple network domains Considering
scal-ability issues, it is likely that the most cost-effective means for peering among the
domains will be at optical peering locations
The next sections of this chapter describe the concept of an optical exchange, and
they provide an example of how such an exchange may be implemented Different
applications require different transport services, including layer 1 services [10],
ranging from layer 0 to layer 3 There could be many types of optical exchanges
For example, one could be a trivial co-location site that provides no more
function-ality other than rack space and the ability for providers to have bilateral peerings
with other providers at the same co-locations Others can be highly complex with
extensive services Instead of describing four or more different potential exchanges,
a generic architecture for an exchange is described, consisting of a “black box” with
advertised interfaces and services (Figure 14.1)
The “black box” interfaces can implement different services and protocols For
example, one interface may be used to carry undefined traffic over 32 wavelengths
using Dense Wavelength Division Multiplexing (DWDM), while an other interface may
carry only one signal at 1310 nm, which carries SDH-framed traffic A third interface
may be LAN-PHY-based Ethernet, where traffic must reside in the 145.146.100.0/24
subnet
The basic functionality of any exchange is to transport traffic from one
organiza-tional entity to another, usually a provider For an optical exchange, this requirement
implies that the main function is providing cross-connections among interfaces,
forming circuits The different interfaces require the optical exchange to extract a
given signal from a provider circuit and inject it in another provider circuit using
some form of multiplexing
Optical Exchange
Interfaces
Services External
connections
Figure 14.1. Components of an optical exchange
Trang 17For example, if one provider uses DWDM with 50 GHz spacing, and another uses
100 GHz spacing, it will probably be necessary to first demultiplex (demux) andthen multiplex (mux) the signal again If a signal enters at one layer and leaves at
a different layer, the optical exchange effectively acts as an “elevator,” lowering orraising traffic to different layers For example, if a signal comes in as a wavelengthusing DWDM, and is injected in a vLAN, the signal is elevated from layer 1 tolayer 2
The first type of service can be described as providing cross-connections Thesecond type of service would be providing capabilities for muxing and demuxingbitstreams Other more complex services may also be required, such as aggregation
of traffic using a switch and optical multicast and store-and-forward services Anygiven optical exchange may provide only a subset of all possible services However, atsuch an exchange, if other services are not offered by the exchange itself, they may beoffered by a service provider that is connected at the exchange, as Figure 14.2 shows
An important point is that there is no technical obligation to place one set ofservices in the optical exchange domain (theinternal services) and another set in a
services domain (theexternal services) The decision as to which service should be
an internal and which external service is a no technical one determined by other,external, considerations
As noted, an optical exchange may accept one or more types of interfaces The listbelow describes common interfaces
At layer 0, only single-mode fibers are considered Specifically ignored hereare multimode fiber and electrical (UTP) carriers, because currently within opticalexchanges such as NetherLight [11] and StarLight [12], there is a trend toward single-mode fiber and away from multimode fiber and UTP There are three likely causes forthis trend First, optical cross-connects with MEMS switches absorb light at 800 nm,and thus cannot deal with multimode fiber Secondly, DWDM use is increasing, and
Optical exchange
External connections
External services
Services domain
Internal service
Figure 14.2. Internal and external services
Trang 1814.5 Open Grid Optical Exchanges 287
DWDM is possible only with single-mode fiber Third, single-mode has a wider range
of operation (a few hundred kilometers) than multimode (about 2 km)
At layer 1, each fiber may either carry a single bitstream within the 1260–1675 nm
range or multiple bitstreams of data using separate DWDM wavelengths (lambdas)
Each bitstream (lambda) can use one of the following sampling and framings:
(1) a bitstream, at a certain wavelength, with up to 10.1 GHz or up to 40 GHz
sampling, without other known properties;
(2) a bitstream, at a certain wavelength, with SONET/SDH framing, either OC-48
(2.5 Gbps), OC-192 (10 Gbps), or OC-768 (40 Gbps);
(3) a bitstream, at a certain wavelength, with 1 Gbps Ethernet;
(4) a bitstream, at a certain wavelength, with LAN PHY Ethernet;
(5) a bitstream, at a certain wavelength, with WAN PHY Ethernet
Ethernet is a major service that scales from the local to the WAN scale Each fiber
typically carries one wavelength, or it carries 32 or 64 wavelengths though DWDM
The speed is typically either 1 Gbps or 10 Gbps LAN-PHY variant
From the regional to global scale, typically a single SONET/SDH or WAN-PHY signal
is used per carrier The SONET speed may be OC-48 (2.5 Gbps), OC-192 (10 Gbps),
or OC-768 (40 Gbps) The reason that TDM (with SONET or SDH) is used on a
world-wide scale is that providers of transoceanic links require customers to comply with
SONET/SDH framing Note that WAN-PHY is compatible with SONET framing and also
with SONET optical equipment [13] Thus, WAN-PHY can be used on transoceanic links
There are many ways to multiplex signals in one carrier using DWDM, because
of the variety of wavelength bands and channel spacings Similarly, not all vendors
encapsulate Ethernet in the same way over a SONET/SDH connection An optical
exchange must have this type of information to determine the correct service to use
Of course, if no demultiplexing is required, then the optical exchange does not need
this information
A major issue when connecting two fibers together is to correctly tune the power
levels Especially with single-mode fibers, a single speck of dust can ruin signal
strength When making and breaking connections between two carriers dynamically,
using an optical cross-connect, the signal strength should have a known, predefined
value when leaving the optical exchange domain
The protocol used on each circuit either is unknown to the optical exchange or
may be specified by:
(1) IP over Ethernet (note that if all interfaces are of this type, the optical exchange
would be reduced to a LAN-based Internet exchange);
(2) Ethernet implemented with 802.1q (vLAN tagging) [14]
Most end-to-end connections are full-duplex, even if the application does not
require dedicated bandwidth in both directions Most applications do not support a
one-way connection, but a relatively easy way to optimize resource use is to support
asynchronous connections At a minimum, an optical exchange must be aware if an
interface is always full-duplex or supports one-way connections
Trang 1914.5.5 OPTICAL EXCHANGE SERVICES
This section describes possible services for an optical exchange However, not all
of these services have to be offered by the optical exchange itself: some may beoutsourced to a services provider connected to the optical exchange, or may not beoffered at all The abbreviations are used in the service matrix below
(a) Cross (connect) Given two interfaces of equal type, be able to make a
cross-connect between these interfaces Typically, this should be done in a user- ortraffic-initiated way by a software control plane However, here this limitation
is not imposed
(b) Regenerate Amplify or attenuate the power levels, to match a certain output
power level; amplify and reshape; or reamplify, reshape, and retime (3R).(c) convert Wavelength conversion, by regenerating the signal or by physically
altering the wavelength Regenerating, using tunable transponders, may allownetwork engineers to monitor the Bit Error Rate (BER) of a signal, but requiresthe regenerator to understand the modulation and possible framing of thesignal
(d) WDM mux/demux Multiplex wavelengths of different color into a single carrier,
and demultiplex different signals in a single carrier into many separate fibers.This process does not need to convert the wavelengths itself An advanceddemultiplexer may first demux the signal into sets of wavelengths, called wave-bands, before the wavebands are demuxed into individual wavelengths [15].Also, not all multiplexers are compatible, since different optical bands andchannel spacings may be used
(e) (Optical) multicast The ability to duplicate an optical signal as-is Of course,
this can only be done one-way Possible uses include visualization
(f) TDM mux/demux The ability to extract an Ethernet signal from a SONET or
SDH carrier or to insert one or more Ethernet connections in a SONET or SDHcarrier It should be noted that not all SONET/SDH multiplexers are compatible.(g) SONET (switching): The ability to combine and switch SONET or SDH circuits,
without knowing the contents of the circuits
(h) Aggregate There may be different needs for an Ethernet switch in an optical
exchange First, it allows aggregation of traffic, although this may cause tion
conges-(i) (Ethernet) conversion A second use for an Ethernet switch is the conversion
between different types of framing The most useful conversion is perhaps theability to convert between LAN-PHY and WAN-PHY
(j) vLAN encap/decap Ethernet encapsulation A third use for an Ethernet switch
is the encapsulation of traffic in a vLAN trunk [14] This allows the combining
of different, separable datastreams on a single link
The combination of DWDM support, wavelength conversion, and cross-connectswill effectively provide for an OAD) facility Multiple services may be combined in asingle device