2 Simulation Methods for Fractal Processes 492.2.2 Advantages and Shortcomings of FBM/FGN Models 2.3.3 Autoregressive Models of Moving Average, ARMAðp; qÞ 682.3.4 Fractional Autoregressi
Trang 2Moscow State Technical University of Service (MSTUS), Russia
John Wiley & Sons, Ltd
Trang 4IN TELECOMMUNICATIONS
Trang 6Moscow State Technical University of Service (MSTUS), Russia
John Wiley & Sons, Ltd
Trang 7Telephone (þ44) 1243 779777
Email (for orders and customer service enquiries): cs-books@wiley.co.uk
Visit our Home Page on www.wiley.com
All Rights Reserved No part of this publication may be reproduced, stored in a retrieval system or transmitted
in any form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except under the terms of the Copyright, Designs and Patents Act 1988 or under the terms of a licence issued by the Copyright Licensing Agency Ltd, 90 Tottenham Court Road, London W1T 4LP, UK, without the permission
in writing of the Publisher Requests to the Publisher should be addressed to the Permissions Department, John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex PO19 8SQ, England, or
emailed to permreq@wiley.co.uk, or faxed to (þ44) 1243 770620.
Designations used by companies to distinguish their products are often claimed as trademarks All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners The Publisher is not associated with any product or vendor mentioned in this book.
This publication is designed to provide accurate and authoritative information in regard to the subject matter covered It is sold on the understanding that the Publisher is not engaged in rendering professional services If professional advice or other expert assistance is required, the services of a competent professional should be sought.
Other Wiley Editorial Offices
John Wiley & Sons Inc., 111 River Street, Hoboken, NJ 07030, USA
Jossey-Bass, 989 Market Street, San Francisco, CA 94103-1741, USA
Wiley-VCH Verlag GmbH, Boschstr 12, D-69469 Weinheim, Germany
John Wiley & Sons Australia Ltd, 42 McDougall Street, Milton, Queensland 4064, Australia
John Wiley & Sons (Asia) Pte Ltd, 2 Clementi Loop #02-01, Jin Xing Distripark, Singapore 129809
John Wiley & Sons Canada Ltd, 6045 Freemont Blvd, Mississauga, ONT, L5R 4J3, Canada
Wiley also publishes its books in a variety of electronic formats Some content that appears in print may not be available in electronic books.
Anniversary Logo Design: Richard J Pacifico
British Library Cataloguing in Publication Data
A catalogue record for this book is available from the British Library
ISBN 978-0-470-01486-8 (HB)
Typeset in 10/12 pt Times by Thomson Press (India) Limited, New Delhi
Printed and bound in Great Britain by Antony Rowe Ltd, Chippenham, England.
This book is printed on acid-free paper responsibly manufactured from sustainable forestry
in which at least two trees are planted for each one used for paper production.
Trang 8Foreword xi
1 Principal Concepts of Fractal Theory and Self-Similar Processes 1
1.4.2 Frequency Domain Methods of Hurst Exponent
Trang 92 Simulation Methods for Fractal Processes 49
2.2.2 Advantages and Shortcomings of FBM/FGN Models
2.3.3 Autoregressive Models of Moving Average, ARMAðp; qÞ 682.3.4 Fractional Autoregressive Integrated Moving Average
2.4.7 Fractal Binomial Noise Driven Poisson Process (FBNDP) 962.4.8 Fractal Shot Noise Driven Poisson Process (FSNDP) 97
2.5 Fractional Levy Motion and its Application to Network
2.6.2 Modified Estimation Method of Multifractal Functions 112
3.2 Statistical Characteristics of Telecommunication Real Time Traffic 124
Trang 103.3 Voice Traffic Characteristics 130
3.3.2 Voice Traffic Characteristics at the Packet Layer 133
3.4.2 Algorithm for the Partition Function SmðqÞ Calculation 1393.4.3 Multifractal Properties of Multiplexed Voice Traffic 1403.4.4 Multifractal Properties of Two-Component Voice Traffic 142
3.5.3 Estimation of Semi-Markovian Model Parameters and the Modelling
3.5.4 Mathematical Models of Voice Traffic at the Packets Layer 148
3.6.2 Parameters Choice of Pareto Distributions for Voice Traffic Source in ns2 155
3.6.4 Results of Traffic Multiplexing for the Separate ON/OFF Sources 157
3.9.2 Model of the Video Traffic Scene Changing Based on the
3.9.3 Video Traffic Models in the Limits of the Separate Scene 200
3.9.5 MPEG Data Modelling Using I, P and B Frames Statistics 206
4.2.1 Experimental Investigations of the Ethernet Traffic Self-Similar Structure 213
Trang 114.3 Self-Similarity of WAN Traffic 218
4.3.2 Some Limiting Results for Aggregated WAN Traffic 2194.3.3 The Statistical Analysis of WAN Traffic at the Application Level 221
5 Queuing and Performance Evaluation of Telecommunication
5.1 Traffic Fractality Influence Estimate on Telecommunication
5.1.2 Communication System Model and the Packet Loss Probability
Estimate for the Asymptotic Self-Similar Traffic Described by
5.1.4 Estimate of the Effect of Traffic Multifractality Effect on Queuing 2575.2 Estimate of Voice Traffic Self-Similarity Effects on the IP Networks
5.3.2 Telecommunication Network Parameter Optimization on the Basis of
the Minimization of the Discrepancy Functional of QoS Characteristics 271
5.4 Estimation of the Voice Traffic Self-Similarity Influence on QoS
5.4.1 Packet Delay at Transmission through the Frame Relay Network 283
Trang 125.4.2 Frame Relay Router Modelling 283
Trang 14At the very beginning of development of the telephony technique twin-wire telephone linesstarted to entangle our world At first the signals were transferred by the human voice and thedata on the called number, and the spectrum width did not exceed 3.4 kHz The popularity andthe necessity to improve the telephone communication lines immediately attracted the attention
of communications engineers and experts dreaming of increasing the activity factor of thealready laid communication lines and aspiring to (as they say today) ‘multiplex’ the messagesand to use the channel simultaneously and frequently
As radio engineering and radio communication developed, the problem to increase theinformation capacity for radio channels became as acute as that for wire communication.The initial studies in the 1930s were oriented towards analysis of the discretization (the firststage when numeralizing the analogous signals) of the transferred message: the analogousmessage transformation (e.g the slow voice) into the digital signal and further digital transfer inthe multiplexed mode, with the transformation into the initial analogous form at the receivingside As a result, the so-called ‘sampling theorem’ acquired special significance for the process
of numeralization and further reconstruction of the initial message, the 70th anniversary ofwhich was celebrated in Russia in 2003 Various experts have connected this theorem with thenames of V.A Kotelnikov, Cl.E Shannon, H Nyquist, H Raabe, W.R Bennett, I Someya, andE.T Whittaker In 1999 the German Professor Hans Dieter Luke (from Aachen University)1recognized the importance of the Russian expert in radio engineering and radio physics,Vladimir A Kotelnikov, and published an excellent research on the history of this problem.The sampling theorem (Kotelnikov’s theorem) allowed researchers and engineers to approachextreme possibilities in the communication network and to initiate the development of variousbranches of science and technology, such as, for example, the theory of the potential noiseimmunity of the radio and wire communication channels
The twentieth century can easily be characterized by the growth of the need for informationalexchange in accordance with the geometric series, which naturally required the channels totransfer this information When finally high-capacity and branching communication networkswere formed, the researchers and engineers of the telecommunication networks faced abso-lutely new problems Under conditions of rapid technological development, leading to growth
of the processing speed of both the computer systems and the communication channels andsystems as a whole, the number of users steadily increased Since the users download thecommunication networks in their professional activity (remote job, distant education, IP-telephony, etc.) as well as in their spare time (web, music, games, chats, etc.), the list ofclaimed services using the telecommunication networks and their information capacity grewvery rapidly
1
H.L Luke, ‘The origins of the sampling theorem’, IEEE Communication Magazine, 37(4), April 1999, 106–108.
Trang 15Unfortunately, technological improvement does not keep pace with users’ needs and thesituations of communication channel overload occur more and more frequently, which leads toinformation transfer delays and in the worst cases to its loss The users cannot and should notknow the reasons for the caused discomfort: they have concluded the agreement, paid for theservice and they have the right to demand a high-quality service.
In order to find a compromise between the growing needs in communication networkresources and their limited possibilities it is necessary to apply well-engineered algorithms
of control and regulation of the informational flows Therefore, the problems of the optimal use
of telecommunication channels acquire a different aspect, and the priority allocation and thequeue in inquiries and answers become first and foremost The message volume among usersgrows considerably or, as the specialists say, the traffic dynamics in the channels becomesmultiplexed and complicated Therefore, the problems of optimal traffic control and theinvestigation of new traffic features caused by the huge users and services volume in thenetworks are becoming especially important
One of these features is connected with the nature of the traffic as a time process, which moreand more acquires the features of so-called ‘fractals’ Many fractals have self-similar char-acteristics and, generally speaking, these concepts are closely related to each other Inmathematical language the self-similar feature results in an exact or probabilistic replication
of the object characteristics when considered on different scales The self-similar feature leads
to definite regularities in the traffic statistical behaviour and to the necessity to consider theprobability of complicated stochastic processes Then the traffic itself heads to be described as apeculiar dynamic system by so-called ‘fractal’ or chaotic models In worldly language this canmean that the traffic possesses the features to save the basic distinctive patterns irrespective ofthe periods when it is analysed The process becomes ‘similar to itself’, just as a fern leaf looks
so much like the other leaves that they seem to us to be absolutely similar The fern leaf patterncan be found in almost every book as being related to the fractal phenomena because thisexample has already become canonical for an explanation of fractal features, such as the UKcoastline or Koch curves
Chaotic consideration of the most plentiful processes in our lives has probably become one ofthe most attractive and ‘fashionable’ scientific tendencies in the past decades These are theprocesses in biology, in medicine, in mathematics, in economics, in forecasting and in tele-communications It is most likely that in future it will be impossible to analyse any complicatedsystems without using the chaotic approach
The aim of this book is to try to investigate the self-similar processes in the tion network application, to present some more or less generalized understanding of manypublications of the past 10 to 20 years connected with it, to acquaint readers having an activeinterest in the main approaches in this interesting and complicated direction, to give a review ofpreviously obtained and these new results and, ‘to open the door’ to versatile specialists in thisnew and fascinating field of research activity
telecommunica-The authors are very well aware of the fact that the desire to explain visually the new andcomplicated ideas in this area, where even the terminology is hardly settled and we ourselvesare yet very far from a full understanding, may not be rewarding Even more so, the authors areoriented towards a wide audience: the students, the engineers, the researchers, the commu-nications experts and the communication network equipment designers Naturally, this bookcan arouse a sharply critical assessment by many experts on traffic and the evident displeasure
of those whose scientific research lines are either not reflected in the book or described
Trang 16taciturnly Others may be dissatisfied who had decided that they could understand the issueswithin a couple of evenings and for them the considered problems may have turned out to behard to perceive and mathematically complicated Nevertheless, we shall consider our aimfulfilled if the reader becomes interested in self-similar processes and the number of experts inthis perspective and steadily developing field increases It often happens that various experi-enced specialists working alongside in a certain direction for some reason enrich each other,causing the most unexpected ‘singular’ processes to occur, which stepwise could lead todefinite revolutions in standard scientific approaches There is every expectation that thiswill happen in the promising field of self-similar processes.
The term ‘fractal’ was first introduced by Benua Mandelbrot As we have already mentioned,self-similar processes closely follow the fractals They describe the phenomenon in which someobject feature (e.g some image, voice, digital telecommunication message, time series) ispreserved with varying space or time scales If the analysed object is self-similar (or fractal), itsparts (fractions) are similar when increased (in a certain sense) to their full image In contrast tothe deterministic (sharply and unambiguously defined) fractals, the stochastic fractal processeshave no evident similarity to the component parts in the finest detail, but in spite of this,stochastic self-similarity is the feature that can be illustrated visually and can be estimatedmathematically definitely enough (H.E Hurst)
In most cases it is enough to use the statistical characteristics of the second order, well known
to telecommunication networks experts, for a quantitative estimation and description of thebursty (pulsating) structure (or changeability) of the stochastic fractal processes As a result,the usual correlation function of the process plays rather an important part, being essentially themain criterion, with the help of which the scaling invariance of similar processes, i.e self-similarity, is successfully determined The existence of the correlation ‘within range’ canusually be characterized by the term ‘long-range dependence’ The distinctive difference of theself-similar process correlation function compared to the usual process correlation function isthat for the former the correlation, as the time delay function, assumes the polynomial delayrather than the exponential delay
In the telecommunication applications the measured traffic traces (routes) correspond to thestochastic self-similarity (fractality) features It is assumed here that the traffic form withthe corresponding amplitude normalization is the conformity measure It is difficult to observethe clear structure of the measured traffic traces, but self-similarity allows consideration to
be taken of the stochastic nature of many network devices and events that together influence thenetwork traffic If the viewpoint that the traffic series is a sample of the stochastic processrealization is accepted and the conformity degree is weakened, i.e some statistical character-istic of the re-scaled time series is chosen, it would then possible to obtain the exact similarity ofthe mathematical objects and the asymptotic similarity of its specific samples regarding thisweakened similarity criteria
The telecommunication traffic self-similarity as an independent scientific direction wasformed very recently The essential contributions in this direction were made by J Beran, M.Crovella, K Park, W Willinger, P Abry, M.S Taqqu, V Teverovsky, W.E Leland, J.R Wallis,P.M Robinson, C.F Chung, V Paxson, S Floyd, S.I Resnick, R Riedi, J.B Levy, J.W Roberts,S.B Lowen, I Norros, B.K Ryu, G Samorodnitsky and many others The investigationsfulfilled by these authors are quite extensive and the results are significant
The book is divided into two parts
Trang 17In the first part (Chapters 1 and 2) the theoretical aspects of the self-similar (fractal andmultifractal) random processes are considered The main definitions necessary to understandthe rest of the book are given The current state as well as the problems related to the self-similarprocess description are analysed, and it is explained why the traffic in modern telecommunica-tion systems should be considered fractal.
The second part (Chapters 3 to 5) is devoted to the theoretical aspects of the best-knownmodels demonstrating self-similar features These models are considered also from the point ofview of their software realization (the algorithms and modelling results, etc., are given) Themain theoretical results relating to each model are presented and discussed Various approachesused to estimate the traffic fractal and multifractal features are considered
In Chapters 3 and 4 the traffic of the real telecommunication and computer network isanalysed in detail Chapter 3 is devoted to the self-similarity research of the real time traffic towhich the traffic, created by the voice and video services, is referred On the basis of an analysis
of the traffic experimental results the characteristics of the description and self-similar ties are studied, including mono- and multifractal characteristics The traffic self-similarity inLAN (Ethernet) and WAN (Internet) is analysed in Chapter 4 with an account of the transport(TCP/IP) and application (HTTP, UDP, SMTP, etc.) protocol levels
proper-In Chapter 5 the features of the self-similarity influence on the quality of service estimates onthe examples of voice services are analysed The traffic control aspects under conditions of itsself-similarity and the long-range dependence are discussed To do this, the informationextracted over large time scales is used, which can be applied to correct the control mechanisms
of the network resources
In particular, it is shown that the queue length distribution in the infinite system buffer in thelong-range dependent input process decays slower than exponentially (or subexponentially).Conversely, for short-range dependence at the input the decay has an exponential character Thequeue length distribution illustrates that the buffering (as the strategy providing the resources)
is ineffective from the point of view of the occurrence of disproportionate delays, when theinput traffic is self-similar
From the position of traffic control, self-similarity implies the existence of the correlationstructure over the time interval, which can be used for traffic control To do this the informationextracted from the large time scales can be used to correct the overload control mechanisms
In spite of the obviously mathematical consideration of many aspects of the self-similarityand stochastic phenomena used by many authors, this book, in the authors’ opinion, is notoverloaded with mathematical expressions, and in a number of cases it will act as a referencebook for specialists That is why it can be recommended to readers at large who are interested intelecommunication and computer technologies The interest of potential students in this bookcan be related to the specific lecture courses (standard or short) or parts of other courses devoted
to self-similar processes Acronyms used in the book are explained at the end in appendix B.The authors would appreciate any comments concerning this book
Trang 18OLEG I SHELUHIN
Oleg I Sheluhin was born in 1952 in Moscow, Russia
In 1974 he graduated from the Moscow Institute of
Transport Engineers (MITE) with a Master of Science
Degree in Radio Engineering After that he entered the
Lomonosov State University (Moscow) and graduated
in 1979 with a Second Diploma of Mathematics He
received a PhD (Techn.) at MITE in 1979 in Radio
Engineering and Dr Sci (Techn.) at Kharkov Aviation
Institute in 1990 The title of his PhD thesis was
‘Investigation of interfering factors influence on the
structure and activity of noise short-range radar’ and
the Dr Sci thesis, ‘Synthesis, analysis and realisation
of short-range radio detectors and measuring
sys-tems’
Oleg I Sheluhin is a member of the International
Academy of Sciences of Higher Educational
Institu-tions He has published 15 scientific books and textbooks for universities and more than 250scientific papers Since 1990 he has been the Head of the Radio Engineering and Radio SystemsDepartment of Moscow State Technical University of Service (MSTUS)
Oleg I Sheluhin is the Chief Editor of the scientific journal Electrical and InformationalComplexes and Systems and a member of Editorial Boards of various scientific journals In 2004
he was awarded the honorary title ‘Honoured scientific worker of the Russian Federation’ by theRussian President
His scientific interests are radio and telecommunication systems and devices
Trang 19SERGEY M SMOLSKIY
Sergey M Smolskiy was born in 1946 in Moscow In
1970 he graduated from the Radio Engineering Faculty
of the Moscow Power Engineering Institute (MPEI) In
the same year he began work at the Department of Radio
Transmitting Devices of MPEI After concluding his
postgraduate study and his PhD thesis in 1974
(‘Quasi-harmonic oscillations stability in autonomous and
synchronized high-frequency transistor oscillators’)
he continued research at the Department of Radio
Transmitting Devices, where he was engaged in
theo-retical and practical questions concerning the
transmit-ting stages of short-range radar development and in
various questions concerning transistor oscillators
the-ory and microwave oscillations stability He was
nomi-nated as the scientific supervisor of many scientific
projects, which were carried out under the decrees of
the USSR Government In 1993 he presented his thesis
for the Dr Sci (Techn.) degree (‘Short-range radar systems on the basis of controlled tors’) and became a Full Professor
oscilla-He has been the Chairman of the Radio Receivers Department of MPEI since 1995 Hispedagogical experience extends over twenty years He is a lecturer in the following courses:
‘Radio Transmitting Devices’, ‘Systems of Generation and Control of Oscillations’, linear Oscillations Theory in Radio Engineering’, ‘Analysis Methods for Non-linear Radio-Electronic systems’and ‘Autodyne Short-Range Radar’
‘Non-The list of his scientific publications and inventions contains 170 scientific papers, ten books,three copyright certificates of USSR inventions and more than 90 scientific and technicalreports at various conferences, including international ones He is a member of the InternationalAcademy of Informatization, the International Academy of Electrical Engineering Sciences,the International Academy of Sciences of Higher Educational Institutions, a member of theIEEE and an Honorary Doctor of several foreign universities He was awarded the State Order
of Poland for merits in preparation of the scientific staff, the title of ‘Honoured Radio Engineer’and the title of ‘Honoured worker of universities’
His scientific work during the last ten years has been connected with conversion directions ofshort-range radar system engineering, radio-measuring systems for the fuel and energyindustry, systems of information acquisition and transfer for industrial purposes with the use
of wireless channels and systems of medical electronics
Trang 20ANDREY V OSIN
Andrey V Osin was born in 1980, received a Bachelor
Degree in 2001 and an Engineer Degree in 2002 in Radio
Engineering at the Moscow State Technical University
of Service He entered a three-year PhD course and
successfully presented his PhD thesis (‘The influence
of voice traffic self-similarity on quality of service in
telecommunication networks’) in 2005 in the speciality
‘Telecommunications systems, networks and devices’ at
Moscow Power Engineering Institute (Technical
Uni-versity)
At present Dr A Osin works as the senior lecturer at
Moscow State Technical University of Service (Radio
Engineering and Radio Systems Department) and
deli-vers the lecture courses ‘Radio Engineering System
Modelling’, ‘The Bases of Computer Modelling and
Computer Design of the Radio Engineering Sets’ and
‘CAD Systems in Service Activity’ He works with postgraduate and PhD students on mutualresearch Three Bachelor final projects and five Engineering projects were fulfilled under hissupervision
He has published 11 scientific papers and is the co-author of two books in the field oftelecommunication systems modelling He prepared 12 scientific reports at various conferencesheld in Russia
Trang 22The authors would like to express their thanks to colleagues from the Department of RadioEngineering and Radio Systems at the Moscow State Technical University of Service and fromthe Department of Radio Receivers at the Moscow Power Engineering Institute (MPEI) foruseful discussions of manuscript material They appreciate the help given by Lydia Grishaeva incorrecting the English text of the manuscript.
Trang 24Principal Concepts of Fractal
Theory and Self-Similar Processes
1.1 Fractals and Multifractals
B Mandelbrot introduced the term ‘fractal’ for geometrical objects: lines, surfaces and spatialbodies having a strongly irregular form These objects can possess the property of self-similarity The term ‘fractal’ comes from the Latin word fractus and can be translated asfractional or broken The fractional object has an infinite length, which essentially singles it out
on the traditional Euclidean geometry background As the fractal has the self-similar property it
is more or less uniformly arranged in a wide scale range; i.e there is a characteristic similarity ofthe fractal when considered for different resolutions In the ideal case self-similarity leads to thefractional object being invariant when the scale is changed The fractional object may not beself-similar, but self-similar properties of the fractals considered in this book are observedeverywhere Therefore, when self-similar traffic is mentioned, it will be assumed that its timerealizations are fractals
There is some minimal length lminfor the naturally originated fractal such that at the l lmin
scale its fractional structure is not ensured Moreover, at rather a large scale l > lmax, where lmax
is the typical geometrical size for the object in a considered environment, the fractionalstructure is also violated That is why the natural fractal properties are analysed for l scalesonly, which satisfies the relation lmin l lmax
These restrictions become understandable when the broken (nonsmooth) trajectory of aBrownian particle is used as an example of the fractal On a small scale the Brownian particlemass and size finiteness affects this trajectory as well as the collision time finiteness Takingthese circumstances into consideration the Brownian particle trajectory becomes a smoothcurve and loses its fractal properties This means that the scale (lmin) at which the Brownianmotion can be examined in the fractal theory context is limited by the mentioned factors Whenspeaking about the scale restrictions from above (lmax) it is obvious that the Brownian particlemotion is limited by some space in which this particle is located, e.g the tank with the liquid intowhich the paint particles are injected during the classical experiment of Brownian motionidentification
Self-Similar Processes in Telecommunications O I Sheluhin, S M Smolskiy and A V Osin
Trang 25It is noteworthy that the exact self-similarity property is typical for regular fractals only Ifsome element of chance is included in its creation algorithm instead of the deterministicapproach, so-called random (stochastic) fractals occur Their main difference from the regularones consists in the fact that self-similarity properties are correct only after correspondingaveraging has taken place over all statistically independent object realizations At the same timethe enlarged fractal part is not fully identical to the initial fragment, but their statisticalcharacteristics are the same Network (telecommunications) traffic is often referred to as aclass of self-similar stochastic fractals That is why in the scientific literature the concepts offractal and self-similar traffic are used synonymously when this does not lead to confusion.
1.1.1 Fractal Dimension of a Set
It was mentioned earlier that the fractional dimension presence is a distinctive fractalproperty The fractional dimension concept is now formalized and its calculation approach isevaluated
In accordance with the algorithm from Reference [1] for determination of the Hausdorffdimension Df of the set occupying the area with volume LDf in D-dimensional space, this set isnow covered by cubes having the volume eD f The minimum number of nonempty cubescovering the set is MðeÞ ¼ LD fð1=eÞDf From this expression an approximate estimation of Df
can be obtained:
Df ¼ lim
e!0
ln MðeÞlnð1=eÞ
ð1:1Þ
In practice, to estimate this dimension it is more convenient to use the mathematical structurewell-known as the Renji dimension Dqrelated to the probability piof the test point presence inthe ith cell to power q:
i.e the Renji dimension D0coincides with the Hausdorff dimension (1.1) Due to Dqmonotony
as a function of q, the Renji dimension decreases as a power function and therefore thefollowing inequality is fulfilled: D2 D0¼ D Thus the largest low border of the Hausdorffdimension can be presented as
D2¼ lim
l!0
ln PMðtÞ i¼1 pðeÞ2i
Trang 26Taking this into consideration, the probability piof the test point in the presence of the ith set can
of the linear regression inclination angle for the next points½ln 1=N2ðPMðtÞ
i¼1 Ni2Þ; ln ðeÞ definedfor different e values
1.1.2 Multifractals
Multifractals are heterogeneous fractional objects where, compared to regular fractals, it is notsufficient to introduce only one magnitude, its fractal dimension Df, in a detailed description,the full spectrum of the dimensions whose number is infinite in the general case must be given.The reason for this lies in the fact that, together with purely geometrical features defined by the
Dfmagnitude, these fractals have some statistical properties
Multifractal objects can be described from the formal point of view The fractal object isconsidered to occupy the boundary area £ defined by the size L in Euclidean space with the Ddimension The fractal is represented by a set of N 1 points distributed in a certain manner inthis area at some formation stage It is supposed that N! 1 The full £ area is divided into cellswith l L sides that cover eDunits of the examined space Only the occupied cells will beconsidered, in which at least one point from K points belonging to this fractal is contained Letthe occupied cell index i be changed within i¼ 1; 2; ; NðeÞ, where NðeÞ is the total quantity
of occupied cells depending on the size of the cell side e Assuming that NiðeÞ is the number ofpoints in the cell with index i, the magnitude can be found where
Trang 27relative occupancies pðeÞ ¼ 1=NðeÞ are also the same and the generalized statistical sumbecomes
hetero-dtðqÞ
dq ¼ lim
e!0
PNðeÞ i¼1 pqi ln pi
PNðeÞ i¼1 pqi
ln e
ð1:13Þ
Trang 28This derivative has an important physical characteristic If it is not constant and changes with q,
it is a multifractal
The physical sense of generalized fractal dimensions Dqwill be analysed for several q values
At q¼ 0 it follows from Equations (1.7) that Zð0; eÞ ¼ NðeÞ On the other hand, according to
Equations (1.9) and (1.8a),
Comparing these two equalities gives the relation NðeÞ eD 0 This shows that the D0
magnitude represents the usual Hausdorff dimension of the £ set, which is the roughest fractalcharacteristic and provides no information about its statistical properties For the physical sense
of the D1magnitude it can be shown that
D1¼ lim
e!0
PNðeÞ i¼1 piln pi
The numerator of this expression represents (with the accuracy of a sign) the entropy SðeÞ ofthe fractal set: SðeÞ ¼ PNðeÞ
i¼1 piln pi As a result, the generalized fractional dimension D1
relates to the entropy SðeÞ by
to the initial problem about the distribution of points in the fractal set £, it is possible to concludethat since
the D1quantity characterizes the information required for determination of the point location insome cell In this connection the generalized fractal dimension D1is often called an informa-tional dimension It shows how the information, required to determine the point location,increases as the cell size e tends to zero
1.1.3.1 Properties of the DqFunction
As stated above, a multifractal is characterized by the heterogeneous point distribution amongthe cells At the same time, if the points that constitute the multifractal are distributed uniformlyamong all NðeÞ cells with the probability p ¼ 1=NðeÞ, the entropy of such a distribution would
Trang 29be maximal and equal to
SmaxðeÞ ¼ XNðeÞ
at least remains constant) as q rises: Dq Dq 0 at q0> q The sign of equality occurs, for
example, for the uniform fractal Dqachieves the maximal value Dmax¼ D1as q! 1 andthe minimal value Dmin¼ D1as q! 1
1.1.3.2 Spectrum of Fractal Dimensions
The multifractal concept formulated above represents the heterogeneous fractal In order todescribe it, the set of generalized fractional dimensions Dq, where q possesses any value in theinterval1 < q < 1, was introduced However, Dqvalues are not, strictly speaking, thefractional dimensions in the conventional sense of the word For this reason they are calledgeneralized dimensions
Together with these terms used to characterize the multifractal set the function ofthe multifractional spectrum fðaÞ (the spectrum of multifractal singularities) is often used,with the term fractional dimension describing it best It can be shown that the fðaÞ value isactually equal to the Hausdorff dimension of some homogeneous fractional subset from theinitial £ set, which ensures that it is the dominating contribution in the statistical sum at a given qvalue
The set of probabilities pi, showing the relative occupation e of the cells with which the setunder examination can be covered, may be regarded as one of the main multifractal features.The smaller the size of the cells the smaller is the occupation value For self-similar sets thedependence of pion the cell size e has the power character
where airepresents some exponent (which is different for different cells i) It is known that forthe regular fractal all exponents aiare the same and equal to the fractional dimension Df:
pi¼ 1NðeÞ e
Trang 30Therefore, tðqÞ ¼ Dfðq 1Þ and all generalized fractional dimensions Dq¼ Df in this casecoincide and do not depend on q.
However, for such a complex object as the multifractal, due to its heterogeneity, theprobabilities of the cell occupation piin the common case are different and the aiexponentfor different cells can possess different values In the case of the monofractal, where all aiare thesame (and equal to the fractional dimension Df), the number NðeÞ obviously depends on cell size
e in accordance with the power function Therefore, NðeÞ eD f: The exponent in this case is
defined by the set dimension Df
For the multifractal this situation is not typical and different values of aican be met with theprobability characterized not by the same Dfvalue but by different (depending on a) values ofthe fðaÞ exponent:
nðaÞ ef ðaÞThus the physical sense of the fðaÞ function consists in the fact that it represents the Hausdorffdimension of some homogeneous fractional subset £afrom the initial set £ characterized by thesame probabilities of cell occupancy pi ea Since the fractional dimension of the subset isobviously always smaller or equal to the fractional dimension of the initial set D0, the importantinequality for the fðaÞ function takes place: f ðaÞ D0
As a result it can be concluded that the set of different values of the fðaÞ function (at differenta) represents the spectrum of the fractional dimensions for homogeneous subsets £aof theinitial set £, each of which has its own values of the fractional dimension fðaÞ Since any subsethas only part of the total number of cells NðeÞ, into which the initial set £ is divided, theprobability normalization condition PNðeÞ
i¼1 piðeÞ ¼ 1 is not fulfilled when summing on thissubset only The sum of these probabilities is less than 1 Therefore the probabilities pihavingthe same aivalue are obviously less (or at least have the same order) than the efðai Þvalue, which isinversely proportional to the number of used cells covering this subset (it should be noted that inthe case of the monofractal pi 1=NðeÞ) This results in the important inequality for the f ðaÞ
function and it follows that at all values of a then fðaÞ a
The sign of equality takes place, for example, for the fully homogeneous fractal, where
fðaÞ ¼ a ¼ Df
1.1.4 Legendre Transform
If the interconnection between the fðaÞ function and the tðqÞ function introduced earlier isestablished, it can be shown that the expression for the statistical sum takes the form
Comparing (1.22) with (1.8) gives the conclusion that
According to Definition 1.1,
Dq¼ 1
Trang 31Therefore, if the multifractional spectrum function fðaÞ is known, it is possible to obtain thefunction Dq, and vice versa, knowing Dq, it is possible to obtain function aðqÞ with the help ofthe equation
fðaÞ ¼ qdt
dq t
ð1:26Þ
1.2 Self-Similar Processes
1.2.1 Definitions and Properties of Self-Similar Processes
A random process in discrete time or the time series XðtÞ, t 2 Z, is considered where XðtÞ isinterpreted as the traffic volume (measured in packets, bytes or bits) up to moment t
Definition 1.2
Consider that a real-valued processfXðtÞ; t 2 Rg has stationary increments if
fXðt þ DtÞ XðDtÞ; t 2 Rg ¼dfXðtÞ Xð0Þ; t 2 Rg
for all Dt2 R
The sequence of increments for {X(t), t2 R} at discrete time is defined as
Yk¼ Xðk þ 1Þ XðkÞ, k 2 Z For the purpose of traffic simulation the process XðtÞ is sidered as stationary in the wide sense, applying the restriction that the covariance functionRðt1; t2Þ ¼ M½ðXðt1Þ mÞðXðt2Þ mÞ is invariant with regard to the shift, i.e.Rðt1; t2Þ ¼ Rðt1þ k; t2þ kÞ for any t1, t2, k2 Z It is assumed that two first moments
con-m¼ M½XðtÞ, s2¼ M½XðtÞ m2 exist and are finite for any t 2 Z Here M½ is the averagingoperation, m is the initial moment (mathematical expectation) and s2is the variance (disper-sion) of the XðtÞ process Suppose for convenience that m ¼ 0 Since at the stationary conditionRðt1; t2Þ ¼ R tð1 t2; 0Þ, the covariance function will be designated as R kð Þ and the correlationfactor as rðkÞ ¼ RðkÞ=Rð0Þ ¼ RðkÞ=s2
Definition 1.3 [2]
The real-valued processfXðtÞ; t 2 Rg is self-similar with the exponent H > 0 (H-ss) if, for any
Trang 32distributionsfaHXðtÞ; t 2 Rg, i.e if, for any k 1, t1, t2; ; tk2 R and any a > 0,
Self-similar processes with the self-similar parameter H are marked as H-ss (H-self-similar)
in the literature The nondegenerate self-similar H-ss process cannot be stationary theless, there is an important connection between self-similar and stationary processes.Theorem 1.1 [2]
Never-IffXðtÞ; 0 < t < 1g is an H-ss process, the process
An H-ss process having stationary increments is specifically marked as H-sssi (self-similarprocess with the self-similarity parameter H and stationary increments)
Rðt1; t2Þ ¼1
2f½jt1j2Hþ jt2j2H jt1 t2j2Hs2
Trang 33If XðtÞ is a (nondegenerate) H-sssi process with finite dispersion, then 0 < H < 1 During the
traffic simulation the range 0:5 < H < 1 is of special interest since the H-sssi process XðtÞ with
prohibited due to the stationary condition on the incremental process The range 0 < H < 0:5
can be excluded in practice because in this case the increment process is so-called short-range
dependence (SRD) For practical goals only the range 0:5 < H < 1 is important In this range
the correlation factor for the increment process YðtÞ
m-YiðmÞ¼ 1
mðYimmþ1þ Yimmþ2þ þ YimÞ; m¼ 1; 2;
or in a more convenient form
YkðmÞ¼ 1
mH
Xkm i¼ðk1Þmþ1
Trang 34condition (1.35) or the approximate (less strict) condition (1.36) – is preserved when the timeseries is aggregated.
The form RðkÞ ¼ ½ðk þ 1Þ2H 2k2Hþ ðk 1Þ2H s2=2 is not fortuitous and presupposes an
additional structure (a long-range dependence), which will be discussed later The similarity of the second order (in an exact or approximate sense) is the main structure tosimulate the network traffic
self-There is a connection between the process that is strictly self-similar in the wide sense and theprocess that is self-similar in the narrow sense
Definition 1.6
A process X is called self-similar in the narrow sense (strictly self-similarity), with parameter
H¼ 1 ðb=2Þ, 0 < b < 1, if m1HXðmÞ¼ X, m 2 N, where the sign ‘¼’ designates theequality of finite-dimensional distributions XðmÞ¼ ðX1ðmÞ, X2ðmÞ; Þ is the process X averagedover intervals with length m, the components XðmÞof which are defined by the expression
XkðmÞ¼D 1
mðXkmmþ1þ þ XkmÞ; m; k2 NThe connection between the self-similar process in the wide sense and the self-similarprocess in the narrow sense is analogous to the connection between processes being stationary
in the wide and narrow senses
In addition to the statistical similarity when scaling, the self-similar processes can bedetected due to several equivalent features:
1 They have a hyperbolically decaying covariance function of the following form:
where LðtÞ is the function slowly variable at infinity (i.e limt!1LðtxÞ=LðtÞ ¼ 1 for all
sequential values of the covariance function diverges as
Although in the past the teletraffic analysis was mainly based on SRD models, theconsequences of LRD can be very serious Since LRD is the reason for the appearance ofbursty traffic which exceeds average levels of traffic, this property leads to buffer overflowand generates losses and/or delays
2 The sample variance of aggregated processes decreases more slowly than the magnitude and
is inversely proportional to the sample size If the new time sequencefXiðmÞ; i¼ 1; 2; g,
obtained by averaging the initial sequence fX; i¼ 1; 2; g over the nonoverlapping
Trang 35consecutive intervals of size m, is introduced for analysis, for self-similar processes slowersample variance decay according to the law
s2ðXðmÞ ð2H2Þ; as m! 1 ð1:39Þ
will be typical, while for the traditional (not self-similar) stationary random processes
s2ðfXðmÞi ; i¼ 1; 2; gÞ ¼ s2m1, i.e the decay is inversely proportional to the samplelength This indicates that the statistical characteristics of samples, such as the sample meanvalue and the sample variance, will converge very slowly, especially as H! 1 Thisproperty is reflected in all the measures of self-similar processes and will be considered
in more detail when evaluating statistical characteristics
3 If self-similar processes are examined in the frequency domain the long-range dependencephenomenon leads to the power character of the spectral density near zero:
where 0 < g < 1, L2is a slowly varying function at 0 and SðoÞ ¼ SkRðkÞeikois the spectraldensity Therefore, from a spectral analysis position the long-range dependence means thatSð0Þ ¼ SkRðkÞ ¼ 1, i.e the spectral density tends to þ1 when the frequency o tends to 0
(a similar event is further called 1=f noise) Conversely, the processes with short-range
dependence can be characterized by the spectral density, having a positive and finite value at
o¼ 0
Equations (1.37), (1.39) and (1.40) are related to parameter H, which is called the Hurstexponent The Hurst exponent for self-similar processes is situated between 0.5 and 1 As Happroaches 1 the series becomes more and more self-similar, revealing itself in a slowercovariance decay, as can be seen from Equation (1.37)
1.2.2 Multifractal Processes
In contrast to self-similar processes, multiscaling or multifractal processes ensure a moreflexible law of scaling behaviour The class of multifractal processes embraces all the processeswith the scaling property including self-similar monoscaled and multiscaled processes
Definition 1.7 [3]
The stochastic process XðtÞ is called multifractional if it has stationary increments and satisfiesthe equality
for some positive q2 Q; ½0; 1 Q, where tðqÞ is called the mass parameter (scale function)
and the moment factor cðqÞ does not depend on t
The obvious consequence is that tðqÞ is a convex function If tðqÞ depends on q linearly, theprocess is referred to as one-scaled or monofractal; otherwise it is multifractal It can beshown that in the specific case of the self-similar process with parameter H, then tðqÞ ¼ qH 1and cðqÞ ¼ M½jXð1Þjq The class of multifractal processes also includes monofractal andself-similar cases
Trang 361.2.3 Long-Range and Short-Range Dependence
So far the role of self-similarity in the stationarity of the second order has been discussed but notmuch has been mentioned about the role of H and its limiting values The definition of long-range dependence and its interconnection with the correlation factor rðkÞ will now be discussedagain
Among the large number of SRD processes in probability theory and in simulation the similar processes are of special interest due to their relation to the limiting theorems and theirrather simple structure
The processfYi; i2 Zg is called the stationary process with short-range dependence if there is
a constant 0 < c0< 1 such that
The asymptotic behaviour of factor rðkÞ results, with the help of expansion in the Taylorseries, in
rðkÞ ¼ Hð2H 1Þk2H2þ Oðk2H2Þ as k! 1 ð1:43Þ
In accordance with Definition 1.8, the process fYi; i 2 Zg for 0:5 < H < 1 is long-range
dependent with exponent a¼ 2 2H in Equation (1.42) This also means that the correlationsare nonsummable:
X1 k¼1
Therefore, when rðkÞ is hyperbolically decaying the appropriate stationary process fYi; i2 Zg,
as can be seen from Equation (1.42), is long-range dependent
Accordingly, thefYi; i2 Zg process is short-range dependent if the normalized correlationfunction is summable ðP1
k¼1rðkÞ ¼ finite < 1Þ It should also be possible to offer anequivalent definition of long-range dependence in the frequency domain, where it is necessary
Trang 37that the process spectral density
SðoÞ ¼ 12p
X1 k¼1
rðkÞeiko; o2 ½p; p; i¼ ffiffiffiffiffiffiffi
1p
satisfies this definition
In accordance with this definition the process fYi; i 2 Zg with 0:5 < H < 1 is long-range
dependent with exponent b¼ 2 H 1 The behaviour of SðoÞ near the origin of coordinates iswell described by the function behaviour at zero:
SðoÞ ¼ cfjoj12Hþ Oðjojminð32H;2ÞÞ ð1:46Þ
Here
cf ¼ 1
2psinðpHÞG ð2H þ 1Þs2 and GðzÞ ¼
Z þ1 0
xz1exdx; z> 0
The approximation S fjojb; o! 0, 0 < b ¼ 2H 1 < 1 is very good, even for
relatively large frequencies As a result, Equation (1.46) can be used in the evaluation of H
in the frequency domain
1.2.4 Slowly Decaying Variance
As shown above, for a self-similar process the variance of sample mean decays more slowly ascompared with the magnitude inverse to the sample size:
s2ðXtðmÞ b; 0 < b < 2H 1 < 1 ð1:47Þ
for rather large m Conversely, for SRD processes the parameter b¼ 1 and
The property of slowly decaying variance can easily be discovered by plotting the m function
s2ðXtðmÞÞ on a log–log diagram (variance–time plot) A straight line with a negative (less than 1)slope in the wide range of m demonstrates slowly decaying variance This property can also bedefined with the help of the index of dispersion for counts (IDC)
Trang 38For a given time interval with length t the IDC is defined as a variance of entrance quantity Atduring length interval t divided by the expectation value of the same magnitude:
1.3.1 Distribution with ‘Heavy Tails’ (DHT)
There is a close connection between long-range dependence and DHT, which will be repeatedlymentioned in this book To begin with some definitions will be introduced and the most typicalcases will be examined A random variable Z has a distribution with a ‘heavy tail’ if theprobability
where 0 < a < 2 is called the ‘tail’ index or form parameter and c is a positive constant In
other words, the distribution ‘tail’ decays according to the hyperbolic law Conversely,distributions with ‘light tails’ (DLTs), e.g exponential and Gaussian, have an exponentiallydecaying ‘tail’ The specific property of DHT consists in the fact that its dispersion for
0 < a < 2 is infinite, but for 0 < a 1 it also has an infinite mean value When considering
network traffic, the case 1 < a < 2 is of most interest Pareto distribution is a frequently
used DHT There are some distributions (e.g Weibull and lognormal) that have nentially decaying tails but a finite variance More generally, it can be said that X has adistribution with a ‘heavy tail’ F if
Trang 39The simplest case of a distribution with a ‘heavy tail’ is the so-called Pareto distribution Inthis case LðxÞ 1; therefore the distribution function for the Pareto distribution can be written
as FðxÞ ¼ 1 xa The difference between exponential tails and heavy tails can be seen inFigure 1.1
The main distinctive feature of the random variable subordinate to DHT consists in the factthat it shows extraordinary changeability In other words, DHT leads to very large values withfinite, in the common case, nonnegligible probability Therefore samples from such distributionhave for the most part ‘small’ values, but a small number of ‘very large’ values occurs as well It
is not surprising that as a! 1 the influence of the ‘heavy tail’ has an effect on the sample bylowering the convergence speed of the sample mean value to the expectation value Forexample, when the sample size is m the sample mean value for the Pareto distributed randomvariable Z may deviate widely from the expectation value, often underestimating it The errormagnitude for estimatingjZm MðZÞj actually behaves approximately as mð1=aÞ1[4] There-fore, when the sample yields to DHT for a values close to 1 there should be concern that theconclusions about the network behaviour and performance related to sample error could bewrong It can be shown that the DHT parameters directly linked to a network (e.g file sizes andduration (lifetime) of the connection) are the reason for long-range dependence and self-similarity in network traffic
A simple case is examined of the predictability related to the ‘heavy tails’ of randomvariables [5] Z is considered as the variable having DHT and can be interpreted as the duration
or lifetime of a connection, e.g the TCP (transmission control protocol) connection or the IP(Internet protocol) stream or session Since the connection duration is a physically measurable
event it can be supposed that the connection is active during t > 0 To simplify the discussion,
the time can be changed to the discrete formðt 2 ZþÞ and A : Zþ! f0; 1g is a point at AðtÞ ¼ 1only when Z t The conditional probability P that the connection continues to exist in thefuture, supporting its activity t, can be estimated as
Trang 40Equation (1.53) can be calculated as
PfZ ¼ tgPfZ tg
c1ec2 t c1ec2 ðtþ1Þ
c1ec 2 t ¼ 1 ec 2
where for large t, L c2is obtained For heavy tails the same calculations will lead to
PfZ ¼ tgPfZ tg
DHT processes can be characterized by the fact that the tail of the distribution decays muchmore slowly than in the case of an exponential distribution It is the main starting point formethods using heavy tail detection Moreover, various research graphic approaches are avail-able, to estimate parameter a
1.3.2.1 Hill’s Estimation
numbers having the distribution function F and X1;n X2;n Xn;nare the order statistics
If F is DHT, Hill’s estimation for index a takes the following form [6]:
^
a¼ ^ak;n¼ 1
k
Xn j¼1
1.3.2.2 Modified QQ graph
The main principle of the QQ (quantile–quantile) graph is based on the following assumption: if
X1 X2 Xkare the samples from the process with the distribution function F, and k islarge enough, F at x¼ Xjcan be estimated in the form
P½x < Xj ¼ FðXjÞ 1 j
kþ 1
... heavy tails can be seen inFigure 1.1The main distinctive feature of the random variable subordinate to DHT consists in the factthat it shows extraordinary changeability In other words, DHT... tail of the distribution decays muchmore slowly than in the case of an exponential distribution It is the main starting point formethods using heavy tail detection Moreover, various research graphic... having DHT and can be interpreted as the duration
or lifetime of a connection, e.g the TCP (transmission control protocol) connection or the IP(Internet protocol) stream or session Since