Walter Gander · Martin J. Gander · Felix Kwok
Texts in Computational Science and Engineering 11
Scientific Computing
An Introduction using Maple and MATLAB
ISSN 1611-0994
ISBN 978-3-319-04324-1 ISBN 978-3-319-04325-8 (eBook)
DOI 10.1007/978-3-319-04325-8
Springer Cham Heidelberg New York Dordrecht London
Library of Congress Control Number: 2014937000
Mathematics Subject Classification (2010): 65-00, 65-01
© Springer International Publishing Switzerland 2014
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. Exempted from this legal reservation are brief excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher's location, in its current version, and permission for use must always be obtained from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance Center. Violations are liable to prosecution under the respective Copyright Law.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
While the advice and information in this book are believed to be true and accurate at the date of publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein.
Printed on acid-free paper
Springer is part of Springer Science+Business Media (www.springer.com)
This book is dedicated to Professor Gene H. Golub, 1932–2007
(picture by Jill Knuth)
The three authors represent three generations of mathematicians who have been enormously influenced by Gene Golub. He shaped our lives and our academic careers through his advice, his leadership, his friendship and his care for younger scientists. We are indebted and will always honor his memory.
We are conducting ever more complex computations built upon the assumption that the underlying numerical methods are mature and reliable.
When we bundle existing algorithms into libraries and wrap them into packages to facilitate easy use, we create
de facto standards that make it easy to ignore numerical analysis.
John Guckenheimer, president of SIAM, in SIAM News,
June 1998: Numerical Computation in the Information Age
When redrafting the book I was tempted to present the algorithms in ALGOL, but decided that the difficulties of providing procedures which were correct in every detail were prohibitive at this stage.
James Wilkinson, The Algebraic Eigenvalue Problem, Oxford University Press, 1988.
This book is an introduction to scientific computing, the mathematical modeling in science and engineering and the study of how to exploit computers in the solution of technical and scientific problems. It is based on mathematics, numerical and symbolic/algebraic computations, parallel/distributed processing and visualization. It is also a popular and growing area — many new curricula in computational science and engineering have been, and continue to be, developed, leading to new academic degrees and even entire new disciplines.
A prerequisite for this development is the ubiquitous presence of computers, which are being used by virtually every student and scientist. While traditional scientific work is based on developing theories and performing experiments, the possibility to use computers at any time has created a third way of increasing our knowledge, which is through modeling and simulation. The use of simulation is further facilitated by the availability of sophisticated, robust and easy-to-use software libraries. This has the obvious advantage of shielding the user from the underlying numerics; however, this also has the danger of leaving the user unaware of the limitations of the algorithms, which can lead to incorrect results when used improperly. Moreover, some algorithms can be fast for certain types of problems but highly inefficient for others. Thus, it is important for the user to be able to make an informed decision on which algorithms to use, based on the properties of the problem to be solved. The goal of this book is to familiarize the reader with the basic
concepts of scientific computing and algorithms that form the workhorses of many numerical libraries. In fact, we will also emphasize the effective implementation of the algorithms discussed.
Numerical scientific computing has a long history; in fact, computers were first built for this purpose. Konrad Zuse [154] built his first (mechanical) computer in 1938 because he wanted to have a machine that would solve systems of linear equations that arise, e.g., when a civil engineer designs a bridge. At about the same time (and independently), Howard H. Aiken wanted to build a machine that would solve systems of ordinary differential equations [17].
The first high quality software libraries contained indeed numerical algorithms. They were produced in an international effort in the programming language ALGOL 60 [111], and are described in the handbook "Numerical Algebra" [148]. These fundamental procedures for solving linear equations and eigenvalue problems were developed further, rewritten in FORTRAN, and became the LINPACK [26] and EISPACK [47] libraries. They are still in use and available from Netlib at www.netlib.org. In order to help students to use this software, Cleve Moler created around 1980 a friendly interface to those subroutines, which he called Matlab (Matrix Laboratory). Matlab was so successful that a company was founded: MathWorks. Today, Matlab is "the language of technical computing", a very powerful tool in scientific computing.
Parallel to the development of numerical libraries, there were also efforts to do exact and algebraic computations. The first computer algebra systems were created some 50 years ago: at ETH, Max Engeli created Symbal, and at MIT, Joel Moses created Macsyma. Macsyma is the oldest system that is still available. However, computer algebra computations require much more computer resources than numerical calculations. Therefore, only when computers became more powerful did these systems flourish. Today the market leaders are Mathematica and Maple.
Often, a problem may be solved analytically ("exactly") by a computer algebra system. In general, however, analytical solutions do not exist, and numerical approximations or other special techniques must be used instead. Moreover, computer algebra is a very powerful tool for deriving numerical algorithms; we use Maple for this purpose in several chapters of this book. Thus, computer algebra systems and numerical libraries are complementary tools: working with both is essential in scientific computing. We have chosen Matlab and Maple as basic tools for this book. Nonetheless, we are aware that the difference between pure computer algebra systems and numerical Matlab-like systems is disappearing, and the two may merge and become indistinguishable by the user in the near future.
How to use this book
Prerequisites for understanding this book are courses in calculus and linear algebra. The content of this book is too much for a typical one semester course in scientific computing. However, the instructor can choose those sections that he wishes to teach and that fit his schedule. For example, for an introductory course in scientific computing, one can very well use the least squares chapter and teach only one of the methods for computing the QR decomposition. However, for an advanced course focused solely on least squares methods, one may also wish to consider the singular value decomposition (SVD) as a computational tool for solving least squares problems. In this case, the book also provides a detailed description on how to compute the SVD in the chapter on eigenvalues. The material is presented in such a way that a student can also learn directly from the book. To help the reader navigate the volume, we provide in Section 1.2 some sample courses that have been taught by the authors at various institutions.
The focus of the book is algorithms: we would like to explain to the students how some fundamental functions in mathematical software are designed. Many exercises require programming in Matlab or Maple, since we feel it is important for students to gain experience in using such powerful software systems. They should also know about their limitations and be aware of the issue addressed by John Guckenheimer. We tried to include meaningful examples and problems, not just academic exercises.
Acknowledgments
The authors would like to thank Oscar Chinellato, Ellis Whitehead, Oliver Ernst and Laurence Halpern for their careful proofreading and helpful suggestions.
Walter Gander is indebted to Hong Kong Baptist University (HKBU) and especially to its Vice President Academic, Franklin Luk, for giving him the opportunity to continue to teach students after his retirement at ETH. Several chapters of this book have been presented and improved successfully in courses at HKBU. We are also thankful to the University of Geneva, where we met many times to finalize the manuscript.
Geneva and Zürich, August 2013
Walter Gander, Martin J. Gander, Felix Kwok
Chapter 1 Why Study Scientific Computing? 1
1.1 Example: Designing a Suspension Bridge 1
1.1.1 Constructing a Model 1
1.1.2 Simulating the Bridge 3
1.1.3 Calculating Resonance Frequencies 4
1.1.4 Matching Simulations with Experiments 5
1.2 Navigating this Book: Sample Courses 6
1.2.1 A First Course in Numerical Analysis 7
1.2.2 Advanced Courses 8
1.2.3 Dependencies Between Chapters 8
Chapter 2 Finite Precision Arithmetic 9
2.1 Introductory Example 10
2.2 Real Numbers and Machine Numbers 11
2.3 The IEEE Standard 14
2.3.1 Single Precision 14
2.3.2 Double Precision 16
2.4 Rounding Errors 19
2.4.1 Standard Model of Arithmetic 19
2.4.2 Cancellation 20
2.5 Condition of a Problem 24
2.5.1 Norms 24
2.5.2 Big- and Little-O Notation 27
2.5.3 Condition Number 29
2.6 Stable and Unstable Algorithms 33
2.6.1 Forward Stability 33
2.6.2 Backward Stability 36
2.7 Calculating with Machine Numbers: Tips and Tricks 38
2.7.1 Associative Law 38
2.7.2 Summation Algorithm by W Kahan 39
2.7.3 Small Numbers 40
2.7.4 Monotonicity 40
2.7.5 Avoiding Overflow 41
2.7.6 Testing for Overflow 42
2.7.7 Avoiding Cancellation 43
2.7.8 Computation of Mean and Standard Deviation 45
2.8 Stopping Criteria 48
2.8.1 Machine-independent Algorithms 48
2.8.2 Test Successive Approximations 51
2.8.3 Check the Residual 51
2.9 Problems 52
Chapter 3 Linear Systems of Equations 61
3.1 Introductory Example 62
3.2 Gaussian Elimination 66
3.2.1 LU Factorization 73
3.2.2 Backward Stability 77
3.2.3 Pivoting and Scaling 79
3.2.4 Sum of Rank-One Matrices 82
3.3 Condition of a System of Linear Equations 84
3.4 Cholesky Decomposition 88
3.4.1 Symmetric Positive Definite Matrices 88
3.4.2 Stability and Pivoting 92
3.5 Elimination with Givens Rotations 95
3.6 Banded matrices 97
3.6.1 Storing Banded Matrices 97
3.6.2 Tridiagonal Systems 99
3.6.3 Solving Banded Systems with Pivoting 100
3.6.4 Using Givens Rotations 103
3.7 Problems 105
Chapter 4 Interpolation 113
4.1 Introductory Examples 114
4.2 Polynomial Interpolation 116
4.2.1 Lagrange Polynomials 117
4.2.2 Interpolation Error 119
4.2.3 Barycentric Formula 121
4.2.4 Newton’s Interpolation Formula 123
4.2.5 Interpolation Using Orthogonal Polynomials 127
4.2.6 Change of Basis, Relation with LU and QR 132
4.2.7 Aitken-Neville Interpolation 139
4.2.8 Extrapolation 142
4.3 Piecewise Interpolation with Polynomials 144
4.3.1 Classical Cubic Splines 145
4.3.2 Derivatives for the Spline Function 147
4.3.3 Sherman–Morrison–Woodbury Formula 155
4.3.4 Spline Curves 157
4.4 Trigonometric Interpolation 158
4.4.1 Trigonometric Polynomials 160
4.4.2 Fast Fourier Transform (FFT) 162
4.4.3 Trigonometric Interpolation Error 164
4.4.4 Convolutions Using FFT 168
4.5 Problems 171
Chapter 5 Nonlinear Equations 181
5.1 Introductory Example 182
5.2 Scalar Nonlinear Equations 184
5.2.1 Bisection 185
5.2.2 Fixed Point Iteration 187
5.2.3 Convergence Rates 190
5.2.4 Aitken Acceleration and the ε-Algorithm 193
5.2.5 Construction of One Step Iteration Methods 199
5.2.6 Multiple Zeros 205
5.2.7 Multi-Step Iteration Methods 207
5.2.8 A New Iteration Formula 210
5.2.9 Dynamical Systems 212
5.3 Zeros of Polynomials 215
5.3.1 Condition of the Zeros 217
5.3.2 Companion Matrix 220
5.3.3 Horner’s Scheme 222
5.3.4 Number Conversions 227
5.3.5 Newton’s Method: Classical Version 230
5.3.6 Newton Method Using Taylor Expansions 231
5.3.7 Newton Method for Real Simple Zeros 232
5.3.8 Nickel’s Method 237
5.3.9 Laguerre’s Method 239
5.4 Nonlinear Systems of Equations 240
5.4.1 Fixed Point Iteration 242
5.4.2 Theorem of Banach 242
5.4.3 Newton’s Method 245
5.4.4 Continuation Methods 251
5.5 Problems 252
Chapter 6 Least Squares Problems 261
6.1 Introductory Examples 262
6.2 Linear Least Squares Problem and the Normal Equations 266
6.3 Singular Value Decomposition (SVD) 269
6.3.1 Pseudoinverse 274
6.3.2 Fundamental Subspaces 275
6.3.3 Solution of the Linear Least Squares Problem 277
6.3.4 SVD and Rank 279
6.4 Condition of the Linear Least Squares Problem 280
6.4.1 Differentiation of Pseudoinverses 282
6.4.2 Sensitivity of the Linear Least Squares Problem 285
6.4.3 Normal Equations and Condition 286
6.5 Algorithms Using Orthogonal Matrices 287
6.5.1 QR Decomposition 287
6.5.2 Method of Householder 289
6.5.3 Method of Givens 292
6.5.4 Fast Givens 298
6.5.5 Gram-Schmidt Orthogonalization 301
6.5.6 Gram-Schmidt with Reorthogonalization 306
6.5.7 Partial Reorthogonalization 308
6.5.8 Updating and Downdating the QR Decomposition 311
6.5.9 Covariance Matrix Computations Using QR 320
6.6 Linear Least Squares Problems with Linear Constraints 323
6.6.1 Solution with SVD 325
6.6.2 Classical Solution Using Lagrange Multipliers 328
6.6.3 Direct Elimination of the Constraints 330
6.6.4 Null Space Method 333
6.7 Special Linear Least Squares Problems with Quadratic Constraint 334
6.7.1 Fitting Lines 335
6.7.2 Fitting Ellipses 337
6.7.3 Fitting Hyperplanes, Collinearity Test 340
6.7.4 Procrustes or Registration Problem 344
6.7.5 Total Least Squares 349
6.8 Nonlinear Least Squares Problems 354
6.8.1 Notations and Definitions 354
6.8.2 Newton’s Method 356
6.8.3 Gauss-Newton Method 360
6.8.4 Levenberg-Marquardt Algorithm 361
6.9 Least Squares Fit with Piecewise Functions 364
6.9.1 Structure of the Linearized Problem 367
6.9.2 Piecewise Polynomials 368
6.9.3 Examples 372
6.10 Problems 374
Chapter 7 Eigenvalue Problems 387
7.1 Introductory Example 388
7.2 A Brief Review of the Theory 392
7.2.1 Eigen-Decomposition of a Matrix 392
7.2.2 Characteristic Polynomial 396
7.2.3 Similarity Transformations 396
7.2.4 Diagonalizable Matrices 397
7.2.5 Exponential of a Matrix 397
7.2.6 Condition of Eigenvalues 398
7.3 Method of Jacobi 405
7.3.1 Reducing Cost by Using Symmetry 414
7.3.2 Stopping Criterion 417
7.3.3 Algorithm of Rutishauser 417
7.3.4 Remarks and Comments on Jacobi 420
7.4 Power Methods 422
7.4.1 Power Method 423
7.4.2 Inverse Power Method (Shift-and-Invert) 424
7.4.3 Orthogonal Iteration 425
7.5 Reduction to Simpler Form 429
7.5.1 Computing Givens Rotations 429
7.5.2 Reduction to Hessenberg Form 430
7.5.3 Reduction to Tridiagonal Form 434
7.6 QR Algorithm 436
7.6.1 Some History 437
7.6.2 QR Iteration 437
7.6.3 Basic Facts 437
7.6.4 Preservation of Form 438
7.6.5 Symmetric Tridiagonal Matrices 439
7.6.6 Implicit QR Algorithm 443
7.6.7 Convergence of the QR Algorithm 445
7.6.8 Wilkinson’s Shift 447
7.6.9 Test for Convergence and Deflation 448
7.6.10 Unreduced Matrices have Simple Eigenvalues 449
7.6.11 Specific Numerical Examples 451
7.6.12 Computing the Eigenvectors 453
7.7 Computing the Singular Value Decomposition (SVD) 453
7.7.1 Transformations 454
7.7.2 Householder-Rutishauser Bidiagonalization 454
7.7.3 Golub-Kahan-Lanczos Bidiagonalization 457
7.7.4 Eigenvalues and Singular Values 457
7.7.5 Algorithm of Golub-Reinsch 458
7.8 QD Algorithm 464
7.8.1 Progressive QD Algorithm 464
7.8.2 Orthogonal LR-Cholesky Algorithm 468
7.8.3 Differential QD Algorithm 472
7.8.4 Improving Convergence Using Shifts 474
7.8.5 Connection to Orthogonal Decompositions 478
7.9 Problems 482
Chapter 8 Differentiation 487
8.1 Introductory Example 488
8.2 Finite Differences 491
8.2.1 Generating Finite Difference Approximations 494
8.2.2 Discrete Operators for Partial Derivatives 496
8.3 Algorithmic Differentiation 499
8.3.1 Idea Behind Algorithmic Differentiation 499
8.3.2 Rules for Algorithmic Differentiation 504
8.3.3 Example: Circular Billiard 505
8.3.4 Example: Nonlinear Eigenvalue Problems 509
8.4 Problems 514
Chapter 9 Quadrature 517
9.1 Computer Algebra and Numerical Approximations 518
9.2 Newton–Cotes Rules 521
9.2.1 Error of Newton–Cotes Rules 525
9.2.2 Composite Rules 527
9.2.3 Euler–Maclaurin Summation Formula 531
9.2.4 Romberg Integration 537
9.3 Gauss Quadrature 541
9.3.1 Characterization of Nodes and Weights 545
9.3.2 Orthogonal Polynomials 547
9.3.3 Computing the Weights 552
9.3.4 Golub–Welsch Algorithm 555
9.4 Adaptive Quadrature 561
9.4.1 Stopping Criterion 563
9.4.2 Adaptive Simpson quadrature 565
9.4.3 Adaptive Lobatto quadrature 569
9.5 Problems 577
Chapter 10 Numerical Ordinary Differential Equations 583
10.1 Introductory Examples 584
10.2 Basic Notation and Solution Techniques 587
10.2.1 Notation, Existence of Solutions 587
10.2.2 Analytical and Numerical Solutions 589
10.2.3 Solution by Taylor Expansions 591
10.2.4 Computing with Power Series 593
10.2.5 Euler’s Method 597
10.2.6 Autonomous ODE, Reduction to First Order System 603
10.3 Runge-Kutta Methods 604
10.3.1 Explicit Runge-Kutta Methods 604
10.3.2 Local Truncation Error 606
10.3.3 Order Conditions 608
10.3.4 Convergence 615
10.3.5 Adaptive Integration 617
10.3.6 Implicit Runge-Kutta Methods 625
10.4 Linear Multistep Methods 631
10.4.1 Local Truncation Error 635
10.4.2 Order Conditions 636
10.4.3 Zero Stability 638
10.4.4 Convergence 643
10.5 Stiff Problems 646
10.5.1 A-Stability 650
10.5.2 A Nonlinear Example 653
10.5.3 Differential Algebraic Equations 655
10.6 Geometric Integration 656
10.6.1 Symplectic Methods 658
10.6.2 Energy Preserving Methods 661
10.7 Delay Differential Equations 664
10.8 Problems 666
Chapter 11 Iterative Methods for Linear Systems 673
11.1 Introductory Example 675
11.2 Solution by Iteration 677
11.2.1 Matrix Splittings 677
11.2.2 Residual, Error and the Difference of Iterates 678
11.2.3 Convergence Criteria 680
11.2.4 Singular Systems 683
11.2.5 Convergence Factor and Convergence Rate 684
11.3 Classical Stationary Iterative Methods 687
11.3.1 Regular Splittings and M-Matrices 687
11.3.2 Jacobi 691
11.3.3 Gauss-Seidel 694
11.3.4 Successive Over-relaxation (SOR) 695
11.3.5 Richardson 702
11.4 Local Minimization by Nonstationary Iterative Methods 704
11.4.1 Conjugate Residuals 705
11.4.2 Steepest Descent 705
11.5 Global Minimization with Chebyshev Polynomials 708
11.5.1 Chebyshev Semi-Iterative Method 719
11.5.2 Acceleration of SSOR 724
11.6 Global Minimization by Extrapolation 726
11.6.1 Minimal Polynomial Extrapolation (MPE) 729
11.6.2 Reduced Rank Extrapolation (RRE) 733
11.6.3 Modified Minimal Polynomial Extrapolation (MMPE) 734
11.6.4 Topological ε-Algorithm (TEA) 735
11.6.5 Recursive Topological ε-Algorithm 737
11.7 Krylov Subspace Methods 739
11.7.1 The Conjugate Gradient Method 740
11.7.2 Arnoldi Process 758
11.7.3 The Symmetric Lanczos Algorithm 761
11.7.4 Solving Linear Equations with Arnoldi 766
11.7.5 Solving Linear Equations with Lanczos 769
11.7.6 Generalized Minimum Residual: GMRES 773
11.7.7 Classical Lanczos for Non-Symmetric Matrices 780
11.7.8 Biconjugate Gradient Method (BiCG) 793
11.7.9 Further Krylov Methods 800
11.8 Preconditioning 801
11.9 Problems 804
Chapter 12 Optimization 817
12.1 Introductory Examples 818
12.1.1 How much daily exercise is optimal ? 818
12.1.2 Mobile Phone Networks 821
12.1.3 A Problem from Operations Research 828
12.1.4 Classification of Optimization Problems 831
12.2 Mathematical Optimization 832
12.2.1 Local Minima 832
12.2.2 Constrained minima and Lagrange multipliers 835
12.2.3 Equality and Inequality Constraints 838
12.3 Unconstrained Optimization 842
12.3.1 Line Search Methods 842
12.3.2 Trust Region Methods 856
12.3.3 Direct Methods 859
12.4 Constrained Optimization 862
12.4.1 Linear Programming 862
12.4.2 Penalty and Barrier Functions 872
12.4.3 Interior Point Methods 873
12.4.4 Sequential Quadratic Programming 877
12.5 Problems 880
Bibliography 887
Index 895
Computational Science and Engineering (CS&E) is now widely accepted, along with theory and experiment, as a crucial third mode of scientific investigation and engineering design. Aerospace, automotive, biological, chemical, semiconductor, and other industrial sectors now rely on simulation for technical decision support.
Introduction to the First SIAM Conference on Computational Science and Engineering, September 21–24, 2000, Washington DC.

The emergence of scientific computing as a vital part of science and engineering coincides with the explosion in computing power in the past 50 years. Many physical phenomena have been well understood and have accurate models describing them since the late 1800s, but before the widespread use of computers, scientists and engineers were forced to make many simplifying assumptions in the models in order to make them solvable by pencil-and-paper methods, such as series expansion. With the increase of computing power, however, one can afford to use numerical methods that are computationally intensive but that can tackle the full models without the need to simplify them. Nonetheless, every method has its limitations, and one must understand how they work in order to use them correctly.
To get an idea of the kinds of numerical methods that are used in engineering problems, let us consider the design of a simple suspension bridge. The bridge consists of a pair of ropes fastened on both sides of the gorge, see Figure 1.1. Wooden supports going across the bridge are attached to the ropes at regularly spaced intervals. Wooden boards are then fastened between the supports to form the deck. We would like to calculate the shape of the bridge as well as the tension in the rope supporting it.
Let us construct a simple one-dimensional model of the bridge structure by assuming that the bridge does not rock side to side. To calculate the shape of the bridge, we need to know the forces that are exerted on the ropes by the supports. Let $L$ be the length of the bridge and $x$ be the distance from one
Figure 1.1. A simple suspension bridge.
Figure 1.2. Force diagram for the bridge example.
end of the bridge. Assume that the supports are located at $x_i$, $i = 1, \dots, n$, with $h$ being the spacing between supports. Let $w(x)$ be the force per unit distance exerted on the deck at $x$ by gravity, due to the weight of the deck and of the people on it. If we assume that any weight on the segment $[x_{i-1}, x_i]$ is exerted entirely on the supports at $x_{i-1}$ and $x_i$, then the force $f_i$ exerted on the rope by the support at $x_i$ can be written as an integral of $w(x)$ over the two segments adjacent to $x_i$; we refer to this expression as (1.1).
We now consider the rope as an elastic string, which is stretched by the force exerted by the wooden supports. Let $u_i$ be the height of the bridge at $x_i$, $T_{i-1/2}$ be the tension of the segment of the rope between $x_{i-1}$ and $x_i$, and $\theta_{i-1/2}$ be the angle it makes with the horizontal. Figure 1.2 shows the force diagram on the rope at $x_i$.
Since there is no horizontal displacement in the bridge, the horizontal forces must balance out, meaning
$$T_{i-1/2}\cos(\theta_{i-1/2}) = T_{i+1/2}\cos(\theta_{i+1/2}) = C,$$
where $C$ is a constant. Vertical force balance then gives
$$T_{i+1/2}\sin(\theta_{i+1/2}) - T_{i-1/2}\sin(\theta_{i-1/2}) = f_i.$$
Writing the sines and cosines in terms of the heights $u_i$ leads to a system of linear equations, which we refer to as (1.2), where $u_0$ and $u_{n+1}$ are the known heights of the bridge at its ends and $u_1, \dots, u_n$ are the unknown heights.
Now, if we want to compute the shape of the bridge and the tensions $T_{i-1/2}$, we must first calculate the forces $f_i$ from (1.1), and then solve the system of linear equations (1.2). To calculate the $f_i$, one must evaluate integrals, which may not be analytically feasible for certain weight distributions $w(x)$. Instead, one can approximate the integral numerically using a Riemann sum.
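For instance, the integral of $w$ over one segment can be approximated by a simple left-endpoint Riemann sum; since formula (1.1) is not reproduced above, the weight function and the spacing in this sketch are only illustrative assumptions, and (1.1) prescribes how such segment loads are distributed onto the supports:

h  = 0.25; xi = 0.5;               % spacing and one support location (illustrative values)
w  = @(x) 1 + 0.5*sin(pi*x);       % an assumed weight distribution w(x)
m  = 100;                          % number of subintervals for the Riemann sum
t  = (xi - h) + (0:m-1)*(h/m);     % left endpoints inside the segment [x_{i-1}, x_i]
I  = (h/m)*sum(w(t))               % approximate load carried by this segment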
The next step is to solve (1.2) for the $u_i$. This can be rewritten as
$$A\mathbf{u} = \mathbf{f},$$
where $A \in \mathbb{R}^{n\times n}$ is a matrix, $\mathbf{u} \in \mathbb{R}^n$ is the vector of unknowns, and $\mathbf{f}$ is the vector of forces we just calculated. This system can be solved by Gaussian elimination, i.e., by row reducing the matrix, as taught in a basic linear algebra course. For $n = 4$ and a uniform distribution $w(x) = 1$, the $4 \times 4$ system can be row reduced by hand.
Back substitution gives $\mathbf{u} = Kh\,(-2, -3, -3, -2)^T$. However, one often wishes to calculate the shape of the bridge under different weight distributions $w(x)$, e.g., when people are standing on different parts of the bridge. So the matrix $A$ stays the same, but the right-hand side $\mathbf{f}$ changes to reflect the different weight distributions. It would be a waste to have to redo the row reductions every time, when only $\mathbf{f}$ has changed! A much better way is to use the LU decomposition, which writes the matrix $A$ in factored form and reuses the factors to solve equations with different right-hand sides. This is shown in Chapter 3.
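In Matlab, this factor-once, solve-many idea can be sketched as follows; the tridiagonal matrix used here is an assumed stand-in for $A$, and the right-hand sides are placeholders:

n = 4;
A = diag(-2*ones(n,1)) + diag(ones(n-1,1),1) + diag(ones(n-1,1),-1);  % assumed model matrix
[L,U,P] = lu(A);                % factor A once: P*A = L*U
for k = 1:3                     % several load cases, i.e. several right-hand sides f
  f = -rand(n,1);               % placeholder right-hand side for load case k
  u = U\(L\(P*f));              % two cheap triangular solves reuse the factorization
end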
In the above row reduction, we can see easily that there are many zero entries that need not be calculated, but the computer has no way of knowing that in advance. In fact, the number of additions and multiplications required for solving the generic (i.e., full) linear system is proportional to $n^3$, whereas in our case, we only need about $n$ additions and multiplications because of the many zero entries. To take advantage of the sparse nature of the matrix, one needs to store it differently and use different algorithms on it. One possibility is to use the banded matrix format; this is shown in Section 3.6.
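Matlab's sparse matrix storage gives a first impression of this idea (the tridiagonal matrix is again an assumed stand-in for $A$):

n = 1000; e = ones(n,1);
A = spdiags([e -2*e e], -1:1, n, n);   % only the three nonzero diagonals are stored
f = -e/(n+1)^2;                        % placeholder load vector
u = A\f;                               % backslash exploits the banded/sparse structure
nnz(A)                                 % about 3n stored entries instead of n^2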
Suppose now that the people on the bridge have moved, but only by a few meters. The shape of the bridge would have only changed slightly, since the weight distribution is not very different. Thus, instead of solving a new linear system from scratch, one could imagine using the previous shape as a first guess and making small corrections to the solution until it matches the new distribution. This is the basis of iterative methods, which are discussed in Chapter 11.
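A minimal sketch of this warm-starting idea with a simple Jacobi iteration (the matrix and loads are placeholder choices; Chapter 11 discusses far better iterative methods):

n = 50; e = ones(n,1);
A = spdiags([e -2*e e], -1:1, n, n);           % assumed model matrix
f_old = -e; f_new = -e; f_new(20:30) = -1.5;   % load after people have moved slightly
u = A\f_old;                                   % previous shape, used as the initial guess
D = diag(diag(A));                             % Jacobi splitting of A
for k = 1:20                                   % a few correction sweeps
  u = u + D\(f_new - A*u);                     % update u toward the new solution
end
norm(f_new - A*u)                              % residual after a few iterations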
A well-designed bridge should never collapse, but there have been spectacular bridge failures in history. One particularly memorable one was the collapse of the Tacoma Narrows bridge on November 7, 1940. On that day, powerful wind gusts excited a natural resonance mode of the bridge, setting it into a twisting motion that it was not designed to withstand. As the winds continued, the amplitude of the twisting motion grew, until the bridge eventually collapsed.¹
It turns out that one can study the resonance modes of the bridge by considering the eigenvalue problem
$$A\mathbf{u} = \lambda \mathbf{u},$$
cf. [37]. Clearly, a two-dimensional model is needed to study the twisting motion mentioned above, but let us illustrate the ideas by considering the eigenvalues of the 1D model for $n = 4$. For this simple problem, one can guess the eigenvectors and verify that
$$\mathbf{u}^{(k)} = (\sin(k\pi/5), \sin(2k\pi/5), \sin(3k\pi/5), \sin(4k\pi/5))^T, \quad k = 1, 2, 3, 4,$$
are in fact eigenvectors with associated eigenvalues $\lambda^{(k)} = -2 + 2\cos(k\pi/5)$.
However, for more complicated problems, such as one with varying mass along the bridge or for 2D problems, it is no longer possible to guess the eigenvectors. Moreover, the characteristic polynomial
$$P(\lambda) = \det(\lambda I - A)$$
is a polynomial of degree $n$, and it is well known that no general formula exists for finding the roots of such polynomials for $n \geq 5$. In Chapter 7, we will present numerical algorithms for finding the eigenvalues of $A$. In fact, the problem of finding eigenvalues numerically also requires approximately $n^3$ operations, just like Gaussian elimination. This is in stark contrast with the theoretical point of view that linear systems are "easy" and polynomial root-finding is "impossible". To quote the eminent numerical analyst Nick Trefethen [139],
Abel and Galois notwithstanding, large-scale matrix eigenvalue problems are about as easy to solve in practice as linear systems of equations.
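The guessed eigenpairs are easy to verify numerically; assuming the 1D model matrix is the tridiagonal matrix with $-2$ on the diagonal and $1$ on the off-diagonals (consistent with the eigenvalues stated above), a few lines of Matlab suffice:

n = 4;
A = diag(-2*ones(n,1)) + diag(ones(n-1,1),1) + diag(ones(n-1,1),-1);
lambda = sort(eig(A))                         % numerically computed eigenvalues
k = (1:n)';
lambda_guess = sort(-2 + 2*cos(k*pi/5))       % the eigenvalues guessed above
U = sin((1:n)'*(1:n)*pi/5);                   % columns are the guessed eigenvectors u^(k)
norm(A*U - U*diag(-2 + 2*cos((1:n)*pi/5)))    % residual of the guessed eigenpairs, of order eps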
When modeling the bridge in the design process, we must use many parameters, such as the weight of the deck (expressed in terms of the mass density $\rho$ per unit length) and the elasticity constant $K$ of the supporting rope. In reality, these quantities depend on the actual material used during construction, and may deviate from the nominal values assumed during the design process. To get an accurate model of the bridge for later simulation, one needs to estimate these parameters from measurements taken during experiments. For example, we can measure the vertical displacements $y_i$ of the constructed bridge at points $x_i$, and compare them with the displacements $u_i$ predicted by the model, i.e., the displacements satisfying $A\mathbf{u} = \mathbf{f}$. Since both $A$ and $\mathbf{f}$
¹ http://www.youtube.com/watch?v=3mclp9QmCGs
depend on the model parameters $\rho$ and $K$, the $u_i$ also depend on these parameters; thus, the mismatch between the model and the experimental data can be expressed as a function of $\rho$ and $K$:
$$F(\rho, K) = \sum_i \bigl(y_i - u_i(\rho, K)\bigr)^2. \qquad (1.3)$$
There are several ways to determine the parameters that make this mismatch small:
1. One approach is to look for stationary points of $F$, i.e., to solve the nonlinear system of equations
$$\frac{\partial F}{\partial \rho} = 0, \qquad \frac{\partial F}{\partial K} = 0, \qquad (1.4)$$
which can be done by many methods, the best known of which is Newton's method. Such methods are discussed in Chapter 5.
2. The above approach has the disadvantage that (1.4) is satisfied by all stationary points of $F(\rho, K)$, i.e., both the maxima and the minima of $F(\rho, K)$. Since we are only interested in the minima of the function, a more direct approach would be to start with an initial guess $(\rho_0, K_0)$ (e.g., the nominal design values) and then find successively better approximations $(\rho_k, K_k)$, $k = 1, 2, 3, \dots$, that reduce the mismatch, i.e.,
$$F(\rho_{k+1}, K_{k+1}) \leq F(\rho_k, K_k).$$
This is the basis of optimization algorithms, which can be applied to other minimization problems. Such methods are discussed in detail in Chapter 12.
3. The function $F(\rho, K)$ in (1.3) has a very special structure in that it is a sum of squares of the differences. As a result, the minimization problem is known as a least-squares problem. Least-squares problems, in particular linear ones, often arise because they yield the best unbiased estimator in the statistical sense for linear models. Because of the prevalence and special structure of least-squares problems, it is possible to design specialized methods that are more efficient and/or robust for these problems than general optimization algorithms. One example is the Gauss-Newton method, which resembles a Newton method, except that second-order derivative terms are dropped to save on computation. This and other methods are presented in Chapter 6 (a small illustrative sketch of the parameter-fitting idea follows this list).
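To make the approaches concrete, here is a minimal sketch that minimizes a sum-of-squares mismatch with Matlab's general-purpose minimizer fminsearch; the data and the model function are invented stand-ins, since evaluating the true bridge model would require solving $A\mathbf{u} = \mathbf{f}$ for every parameter pair. A Gauss-Newton or Levenberg-Marquardt method (Chapter 6) would instead exploit the sum-of-squares structure.

t = (0:0.5:3)';                            % measurement locations (made-up data)
y = [1.9; 1.4; 1.1; 0.8; 0.65; 0.5; 0.4];  % measured values (made-up data)
model = @(p,t) p(1)*exp(-p(2)*t);          % stand-in for the model prediction u_i(rho,K)
F = @(p) sum((y - model(p,t)).^2);         % mismatch function as in (1.3)
p0 = [2; 0.5];                             % nominal design values as initial guess
p  = fminsearch(F, p0)                     % direct minimization of F (approach 2)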
This book intentionally contains too many topics to be done from cover to cover, even for an intensive full-year course. In fact, many chapters contain enough material for stand-alone semester courses on their respective topics. To help instructors and students navigate through the volume, we provide some sample courses that can be built from its sections.
The following sections have been used to build the first year numerical analysis course at the University of Geneva in 2011–12 (54 hours of lectures):
1. Finite precision arithmetic (2.1–2.6)
10. Ordinary differential equations (10.1, 10.3)
A first term course at Stanford for computer science students in 1996 and 1997 ('Introduction to Scientific Computing using Maple and Matlab', 40 hours of lectures) was built using
1. Finite precision arithmetic (2.2)
The following advanced undergraduate/graduate courses (38 hours of lectures each) have been taught at Baptist University in Hong Kong between 2010 and 2013. We include a list of chapters from which these courses were built:
1. Eigenvalues and Iterative Methods for Linear Equations (Chapters 7, 11)
2. Least Squares (Chapter 6)
3. Quadrature and Ordinary Differential Equations (Chapters 9 and 10)
At the University of Geneva, the following graduate courses (28 hours of lectures, and 14 hours of exercises) have been taught between 2004 and 2011:
1. Iterative Methods for Linear Equations (Chapter 11)
2. Optimization (Chapter 12)
Chapter 2 on finite precision arithmetic and Chapter 3 on linear equations are required for most, if not all, of the subsequent chapters. At the beginning of each chapter, we give a list of sections that are prerequisites to understanding the material. Readers who are not familiar with these sections should refer to them first before proceeding.
In the past 15 years many numerical analysts have progressed from being queer people in mathematics departments to being queer people in computer science departments!
George Forsythe, What to do till the computer scientist comes. Amer. Math. Monthly 75, 1968.
It is hardly surprising that numerical analysis is widely regarded as an unglamorous subject. In fact, mathematicians, physicists, and computer scientists have all tended to hold numerical analysis in low esteem for many years – a most unusual consensus.
Nick Trefethen, The definition of numerical analysis, SIAM news, November 1992.
The golden age of numerical analysis has not yet started.
Volker Mehrmann, round table discussion "Future Directions in Numerical Analysis," moderated by Gene Golub and Nick Trefethen at ICIAM 2007.
Finite precision arithmetic underlies all the computations performed numerically, e.g. in Matlab; only symbolic computations, e.g. Maple, are largely independent of finite precision arithmetic. Historically, when the invention of computers allowed a large number of operations to be performed in very rapid succession, nobody knew what the influence of finite precision arithmetic would be on this many operations: would small rounding errors sum up rapidly and destroy results? Would they statistically cancel? The early days of numerical analysis were therefore dominated by the study of rounding errors, and this made the rapidly developing field not very attractive (see the quote above). Fortunately, this view of numerical analysis has since changed, and nowadays the focus of numerical analysis is the study of algorithms for the problems of continuous mathematics¹. There are nonetheless a few pitfalls every person involved in scientific computing should know, and this chapter is precisely here for this reason. After an introductory example in Section 2.1, we present the difference between real numbers and machine numbers in Section 2.2 on a generic, abstract level, and give for the more computer science oriented reader the concrete IEEE arithmetic standard in Section 2.3. We then discuss the influence of rounding errors on operations in
¹ Nick Trefethen, The definition of numerical analysis, SIAM News, November 1992.
Section 2.4, and explain the predominant pitfall of catastrophic cancellation when computing differences. In Section 2.5, we explain in very general terms what the condition number of a problem is, and then show in Section 2.6 two properties of algorithms for a given problem, namely forward stability and backward stability. It is the understanding of condition numbers and stability that allowed numerical analysts to move away from the study of rounding errors, and to focus on algorithmic development. Sections 2.7 and 2.8 represent a treasure trove with advanced tips and tricks when computing in finite precision arithmetic.
A very old problem already studied by ancient Greek mathematicians is the squaring of a circle. The problem consists of constructing a square that has the same area as the unit circle. Finding a method for transforming a circle into a square this way (quadrature of the circle) became a famous problem that remained unsolved until the 19th century, when it was finally proved (via the transcendence of $\pi$) that the problem cannot be solved with straightedge and compass.
We know today that the area of a circle is given by $A = \pi r^2$, where $r$ denotes the radius of the circle. An approximation is obtained by drawing a regular polygon inside the circle, and by computing the surface of the polygon. The approximation is improved by increasing the number of sides. Archimedes managed to produce a 96-sided polygon, and was able to bracket $\pi$ in the interval $(3\frac{10}{71},\ 3\frac{1}{7})$. The enclosing interval has length $1/497 = 0.00201207243$ — surely good enough for most practical applications in his time.
Figure 2.1. Squaring of a circle.
To compute such a polygonal approximation of $\pi$, we consider Figure 2.1. Without loss of generality, we may assume that $r = 1$. Then the area $F_n$ of the isosceles triangle $ABC$ with center angle $\alpha_n := \frac{2\pi}{n}$ is $F_n = \frac{1}{2}\sin\alpha_n$, and the area of the associated $n$-sided polygon becomes
$$A_n = n F_n = \frac{n}{2}\sin\alpha_n.$$
Note that we cannot simply evaluate $\sin(2\pi/n)$ for larger and larger $n$, since this would require knowing $\pi$, the very number we are trying to compute, which would be a contradiction. We therefore compute the approximations recursively, i.e. by expressing $\sin(\alpha_n/2)$ in terms of $\sin\alpha_n$. This can be achieved by using identities for trigonometric functions:
$$\sin\frac{\alpha_n}{2} = \sqrt{\frac{1-\cos\alpha_n}{2}} = \sqrt{\frac{1-\sqrt{1-\sin^2\alpha_n}}{2}}. \qquad (2.1)$$
Thus, we have obtained a recurrence for $\sin(\alpha_n/2)$ from $\sin\alpha_n$. To start the
recurrence, we compute the area $A_6$ of the regular hexagon. The length of each side of the six equilateral triangles is 1 and the angle is $\alpha_6 = 60°$, so $\sin\alpha_6 = \frac{\sqrt{3}}{2}$ and $A_6 = \frac{6}{2}\sin\alpha_6 = \frac{3\sqrt{3}}{2}$. This leads to the following Matlab program:

Algorithm 2.1 Computation of π, Naive Version
s=sqrt(3)/2; A=3*s; n=6;              % initialization
z=[A-pi n A s];                       % store the results
while s>1e-10                         % termination if s=sin(alpha) small
  s=sqrt((1-sqrt(1-s*s))/2);          % new sin(alpha/2) value
  n=2*n; A=n/2*s; z=[z; A-pi n A s];  % A=new polygon area, store the results
end
The results, displayed in Table 2.1, are not what we would expect: initially, we observe convergence towards $\pi$, but for $n > 49152$, the error grows again and finally we obtain $A_n = 0$?! Although the theory and the program are both correct, we still obtain incorrect answers. We will explain in this chapter why this is the case.
Every computer is a finite automaton. This implies that a computer can only store a finite set of numbers and perform only a finite number of operations. In mathematics, we are used to calculating with real numbers $\mathbb{R}$ covering the continuous interval $(-\infty, \infty)$, but on the computer, we must contend with a
discrete, finite set of machine numbers $M = \{-\tilde{a}_{\max}, \dots, \tilde{a}_{\max}\}$. Hence each real number $a$ has to be mapped onto a machine number $\tilde{a}$ to be used on a computer. In fact a whole interval of real numbers is mapped onto one machine number, as shown in Figure 2.2.
Figure 2.2. Mapping of the real numbers $\mathbb{R}$ onto the machine numbers $M$.
Nowadays, machine numbers are often represented in the binary system.
In general, any base (or radix) $B$ could be used to represent numbers. A real machine number or floating point number consists of two parts, a mantissa (or significand) $m$ and an exponent $e$,
$$\tilde{a} = \pm\, m \times B^e, \qquad m = D.D\cdots D \ \ \text{(mantissa)}, \qquad e = D\cdots D \ \ \text{(exponent)},$$
where $D \in \{0, 1, \dots, B-1\}$ stands for one digit. To make the representation of machine numbers unique (note that e.g. $1.2345 \times 10^3 = 0.0012345 \times 10^6$), we require for a machine number $\tilde{a} \neq 0$ that the first digit before the decimal
point in the mantissa be nonzero; such numbers are called normalized. One defining characteristic for any finite precision arithmetic is the number of digits used for the mantissa and the exponent: the number of digits in the exponent defines the range of the machine numbers, whereas the number of digits in the mantissa defines the precision.
More specifically [100], a finite precision arithmetic is defined by four
integer parameters: B, the base or radix, p, the number of digits in the mantissa, and l and u defining the exponent range: l ≤ e ≤ u.
The precision of the machine is described by the real machine number eps. Historically, eps is defined to be the smallest positive $\tilde{a} \in M$ such that $\tilde{a} + 1 \neq 1$ when the addition is carried out on the computer. Because this definition involves details about the behavior of floating point addition, which are not easily accessible, a newer definition of eps is simply the spacing of the floating point numbers between 1 and $B$ (usually $B = 2$). The current definition only relies on how the numbers are represented.
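The historical definition can be explored directly on the machine; the following small loop is a standard idiom (not taken from the book) that halves a candidate until adding it to 1 no longer changes the result:

a = 1;
while 1 + a/2 > 1          % does adding a/2 to 1 still change the result?
  a = a/2;                 % if so, keep halving
end
a                          % in IEEE double precision this loop ends with a = eps = 2^-52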
Simple calculators often use the familiar decimal system ($B = 10$). Typically there are $p = 10$ digits for the mantissa and 2 for the exponent ($l = -99$ and $u = 99$). In this finite precision arithmetic, we have
• eps $= 0.000000001 = 1.000000000 \times 10^{-9}$,
• the largest machine number $\tilde{a}_{\max} = 9.999999999 \times 10^{+99}$,
• the smallest machine number $\tilde{a}_{\min} = 1.000000000 \times 10^{-99}$.
An early example of decimal hardware is the ERMETH, a computer built at ETH Zurich: completed in 1956, it was operational from 1956–1963. The representation of a real number used 16 decimal digits: the first digit, the q-digit, stored the sum of the digits modulo 3. This was used as a check to see if the machine word had been transmitted correctly from memory to the registers. The next three digits contained the exponent. Then the next 11 digits represented the mantissa, and finally, the last digit held the sign. The range of positive machine numbers was $1.0000000000 \times 10^{-200} \leq \tilde{a} \leq 9.9999999999 \times 10^{199}$. The possible larger exponent range in this setting from $-999$ to $999$ was not fully used.
In contrast, the very first programmable computer, the Z3, which was built by the German civil engineer Konrad Zuse and presented in 1941 to a group of experts only, was already using the binary system. The Z3 worked with an exponent of 7 bits and a mantissa of 14 bits (actually 15, since the numbers were normalized). The range of positive machine numbers was the interval
$$[2^{-63},\ 1.11111111111111 \times 2^{62}] \approx [1.08 \times 10^{-19},\ 9.22 \times 10^{18}].$$
In Maple (a computer algebra system), numerical computations are performed in base 10. The number of digits of the mantissa is defined by the variable Digits, which can be freely chosen. The number of digits of the exponent is given by the word length of the computer — for 32-bit machines, we have a huge maximal exponent of $u = 2^{31} = 2147483648$.
Since 1985 we have for computer hardware the ANSI/IEEE Standard 754 for Floating Point Numbers. It has been adopted by almost all computer manufacturers. The base is $B = 2$.
The IEEE single precision floating point standard representation uses a 32-bit word with bits numbered from 0 to 31 from left to right. The first bit $S$ is the sign bit, the next eight bits $E$ are the exponent bits, $e = EEEEEEEE$, and the final 23 bits are the bits $F$ of the mantissa $m$:
$$S \;\big|\; \underbrace{E\cdots E}_{8} \;\big|\; \underbrace{F\cdots F}_{23}$$
The value $\tilde{a}$ represented by the 32-bit word is defined as follows:
normal numbers: If $0 < e < 255$, then $\tilde{a} = (-1)^S \times 2^{e-127} \times 1.m$, where $1.m$ is the binary number created by prefixing $m$ with an implicit leading 1 and a binary point.
denormalized numbers: If $e = 0$ and $m \neq 0$, then $\tilde{a} = (-1)^S \times 2^{-126} \times 0.m$. These are known as denormalized (or subnormal) numbers.
If $e = 0$ and $m = 0$ and $S = 1$, then $\tilde{a} = -0$.
If $e = 0$ and $m = 0$ and $S = 0$, then $\tilde{a} = 0$.
If $e = 255$ and $m = 0$ and $S = 1$, then $\tilde{a} = -$Inf.
If $e = 255$ and $m = 0$ and $S = 0$, then $\tilde{a} =$ Inf.
If $e = 255$ and $m \neq 0$, then $\tilde{a} =$ NaN (Not a Number).
The smallest positive denormalized machine number is $2^{-149} \approx 1.4 \times 10^{-45}$.
In Matlab, real numbers are usually represented in double precision. The function single can however be used to convert numbers to single precision. Matlab can also print real numbers using the hexadecimal format, which is convenient for examining their internal representations:
>> format hex
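The hexadecimal examples and the single precision listing from the book are not reproduced above; the following session is only an illustrative sketch, with the expressions for s, z, y and t chosen to be consistent with the discussion below:

>> single(1)                             % ans = 3f800000: sign 0, exponent 127, mantissa 0
>> s = realmin('single')*eps('single')   % smallest positive denormalized single, 00000001
>> s = s/2                               % underflows to zero, 00000000
>> z = s/s                               % 0/0 gives NaN, although the limit of x/x is 1
>> y = 1/s                               % 1/0 gives Inf (7f800000)
>> t = -1/s                              % -1/0 gives -Inf (ff800000)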
The results illustrate the behavior of exceptional quantities in single precision. Dividing s by 2 gives zero because of underflow. The computation of z yields an undefined expression which results in NaN even though the limit is defined. The final two computations for y and t show the exceptions Inf and -Inf.
The IEEE double precision floating point standard representation uses a 64-bit word with bits numbered from 0 to 63 from left to right. The first bit $S$ is the sign bit, the next eleven bits $E$ are the exponent bits for $e$, and the final 52 bits $F$ represent the mantissa $m$:
$$S \;\big|\; \underbrace{E\cdots E}_{11} \;\big|\; \underbrace{F\cdots F}_{52}$$
The value $\tilde{a}$ represented by the 64-bit word is defined as follows:
normal numbers: If $0 < e < 2047$, then $\tilde{a} = (-1)^S \times 2^{e-1023} \times 1.m$, where $1.m$ is the binary number created by prefixing $m$ with an implicit leading 1 and a binary point.
denormalized numbers: If $e = 0$ and $m \neq 0$, then $\tilde{a} = (-1)^S \times 2^{-1022} \times 0.m$, which are again denormalized numbers.
If $e = 0$ and $m = 0$ and $S = 1$, then $\tilde{a} = -0$.
If $e = 0$ and $m = 0$ and $S = 0$, then $\tilde{a} = 0$.
If $e = 2047$ and $m = 0$ and $S = 1$, then $\tilde{a} = -$Inf.
If $e = 2047$ and $m = 0$ and $S = 0$, then $\tilde{a} =$ Inf.
If $e = 2047$ and $m \neq 0$, then $\tilde{a} =$ NaN (Not a Number).
In Matlab, real computations are performed in IEEE double precision by default. Using again the hexadecimal format in Matlab to see the internal representation, we obtain for example:
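A few representative values are shown below; the listing is an illustrative sketch, but the hexadecimal patterns follow directly from the encoding rules above:

>> format hex
>> 1        % 3ff0000000000000: e = 1023, m = 0
>> -1       % bff0000000000000: as above, but with the sign bit set
>> 2        % 4000000000000000: e = 1024, m = 0
>> eps      % 3cb0000000000000: 2^-52
>> Inf      % 7ff0000000000000: e = 2047, m = 0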
• The machine precision is eps $= 2^{-52}$.
• The largest machine number $\tilde{a}_{\max}$ is denoted by realmax. Note that it has exponent $e = 2046$ and a mantissa $F$ consisting of all ones:
$$\text{realmax} = \Bigl(1 + \frac{1}{2} + \frac{1}{2^2} + \cdots + \frac{1}{2^{52}}\Bigr) \times 2^{1023} = 2^{1023} \times (2 - \text{eps}).$$
Even though Matlab reports log2(realmax)=1024, realmax does not equal $2^{1024}$, but rather $(2-\text{eps}) \times 2^{1023}$; taking the logarithm of realmax yields 1024 only because of rounding. Similar rounding effects would also occur for machine numbers that are a bit smaller than realmax.
• The computation range is the interval [-realmax, realmax]. If an operation produces a result outside this interval, then it is said to overflow. Before the IEEE Standard, computation would halt with an error message in such a case. Now the result of an overflow operation is assigned the number ±Inf.
• The smallest positive normalized number is realmin $= 2^{-1022}$.
• IEEE allows computations with denormalized numbers. The positive denormalized numbers are in the interval [realmin*eps, realmin]. If an operation produces a strictly positive number that is smaller than realmin*eps, then this result is said to be in the underflow range. Since such a result cannot be represented, zero is assigned instead.
Trang 38• When computing with denormalized numbers, we may suffer a loss of
precision Consider the following Matlab program:
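The original listing is not reproduced above; the following minimal sketch shows the same effect, where the scaling factor 1e-14 is only an illustrative choice:

format long
res = pi*realmin*1e-14        % the product falls into the denormalized range
res2 = res/realmin/1e-14      % reversing the operations: only the leading digits of pi survive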
The first result res is a denormalized number, and thus can no longer be represented with full accuracy. So when we reverse the operations and compute res2, we obtain a result which only contains 2 correct decimal digits. We therefore recommend avoiding the use of denormalized numbers whenever possible.
Let $\tilde{a}$ and $\tilde{b}$ be two machine numbers. Then $c = \tilde{a} \times \tilde{b}$ will in general not be a machine number anymore, since the product of two numbers contains twice as many digits. The computed result will therefore be rounded to a machine number $\tilde{c}$ which is closest to $c$. As an example, consider the product of two 8-digit decimal numbers: the exact 16-digit product must be rounded back to 8 digits, and the quantity
$$r = \frac{\tilde{c} - c}{c}$$
is called the relative rounding error.
On today's computers, basic arithmetic operations obey the standard model of arithmetic: for $a, b \in M$, we have
$$a \,\tilde{\oplus}\, b = (a \oplus b)(1 + r),$$
where $r$ is the relative rounding error with $|r| <$ eps, the machine precision. We denote by $\oplus \in \{+, -, \times, /\}$ the exact basic operation and by $\tilde{\oplus}$ the equivalent computer operation.
Another interpretation of the standard model of arithmetic is due to Wilkinson. In what follows, we will no longer use the multiplication symbol $\times$ for the exact operation; it is common practice in algebra to denote multiplication without any symbol: $ab \Longleftrightarrow a \times b$. Consider the operations
Addition: $a \,\tilde{+}\, b = (a + b)(1 + r) = (a + ar) + (b + br) = \tilde{a} + \tilde{b}$,
Multiplication: $a \,\tilde{\times}\, b = ab(1 + r) = a(b + br) = a\tilde{b}$.
In each of the above, the operation satisfies
Wilkinson’s Principle
The result of a numerical computation on the computer is the result of an exact computation with slightly perturbed initial data.
For example, the numerical result of the multiplication $a \,\tilde{\times}\, b$ is the exact result $a\tilde{b}$ with a slightly perturbed operand $\tilde{b} = b + br$. As a consequence of Wilkinson's Principle, we need to study the effect that slightly perturbed data have on the result of a computation. This is done in Section 2.6.
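A quick experiment of our own illustrates the standard model: the exact sum 1 + eps/2 is not a machine number, so the computed sum must be rounded, and the relative error stays within the bound |r| < eps.

a = 1; b = eps/2;       % two machine numbers
s = a + b;              % the exact sum 1 + eps/2 lies between the machine numbers 1 and 1 + eps
s == 1                  % true: the computed sum is rounded to 1, a relative error of about eps/2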
A special rounding error is called cancellation. If we subtract two almost equal numbers, leading digits will cancel. Consider two numbers with 5 decimal digits, both of order one, which agree in their first four digits: their normalized difference can at best be $1.XXXX\mathrm{e}{-4}$, where the digits denoted by $X$ are unknown, because the subtraction yields no information about them.
This is exactly what happened in our example at the beginning of this chapter. To compute $\sin(\alpha/2)$ from $\sin\alpha$, we used the recurrence (2.1):
$$\sin\frac{\alpha_n}{2} = \sqrt{\frac{1-\sqrt{1-\sin^2\alpha_n}}{2}}.$$
Since $\sin\alpha_n \to 0$, the numerator on the right hand side is $1 - \sqrt{1 - \varepsilon^2}$, with small $\varepsilon = \sin\alpha_n$, and suffers from severe cancellation. This is the reason why the algorithm performed so badly, even though the theory and program are both correct.
It is possible in this case to rearrange the computation and avoid cancellation:
$$\sin\frac{\alpha_n}{2} = \sqrt{\frac{1-\sqrt{1-\sin^2\alpha_n}}{2}} = \frac{\sin\alpha_n}{\sqrt{2\,\bigl(1 + \sqrt{1-\sin^2\alpha_n}\bigr)}}.$$
This last expression no longer suffers from cancellation, and we obtain the new program:
Algorithm 2.2 Computation of π, Stable Version
oldA=0; s=sqrt(3)/2; newA=3*s; n=6;    % initialization
z=[newA-pi n newA s];                  % store the results
while newA>oldA                        % quit if area does not increase
  oldA=newA;
  s=s/sqrt(2*(1+sqrt((1+s)*(1-s))));   % new sine value
  n=2*n; newA=n/2*s;                   % new polygon area
  z=[z; newA-pi n newA s];             % store the results
end
This time we do converge to the correct value of $\pi$ (see Table 2.2). Notice also the elegant termination criterion: since the surface of the next polygon grows, we theoretically have
$$A_6 < \cdots < A_n < A_{2n} < \pi.$$
However, this cannot be true forever in finite precision arithmetic, since there is only a finite set of machine numbers. Thus, the situation $A_n \geq A_{2n}$ must occur at some stage, and this is the condition to stop the iteration. Note that this condition is independent of the machine, in the sense that the iteration will always terminate as long as we have finite precision arithmetic, and when it does terminate, it always gives the best possible approximation for the precision of the machine. More examples of machine-independent algorithms can be found in Section 2.8.1.
A second example in which cancellation occurs is the problem of numerical differentiation (see Chapter 8). Given a twice continuously differentiable function $f$, one may approximate its derivative by the finite difference quotient $(f(x+h) - f(x))/h$; for small $h$, the numbers $f(x+h)$ and $f(x)$ are nearly equal, so their difference again suffers from cancellation.
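A short Matlab experiment of our own makes the effect visible for f(x) = exp(x) at x = 1: as h decreases, the discretization error first shrinks, but once h is very small, the computed difference f(x+h) - f(x) loses digits to cancellation and the total error grows again.

f = @exp; x = 1; fp = exp(1);           % test function and its exact derivative
h = 10.^(-(1:15))';                     % decreasing step sizes
err = abs((f(x+h) - f(x))./h - fp);     % error of the forward difference quotient
[h err]                                 % smallest error near h = sqrt(eps); for smaller h,
                                        % cancellation in f(x+h)-f(x) makes the error grow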