Details

Designing High Availability Systems


Designing High Availability Systems

DFSS and Classical Reliability Techniques with Practical Real Life Examples
1. Aufl.

von: Zachary Taylor, Subramanyam Ranganathan

CHF 99.00

Verlag: Wiley
Format: EPUB
Veröffentl.: 09.10.2013
ISBN/EAN: 9781118753736
Sprache: englisch
Anzahl Seiten: 480

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>A practical, step-by-step guide to designing world-class, high availability systems using both classical and DFSS reliability techniques</b></p> <p>Whether designing telecom, aerospace, automotive, medical, financial, or public safety systems, every engineer aims for the utmost reliability and availability in the systems he, or she, designs. But between the dream of world-class performance and reality falls the shadow of complexities that can bedevil even the most rigorous design process. While there are an array of robust predictive engineering tools, there has been no single-source guide to understanding and using them . . . until now.</p> <p>Offering a case-based approach to designing, predicting, and deploying world-class high-availability systems from the ground up, this book brings together the best classical and DFSS reliability techniques. Although it focuses on technical aspects, this guide considers the business and market constraints that require that systems be designed right the first time.</p> <p>Written in plain English and following a step-by-step "cookbook" format, <i>Designing High Availability Systems:</i></p> <ul> <li>Shows how to integrate an array of design/analysis tools, including Six Sigma, Failure Analysis, and Reliability Analysis</li> <li>Features many real-life examples and case studies describing predictive design methods, tradeoffs, risk priorities, "what-if" scenarios, and more</li> <li>Delivers numerous high-impact takeaways that you can apply to your current projects immediately</li> <li>Provides access to MATLAB programs for simulating problem sets presented, along with PowerPoint slides to assist in outlining the problem-solving process</li> </ul> <p><i>Designing High Availability Systems</i> is an indispensable working resource for system engineers, software/hardware architects, and project teams working in all industries.</p>
Preface xiii <p>List of Abbreviations xvii</p> <p><b>1. Introduction 1</b></p> <p><b>2. Initial Considerations for Reliability Design 3</b></p> <p>2.1 The Challenge 3</p> <p>2.2 Initial Data Collection 3</p> <p>2.3 Where Do We Get MTBF Information? 5</p> <p>2.4 MTTR and Identifying Failures 6</p> <p>2.5 Summary 7</p> <p><b>3. A Game of Dice: An Introduction to Probability 8</b></p> <p>3.1 Introduction 8</p> <p>3.2 A Game of Dice 10</p> <p>3.3 Mutually Exclusive and Independent Events 10</p> <p>3.4 Dice Paradox Problem and Conditional Probability 15</p> <p>3.5 Flip a Coin 21</p> <p>3.6 Dice Paradox Revisited 23</p> <p>3.7 Probabilities for Multiple Dice Throws 24</p> <p>3.8 Conditional Probability Revisited 27</p> <p>3.9 Summary 29</p> <p><b>4. Discrete Random Variables 30</b></p> <p>4.1 Introduction 30</p> <p>4.2 Random Variables 31</p> <p>4.3 Discrete Probability Distributions 33</p> <p>4.4 Bernoulli Distribution 34</p> <p>4.5 Geometric Distribution 35</p> <p>4.6 Binomial Coeffi cients 38</p> <p>4.7 Binomial Distribution 40</p> <p>4.8 Poisson Distribution 43</p> <p>4.9 Negative Binomial Random Variable 48</p> <p>4.10 Summary 50</p> <p><b>5. Continuous Random Variables 51</b></p> <p>5.1 Introduction 51</p> <p>5.2 Uniform Random Variables 52</p> <p>5.3 Exponential Random Variables 53</p> <p>5.4 Weibull Random Variables 54</p> <p>5.5 Gamma Random Variables 55</p> <p>5.6 Chi-Square Random Variables 59</p> <p>5.7 Normal Random Variables 59</p> <p>5.8 Relationship between Random Variables 60</p> <p>5.9 Summary 61</p> <p><b>6. Random Processes 62</b></p> <p>6.1 Introduction 62</p> <p>6.2 Markov Process 63</p> <p>6.3 Poisson Process 63</p> <p>6.4 Deriving the Poisson Distribution 64</p> <p>6.5 Poisson Interarrival Times 69</p> <p>6.6 Summary 71</p> <p><b>7. Modeling and Reliability Basics 72</b></p> <p>7.1 Introduction 72</p> <p>7.2 Modeling 75</p> <p>7.3 Failure Probability and Failure Density 77</p> <p>7.4 Unreliability, F(t) 78</p> <p>7.5 Reliability, R(t) 79</p> <p>7.6 MTTF 79</p> <p>7.7 MTBF 79</p> <p>7.8 Repairable System 80</p> <p>7.9 Nonrepairable System 80</p> <p>7.10 MTTR 80</p> <p>7.11 Failure Rate 81</p> <p>7.12 Maintainability 81</p> <p>7.13 Operability 81</p> <p>7.14 Availability 82</p> <p>7.15 Unavailability 84</p> <p>7.16 Five 9s Availability 85</p> <p>7.17 Downtime 85</p> <p>7.18 Constant Failure Rate Model 85</p> <p>7.19 Conditional Failure Rate 88</p> <p>7.20 Bayes’s Theorem 94</p> <p>7.21 Reliability Block Diagrams 98</p> <p>7.22 Summary 107</p> <p><b>8. Discrete-Time Markov Analysis 110</b></p> <p>8.1 Introduction 110</p> <p>8.2 Markov Process Defined 112</p> <p>8.3 Dynamic Modeling 116</p> <p>8.4 Discrete Time Markov Chains 116</p> <p>8.5 Absorbing Markov Chains 123</p> <p>8.6 Nonrepairable Reliability Models 129</p> <p>8.7 Summary 140</p> <p><b>9. Continuous-Time Markov Systems 141</b></p> <p>9.1 Introduction 141</p> <p>9.2 Continuous-Time Markov Processes 141</p> <p>9.3 Two-State Derivation 143</p> <p>9.4 Steps to Create a Markov Reliability Model 147</p> <p>9.5 Asymptotic Behavior (Steady-State Behavior) 148</p> <p>9.6 Limitations of Markov Modeling 154</p> <p>9.7 Markov Reward Models 154</p> <p>9.8 Summary 155</p> <p><b>10. Markov Analysis: Nonrepairable Systems 156</b></p> <p>10.1 Introduction 156</p> <p>10.2 One Component, No Repair 156</p> <p>10.3 Nonrepairable Systems: Parallel System with No Repair 165</p> <p>10.4 Series System with No Repair: Two Identical Components 172</p> <p>10.5 Parallel System with Partial Repair: Identical Components 176</p> <p>10.6 Parallel System with No Repair: Nonidentical Components 183</p> <p>10.7 Summary 192</p> <p><b>11. Markov Analysis: Repairable Systems 193</b></p> <p>11.1 Repairable Systems 193</p> <p>11.2 One Component with Repair 194</p> <p>11.3 Parallel System with Repair: Identical Component Failure and Repair Rates 204</p> <p>11.4 Parallel System with Repair: Different Failure and Repair Rates 217</p> <p>11.5 Summary 239</p> <p><b>12. Analyzing Confidence Levels 240</b></p> <p>12.1 Introduction 240</p> <p>12.2 pdf of a Squared Normal Random Variable 240</p> <p>12.3 pdf of the Sum of Two Random Variables 243</p> <p>12.4 pdf of the Sum of Two Gamma Random Variables 245</p> <p>12.5 pdf of the Sum of n Gamma Random Variables 246</p> <p>12.6 Goodness-of-Fit Test Using Chi-Square 249</p> <p>12.7 Confidence Levels 257</p> <p>12.8 Summary 264</p> <p><b>13. Estimating Reliability Parameters 266</b></p> <p>13.1 Introduction 266</p> <p>13.2 Bayes’ Estimation 268</p> <p>13.3 Example of Estimating Hardware MTBF 273</p> <p>13.4 Estimating Software MTBF 273</p> <p>13.5 Revising Initial MTBF Estimates and Tradeoffs 274</p> <p>13.6 Summary 277</p> <p><b>14. Six Sigma Tools for Predictive Engineering 278</b></p> <p>14.1 Introduction 278</p> <p>14.2 Gathering Voice of Customer (VOC) 279</p> <p>14.3 Processing Voice of Customer 281</p> <p>14.4 Kano Analysis 282</p> <p>14.5 Analysis of Technical Risks 284</p> <p>14.6 Quality Function Deployment (QFD) or House of Quality 284</p> <p>14.7 Program Level Transparency of Critical Parameters 287</p> <p>14.8 Mapping DFSS Techniques to Critical Parameters 287</p> <p>14.9 Critical Parameter Management (CPM) 287</p> <p>14.10 First Principles Modeling 289</p> <p>14.11 Design of Experiments (DOE) 289</p> <p>14.12 Design Failure Modes and Effects Analysis (DFMEA) 289</p> <p>14.13 Fault Tree Analysis 290</p> <p>14.14 Pugh Matrix 290</p> <p>14.15 Monte Carlo Simulation 291</p> <p>14.16 Commercial DFSS Tools 291</p> <p>14.17 Mathematical Prediction of System Capability instead of “Gut Feel” 293</p> <p>14.18 Visualizing System Behavior Early in the Life Cycle 297</p> <p>14.19 Critical Parameter Scorecard 297</p> <p>14.20 Applying DFSS in Third-Party Intensive Programs 298</p> <p>14.21 Summary 300</p> <p><b>15. Design Failure Modes and Effects Analysis 302</b></p> <p>15.1 Introduction 302</p> <p>15.2 What Is Design Failure Modes and Effects Analysis (DFMEA)? 302</p> <p>15.3 Definitions 303</p> <p>15.4 Business Case for DFMEA 303</p> <p>15.5 Why Conduct DFMEA? 305</p> <p>15.6 When to Perform DFMEA 305</p> <p>15.7 Applicability of DFMEA 306</p> <p>15.8 DFMEA Template 306</p> <p>15.9 DFMEA Life Cycle 312</p> <p>15.10 The DFMEA Team 324</p> <p>15.11 DFMEA Advantages and Disadvantages 327</p> <p>15.12 Limitations of DFMEA 328</p> <p>15.13 DFMEAs, FTAs, and Reliability Analysis 328</p> <p>15.14 Summary 330</p> <p><b>16. Fault Tree Analysis 331</b></p> <p>16.1 What Is Fault Tree Analysis? 331</p> <p>16.2 Events 332</p> <p>16.3 Logic Gates 333</p> <p>16.4 Creating a Fault Tree 335</p> <p>16.5 Fault Tree Limitations 339</p> <p>16.6 Summary 339</p> <p><b>17. Monte Carlo Simulation Models 340</b></p> <p>17.1 Introduction 340</p> <p>17.2 System Behavior over Mission Time 344</p> <p>17.3 Reliability Parameter Analysis 344</p> <p>17.4 A Worked Example 348</p> <p>17.5 Component and System Failure Times Using Monte Carlo Simulations 359</p> <p>17.6 Limitations of Using Nontime-Based Monte Carlo Simulations 361</p> <p>17.7 Summary 365</p> <p><b>18. Updating Reliability Estimates: Case Study 367</b></p> <p>18.1 Introduction 367</p> <p>18.2 Overview of the Base Station Controller—Data Only (BSC-DO) System 367</p> <p>18.3 Downtime Calculation 368</p> <p>18.4 Calculating Availability from Field Data Only 371</p> <p>18.5 Assumptions Behind Using the Chi-Square Methodology 372</p> <p>18.6 Fault Tree Updates from Field Data 372</p> <p>18.7 Summary 376</p> <p><b>19. Fault Management Architectures 377</b></p> <p>19.1 Introduction 377</p> <p>19.2 Faults, Errors, and Failures 378</p> <p>19.3 Fault Management Design 381</p> <p>19.4 Repair versus Recovery 382</p> <p>19.5 Design Considerations for Reliability Modeling 383</p> <p>19.6 Architecture Techniques to Improve Availability 383</p> <p>19.7 Redundancy Schemes 384</p> <p>19.8 Summary 395</p> <p><b>20 Application of DFMEA to Real-Life Example 397</b></p> <p>20.1 Introduction 397</p> <p>20.2 Cage Failover Architecture Description 397</p> <p>20.3 Cage Failover DFMEA Example 399</p> <p>20.4 DFMEA Scorecard 401</p> <p>20.5 Lessons Learned 402</p> <p>20.6 Summary 403</p> <p><b>21. Application of FTA to Real-Life Example 404</b></p> <p>21.1 Introduction 404</p> <p>21.2 Calculating Availability Using Fault Tree Analysis 404</p> <p>21.3 Building the Basic Events 405</p> <p>21.4 Building the Fault Tree 406</p> <p>21.5 Steps for Creating and Estimating the Availability Using FTA 408</p> <p>21.6 Summary 416</p> <p><b>22. Complex High Availability System Analysis 420</b></p> <p>22.1 Introduction 420</p> <p>22.2 Markov Analysis of the Hardware Components 420</p> <p>22.3 Building a Fault Tree from the Hardware Markov Model 427</p> <p>22.4 Markov Analysis of the Software Components 427</p> <p>22.5 Markov Analysis of the Combined Hardware and Software Components 433</p> <p>22.6 Techniques for Simplifying Markov Analysis 437</p> <p>22.7 Summary 446</p> <p>References 447</p> <p>Index 450</p>
<p><b>ZACHARY TAYLOR</b> is a Systems Architect at Nokia Solutions & Networks with over thirty years' experience designing high availability and mission critical systems at GE, Lockheed Martin, and Motorola. He has a Masters in Electrical Engineering.</p> <p><b>SUBRAMANYAM RANGANATHAN</b> is a DFSS Master Black Belt at Nokia Solutions & Networks with over twenty years' experience in the high-tech industry including at Motorola. He has a Masters in Electrical Engineering and an MBA from the Kellogg School of Management.</p>
<p><b>A practical, step-by-step guide to designing world-class, high availability systems using both classical and DFSS reliability techniques</b></p> <p>Whether designing telecom, aerospace, automotive, medical, financial, or public safety systems, every engineer aims for the utmost reliability and availability in the systems he, or she, designs. But between the dream of world-class performance and reality falls the shadow of complexities that can bedevil even the most rigorous design process. While there are an array of robust predictive engineering tools, there has been no single-source guide to understanding and using them . . . until now.</p> <p>Offering a case-based approach to designing, predicting, and deploying world-class high-availability systems from the ground up, this book brings together the best classical and DFSS reliability techniques. Although it focuses on technical aspects, this guide considers the business and market constraints that require that systems be designed right the first time.</p> <p>Written in plain English and following a step-by-step "cookbook" format, <i>Designing High Availability Systems:</i></p> <ul> <li>Shows how to integrate an array of design/analysis tools, including Six Sigma, Failure Analysis, and Reliability Analysis</li> <li>Features many real-life examples and case studies describing predictive design methods, tradeoffs, risk priorities, "what-if" scenarios, and more</li> <li>Delivers numerous high-impact takeaways that you can apply to your current projects immediately</li> <li>Provides access to MATLAB programs for simulating problem sets presented, along with PowerPoint slides to assist in outlining the problem-solving process</li> </ul> <p><i>Designing High Availability Systems</i> is an indispensable working resource for system engineers, software/hardware architects, and project teams working in all industries.</p>

Diese Produkte könnten Sie auch interessieren:

Pulsed Power
Pulsed Power
von: Gennady A. Mesyats
PDF ebook
CHF 389.50
High-Frequency Oscillator Design for Integrated Transceivers
High-Frequency Oscillator Design for Integrated Transceivers
von: J. van der Tang, Dieter Kasperkovitz, Arthur H.M. van Roermund
PDF ebook
CHF 177.00
MEMS Vibratory Gyroscopes
MEMS Vibratory Gyroscopes
von: Cenk Acar, Andrei Shkel
PDF ebook
CHF 212.50