{"id":7262,"date":"2021-07-25T12:21:46","date_gmt":"2021-07-25T10:21:46","guid":{"rendered":"https:\/\/tekmart.co.za\/t-blog\/?p=7262"},"modified":"2021-07-28T18:03:34","modified_gmt":"2021-07-28T16:03:34","slug":"what-is-remaining-rated-write-endurance-in-ssds-technical-definition-parlance","status":"publish","type":"post","link":"https:\/\/tekmart.co.za\/t-blog\/what-is-remaining-rated-write-endurance-in-ssds-technical-definition-parlance\/","title":{"rendered":"What is Remaining Rated Write Endurance in SSDs technical definition parlance?"},"content":{"rendered":"<span class=\"span-reading-time rt-reading-time\" style=\"display: block;\"><span class=\"rt-label rt-prefix\">Reading Time-approximately:<\/span> <span class=\"rt-time\"> 7<\/span> <span class=\"rt-label rt-postfix\">minutes<\/span><\/span>\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"7262\" class=\"elementor elementor-7262\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-5b83ac1a elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"5b83ac1a\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-120e90d2\" data-id=\"120e90d2\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-d35c8e9 elementor-widget elementor-widget-image\" data-id=\"d35c8e9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"776\" height=\"310\" src=\"https:\/\/tekmart.co.za\/t-blog\/wp-content\/uploads\/2021\/07\/crucial-ssd-image-for-TRIM-explanation-776x310.jpg\" class=\"attachment-agama-blog-large size-agama-blog-large wp-image-7304\" alt=\"crucial ssd image for TRIM explanation\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-525325dd elementor-widget elementor-widget-text-editor\" data-id=\"525325dd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\n<h2 class=\"wp-block-heading\"><strong>Write endurance is the number of program\/erase (P\/E cycles) that can be applied to a block of flash memory before the storage media becomes unreliable.<\/strong><\/h2>\n\n\n\n<p><strong>By Tekmart Africa Enterprise Support Team<\/strong><\/p>\n\n\n\n<p>The Remaining Rated Write Endurance Threshold feature is&nbsp;supported on PCIe and SAS\/SATA SSDs. <\/p>\n\n\n\n<figure class=\"wp-block-pullquote\"><blockquote><p>The Remaining Rated Write Endurance Threshold feature provides added feature functionality in managing the threshold level for PCIe SSDs, or SAS\/SATA SSDs, or both based on your system configuration.<\/p><\/blockquote><\/figure>\n\n\n\n<p>SSD endurance is the total amount of data that an SSD is guaranteed to be able to write under warranty, often specified in \u201cTBW\u201d or \u201cDWPD\u201d (which we\u2019ll discuss a little later). The physics of SSD endurance are complicated, but the results are simple: SSDs wear out as you write to them.<\/p>\n\n\n\n<p>NAND flash&nbsp;wears out&nbsp;because the oxide layer that isolates the&nbsp;floating gates&nbsp;in a flash memory chip are compromised after repeated P\/E cycles. &nbsp;The floating gates are important because they store the electrical charge that indicates the state of each&nbsp;bit&nbsp;in a flash memory cell.&nbsp; For instance, in single level cell flash (SLC), each bit is a \u201c0\u201d or a \u201c1.\u201d <\/p>\n\n\n\n<p>When the insulating layer has sustained enough damage, the bit cells may not retain the right charge. \u201c1s\u201d may become \u201c0s\u201d and the data on the storage media becomes corrupt. Write endurance is calculated by estimating how often and how thoroughly the flash media is used.<\/p>\n\n\n\n<p>Write endurance is calculated by estimating how often and how thoroughly the flash media is used. Some manufacturers provide an estimate of longevity based on an arbitrary amount of data written\/erased per day but many vendors simply provide a mean-time-to-failure (MTTF) estimate without stipulating how they have arrived at estimate.&nbsp; SanDisk has proposed an industry metric for write endurance called Longterm Data Endurance (LDE). &nbsp;<\/p>\n\n\n\n<p>The metric, which sets a data retention period of one year, &nbsp;is based on a pre-determined usage pattern, assuming the data is written equally over the lifetime of the drive.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p>Old IDRAC versions before 7 don&#8217;t have information on drive health of this sort so you can&#8217;t depend on old IDRACs for predictive failure sorts of alerts.&nbsp; If you are within warrantee and have support, you can install OMSA, configured for Dell to get warnings and then alert you by text or Email.&nbsp; Supposedly, it can be set to alert you without the Dell intermediary but I&#8217;ve never gotten it to work.&nbsp; I try to keep my production servers under Dell support contract and then retire the server.&nbsp; Five years seems to be the maximum server life in <strong>my cost\/risk\/performance calculation.<\/strong><\/p><\/blockquote>\n\n\n\n<p>I would recommend making sure the firmware on the drive and controller are at the latest revision. You should also make sure you are using the latest version of the application used to monitor endurance. Updates may change how endurance is calculated or reported. It may also fix issues of incorrect reporting.<\/p>\n\n\n\n<p>Write endurance is one of the major differentiators separating enterprise and client-oriented drives, after all. As MLC-based storage continues pushing its way into spaces previously filled by SLC NAND, we have to keep a close eye on this difficult-to-benchmark, but still very important variable involved in evaluating solid-state storage.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Slow Solid State Hard Drive performance<\/strong><\/h2>\n\n\n\n<p>If you are experiencing what appears to be slow Solid State Drive performance, please try these steps.<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Open Device Manager.<\/li><li>Double-click IDE ATA\/ATAPI Controllers to display the list of controllers and channels.<\/li><li>Right-click the icon for the channel to which the device is connected.<\/li><li>Select Properties, and then click the Advanced Settings tab.&nbsp;<\/li><li>If the &#8220;Transfer Mode&#8221; box says &#8220;PIO Only,&#8221; change it to &#8220;DMA if Available,&#8221; click OK and reboot.<\/li><li>If the &#8220;Transfer Mode&#8221; box says &#8220;DMA if Available,&#8221; and the &#8220;Current Transfer Mode&#8221; box says &#8220;PIO,&#8221; set the transfer mode to PIO Only, apply and reset to &#8220;DMA if Available.&#8221; Reboot.<\/li><li>If the &#8220;Transfer Mode&#8221; box says &#8220;DMA if Available,&#8221; and the &#8220;Current Transfer Mode&#8221; box says anything but &#8220;UDMA5,&#8221; toggle the transfer mode, as in step 6.<\/li><\/ol>\n\n\n\n<p>You would then need to make a decision on the next steps to take depending on the outcome of your test. If both drives are at the same percentage then you would replace them immediately. If the endurance is accurate then once the drives reach <strong>0% <\/strong>they will become read only. If you wait until that happens then whatever data is being written at that time may be corrupted.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Understanding SSD endurance: drive writes per day (DWPD), terabytes written (TBW), and the minimum recommended for Storage Spaces Direct; a practical implementation.<\/strong><\/h2>\n\n\n\n<p>This Portion Courtesy of <\/p>\n\n\n\n<figure class=\"wp-block-image is-resized\"><img decoding=\"async\" src=\"https:\/\/techcommunity.microsoft.com\/t5\/image\/serverpage\/image-id\/151447iB01185B7AA18643D\/image-dimensions\/40x40\/image-coordinates\/0%2C0%2C400%2C400?v=v2\" alt=\"Cosmos Darwin\" width=\"70\" height=\"70\"\/><\/figure>\n\n\n\n<p><a href=\"https:\/\/techcommunity.microsoft.com\/t5\/user\/viewprofilepage\/user-id\/33564\">Cosmos Darwin<\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId-1703865306\"><strong>Background<\/strong><\/h2>\n\n\n\n<p>Storage Spaces Direct in Windows Server 2016 and Windows Server 2019 features a built-in, persistent, read and write cache to maximize storage performance. You can read all about it at&nbsp;Understanding the cache in Storage Spaces Direct&nbsp;. In all-flash deployments, NVMe drives typically cache for SATA\/SAS SSDs; in hybrid deployments, NVMe or SATA\/SAS SSDs cache for HDDs.<\/p>\n\n\n\n<p>In any case, the cache drives will serve the overwhelming majority of IO, including 100% of writes. This is essential to delivering the unrivaled performance of Storage Spaces Direct, whether you measure that in&nbsp;millions of IOPS&nbsp;,&nbsp;Tb\/s of IO throughput&nbsp;, or consistent sub-millisecond latency.<\/p>\n\n\n\n<p><br>But nothing is free: these cache drives are liable to wear out quickly.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId--848291655\"><strong>Review: What is flash wear<\/strong><\/h2>\n\n\n\n<p>Solid-state drives today are almost universally comprised of NAND flash, which wears out with use. Each flash memory cell can only be written so many times before it becomes unreliable. (There are numerous great write-ups online that cover all the gory details \u2013 including&nbsp;on Wikipedia&nbsp;.)<br><br>You can watch this happen in Windows by looking at the&nbsp;<strong>Wear&nbsp;<\/strong>reliability counter in PowerShell:<br><br><strong>PS C:\\&gt; Get-PhysicalDisk | Get-StorageReliabilityCounter | Select Wear<\/strong><br><br>Here\u2019s the output from my laptop \u2013 my SSD is about 5% worn out after two years.<br><br><img decoding=\"async\" src=\"https:\/\/techcommunity.microsoft.com\/t5\/image\/serverpage\/image-id\/107521iDEC61107343CAD20\/image-size\/large?v=v2&amp;px=999\" alt=\"thumbnail image 2 of blog post titled \n\t\n\t\n\t \n\t\n\t\n\t\n\t\t\t\t\n\t\t\n\t\t\t\n\t\t\t\t\n\t\t\t\t\t\t\n\t\t\t\t\t\t\tUnderstanding SSD endurance: drive writes per day (DWPD), terabytes written (TBW), and the minimum recommended for ...\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\n\t\t\t\t\t\n\t\t\t\n\t\t\n\t\n\t\t\t\n\t\n\t\n\t\n\t\n\t\n\"><\/p>\n\n\n\n<p><br><em>Note: Not all drives accurately report this value to Windows. In some cases, the counter may be blank. Check with your manufacturer to see if they have proprietary tooling you can use to retrieve this value.<\/em><br><br>Generally, reads do not wear out NAND flash.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId-894518680\"><strong>Quantifying flash endurance<\/strong><\/h2>\n\n\n\n<p>Measuring wear is one thing, but how can we predict the longevity of an SSD?<\/p>\n\n\n\n<p>Flash \u201cendurance\u201d is commonly measured in two ways:<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Drive Writes Per Day (DWPD)<\/li><li>Terabytes Written (TBW)<\/li><\/ul>\n\n\n\n<p>Both approaches are based on the manufacturer\u2019s warranty period for the drive, its so-called \u201clifetime\u201d.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId--1657638281\"><strong>Drive Writes Per Day (DWPD)<\/strong><\/h2>\n\n\n\n<p>Drive Writes Per Day (DWPD) measures how many times you could overwrite the drive\u2019s entire size each day of its life. For example, suppose your drive is 200 GB and its warranty period is 5 years. If its DWPD is 1, that means you can write 200 GB (its size, one time) into it every single day for the next five years.<\/p>\n\n\n\n<p>If you multiply that out, that\u2019s 200 GB per day \u00d7 365 days\/year \u00d7 5 years = 365 TB of cumulative writes before you may need to replace it.<\/p>\n\n\n\n<p>If its DWPD was 10 instead of 1, that would mean you can write 10 \u00d7 200 GB = 2 TB (its size, ten times) into it every day. Correspondingly, that\u2019s 3,650 TB = 3.65 PB of cumulative writes over 5 years.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId-85172054\"><strong>Terabytes Written (TBW)<\/strong><\/h2>\n\n\n\n<p>Terabytes Written (TBW) directly measures how much you can write cumulatively into the drive over its lifetime. Essentially, it just includes the multiplication we did above in the measurement itself.<br><br>For example, if your drive is rated for 365 TBW, that means you can write 365 TB into it before you may need to replace it.<br><br>If its warranty period is 5 years, that works out to 365 TB \u00f7 (5 years \u00d7 365 days\/year) = 200 GB of writes per day. If your drive was 200 GB in size, that\u2019s equivalent to 1 DWPD. Correspondingly, if your drive was rated for 3.65 PBW = 3,650 TBW, that works out to 2 TB of writes per day, or 10 DWPD.<br><br>As you can see, if you know the drive\u2019s size and warranty period, you can always get from DWPD to TBW or vice-versa with some simple multiplications or divisions. The two measurements are really very similar.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId-1827982389\"><strong>What\u2019s the difference?<\/strong><\/h2>\n\n\n\n<p>The only real difference is that DWPD depends on the drive\u2019s size whereas TBW does not.<br><br>For example, consider an SSD which can take 1,000 TB of writes over its 5-year lifetime.<br><br>Suppose the SSD is 200 GB:<br><br><strong>1,000 TB \u00f7 (5 years&nbsp;<\/strong><strong>\u00d7 365 days\/year&nbsp;<\/strong><strong>\u00d7 200 GB) = 2.74 DWPD<\/strong><br><br>Now suppose the SSD is 400 GB:<br><br><strong>1,000 TB \u00f7 (5 years&nbsp;<\/strong><strong>\u00d7 365 days\/year&nbsp;<\/strong><strong>\u00d7 400 GB) = 1.37 DWPD<\/strong><br><br>The resulting DWPD is different! What does that mean?<br><br>On the one hand, the larger 400 GB drive can do the exact same cumulative writes over its lifetime as the smaller 200 GB drive. Looking at TBW, this is very clear \u2013 both drives are rated for 1,000 TBW. But looking at DWPD, the larger drive appears to have just half the endurance! You might argue that because under the same workload, it would perform \u201cthe same\u201d, using TBW is better.<br><br>On the other hand, you might argue that the 400 GB drive can provide storage for more workload because it is larger, and therefore its 1,000 TBW spreads more thinly, and it really does have just half the endurance! By this reasoning, using DWPD is better.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId--724174572\"><strong>The bottom line<\/strong><\/h2>\n\n\n\n<p>You can use the measurement you prefer. It is almost universal to see both TBW and DWPD appear on drive spec sheets today. Depending on your assumptions, there is a compelling case for either.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId-1018635763\"><strong>Recommendation for Storage Spaces Direct<\/strong><\/h2>\n\n\n\n<p>Our minimum recommendation for Storage Spaces Direct is listed on the&nbsp;Hardware requirements&nbsp;page.&nbsp;As of mid-2017, for cache drives:<br><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>If you choose to measure in DWPD, we recommend 3 or more.<\/li><li>If you choose to measure in TBW, we recommend 4 TBW per day of lifetime. Spec sheets often provide TBW cumulatively, which you\u2019ll need to divide by its lifetime. For example, if your drive has a warranty period of 5 years, then 4 TB \u00d7 365 days\/year \u00d7 5 years = 7,300 TBW = 7.3 PBW total.<\/li><\/ul>\n\n\n\n<p><br>Often, one of these measurements will work out to be slightly less strict than the other.<\/p>\n\n\n\n<p>You may use whichever measurement you prefer.<\/p>\n\n\n\n<p><br>There is no minimum recommendation for capacity drives.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"toc-hId--1533521198\"><strong>Addendum: Write amplification<\/strong><\/h2>\n\n\n\n<p>You may be tempted to reason about endurance from IOPS numbers, if you know them. For example, if your workload generates (on average) 100,000 IOPS which are (on average) 4 KiB each of which (on average) 30% are writes, you may think:<br><br><strong>100,000&nbsp;\u00d7 30%&nbsp;\u00d7 4 KiB = 120 MB\/s of writes<\/strong><br><br><strong>120 MB\/s&nbsp;\u00d7 60 secs\/min&nbsp;\u00d7 60 mins\/hour&nbsp;\u00d7 24 hours = approx. 10 TBW\/day<\/strong><br><br>If you have four servers with two cache drives each, that\u2019s:<br><br><strong>10 TBW\/day \u00f7 (8 total cache drives) = approx. 1.25 TBW\/day per drive<\/strong><br><br>Interesting! Less than 4 TBW\/day!<br><br>Unfortunately, this is flawed math&nbsp;because it does not account for write amplification.<br><br>Write amplification is when one write (at the user or application layer) becomes multiple writes (at the physical device layer). Write amplification is inevitable in any storage system that guarantees resiliency and\/or crash consistency. The most blatant example in Storage Spaces Direct is three-way mirror: it writes everything three times, to three different drives.<br><br>There are other sources of write amplification too: repair jobs generate additional IO; data deduplication generates additional IO; the filesystem, and many other components, generate additional IO by persisting their metadata and log structures; etc. In fact, the drive itself generates write amplification from internal activities such as garbage collection! (If you&#8217;re interested, check out the&nbsp;JESD218&nbsp;standard methodology for how to factor this into endurance calculations.)<br><br>This is all necessary and good, but it makes it difficult to derive drive-level IO activity at the bottom of the stack from application-level IO activity at the top of the stack in any consistent way.&nbsp;That\u2019s why, based on our experience, we publish the minimum DWPD and TBW recommendation.<\/p>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p><span class=\"span-reading-time rt-reading-time\" style=\"display: block;\"><span class=\"rt-label rt-prefix\">Reading Time-approximately:<\/span> <span class=\"rt-time\"> 7<\/span> <span class=\"rt-label rt-postfix\">minutes<\/span><\/span>Write endurance is the number of program\/erase (P\/E cycles) that can be applied to a block of flash memory before the storage media becomes unreliable. By Tekmart Africa Enterprise Support Team The Remaining Rated Write Endurance Threshold feature is&nbsp;supported on PCIe and SAS\/SATA SSDs. The Remaining Rated Write Endurance Threshold feature provides added feature functionality in managing the threshold level<\/p>\n<p><a class=\"more-link\" href=\"https:\/\/tekmart.co.za\/t-blog\/what-is-remaining-rated-write-endurance-in-ssds-technical-definition-parlance\/\">Read More<\/a><\/p>\n","protected":false},"author":114,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[38,35,8,36,4,268,30,3,12,196],"tags":[],"class_list":["post-7262","post","type-post","status-publish","format-standard","hentry","category-best-practices-for-data-center-operations","category-data-center-facilities","category-data-center-hardware","category-data-center-systems-management","category-datacenter-news","category-datacentre-energy-efficiency-and-green-it","category-expert-advise-and-opinion","category-industry-news-and-expert-advise","category-tekmart-enterprise-hardware-tips","category-tekmart-support-storage-systems-implementation-tutorials"],"_links":{"self":[{"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/posts\/7262","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/users\/114"}],"replies":[{"embeddable":true,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/comments?post=7262"}],"version-history":[{"count":6,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/posts\/7262\/revisions"}],"predecessor-version":[{"id":7309,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/posts\/7262\/revisions\/7309"}],"wp:attachment":[{"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/media?parent=7262"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/categories?post=7262"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/tekmart.co.za\/t-blog\/wp-json\/wp\/v2\/tags?post=7262"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}