The smartmontools package contains two utility programs (smartctl and smartd) to control and monitor storage systems using the Self-Monitoring, Analysis and Reporting Technology System (SMART) built into most modern ATA and SCSI hard disks. In many cases, these utilities will provide advanced warning of disk degradation and failure. It should run on any modern Darwin (Mac OS X), Linux, FreeBSD, NetBSD, OpenBSD, Solaris, OS/2, eComStation, QNX, or Windows system.

On Self-Monitoring, Analysis, and Reporting Technology: "Mechanical failures, which are usually predictable failures, account for 60 percent of drive failure. [The purpose of S.M.A.R.T.] is to warn a user or system administrator of impending drive failure while time remains to take preventative action such as copying the data to a replacement device. Approximately 30% of failures can be predicted by S.M.A.R.T."

Version 5.38 of Smartmontools was recently released. The changes include several Libata/Marvell driver improvements.

The usual configure, make and make install steps were performed on an Ubuntu 7.04 system with no troubles. The operation instructions from the README file were followed, and the software was able to discover data from the one hard drive on the system, showing the wide variety of drive information that Smartmontools can display.

If you are a systems administrator who needs to keep track of hard drive reliability data, Smartmontools should be able to provide it. With the addition of a small amount of glue-logic scripting, it should not be too difficult to set up an automated drive monitoring system.

Posted 21:45 UTC (Fri) by giraffedata (subscriber, #1954)

However, the value of the data is usually much greater than the cost of the disk, so it's quite an easy decision.

Often, the data is relatively unimportant, like a Google web page cache or a small part of a stream of undifferentiated experimental data. The rest of the time, the data is easily reconstructable, e.g. by copying from a mirror disk or backup tape.

People set up storage systems so that the value of preserving the data is commensurate with the cost of preserving it. If you perturb that system by replacing drives more often based on SMART data, I think you'll have a net loss. On the other hand, if you could exploit SMART data so as to get the same reliability with fewer redundant copies, that would be a win.

Either the Google paper or another that came out around the same time concluded that the best policy was to wait for a drive to fail, then replace it. One surprise: keeping disks cooled under 30°C *reduces* life expectancy, if you want to jump to conclusions, but the study didn't actually isolate the cooling policy.
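As an illustration of the kind of glue-logic scripting mentioned in this article, here is a minimal Python sketch that reads the attribute table printed by `smartctl -A` and lists the attributes whose normalized value has dropped to or below the vendor threshold. The function names and the sample values are invented for this example, and the column layout is assumed to match what smartctl prints for ATA disks. It is a starting point under those assumptions, not a complete monitoring system.

```python
import subprocess

# Sample text in the layout printed by `smartctl -A` for an ATA disk.
# The values are invented for illustration, not taken from a real drive.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   006    Pre-fail  Always       -       0
  5 Reallocated_Sector_Ct   0x0033   030   030   036    Pre-fail  Always   FAILING_NOW 1200
194 Temperature_Celsius     0x0022   034   028   000    Old_age   Always       -       34
"""


def parse_attributes(text):
    """Return one dict per row of the attribute table found in `text`."""
    rows = []
    in_table = False
    for line in text.splitlines():
        fields = line.split(None, 9)  # RAW_VALUE may itself contain spaces
        if fields[:2] == ["ID#", "ATTRIBUTE_NAME"]:
            in_table = True  # header found; data rows follow
            continue
        if not in_table:
            continue
        if len(fields) < 10 or not fields[0].isdigit():
            break  # a blank or malformed line ends the table
        rows.append({
            "id": int(fields[0]),
            "name": fields[1],
            "value": int(fields[3]),
            "worst": int(fields[4]),
            "thresh": int(fields[5]),
            "type": fields[6],
            "raw": fields[9],
        })
    return rows


def failing_attributes(rows):
    """Attributes at or below a nonzero threshold (zero means 'no threshold')."""
    return [r for r in rows if r["thresh"] > 0 and r["value"] <= r["thresh"]]


def read_attributes(device):
    """Run `smartctl -A` on a device such as /dev/sda (usually needs root).

    smartctl's exit status is a bitmask and can be nonzero even when the
    output is usable, so the status is deliberately not checked here.
    """
    result = subprocess.run(["smartctl", "-A", device],
                            capture_output=True, text=True)
    return parse_attributes(result.stdout)


print([r["name"] for r in failing_attributes(parse_attributes(SAMPLE))])
# prints ['Reallocated_Sector_Ct']
```

On a real system, `failing_attributes(read_attributes("/dev/sda"))` could be run periodically from cron and its result mailed to an administrator. The smartd daemon shipped with the package already covers much of this ground, so a script like this is mainly useful for feeding the data into a site's own reporting.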