I recently had a big issue with my backup machine, on which I had a couple of disks mounted, and where I copied some important data to, once in a while, as a sort-of-backup strategy.
Off course, the one disk holding the most important data died, and I didn’t notice fast enough… And when the shit hits the fan, Murphy decided to crash my MS Exchange setup and database on my testing machine, which held _all_ my mails from the past 8 years… (very important things, config backups, license keys, important addresses, passwords, …) I realised: now is _the_time_ to get a decent backup strategy set up.
I already had a small 2U rackmountable box which I got from a former colleague and that could hold 5 normal 3,5″ disks. I had 3 Western Digital Raid Edition 250GB SATA-II disks in the old ‘backup server’ that could be wiped after some copying, and an Athlon XP 2600+ without any SATA on board. I plugged in 2 SiS PCI 2 port SATA cards and ordered a fourth WD RE 250GB disk at work.
Looking for a decent setup, easy to use, easy to backup and stable enough to count on, I started checking google, and almost instantly got the FreeNAS website as one of the best results. Decent background, FreeBSD based, good hardware support, easy to manage Software Raid setups and loooots of protocols for file transfer. I downloaded a LiveCD and gave it a try. At the first sight, I was overwelmed… NFS, SMB/CIFS, FTP, SSH, AFP, Rsync, and even iSCSI Target were among the list of supported services. Software Raid was easy and fast, and the hardware support was very good.
I started building a system around the old AMD Athlon XP 2600+ I still had lying around, and as AMD’s are pretty power-friendly, I thought this was the best solution to start with. I hooked up an IDE-CF adapter, put in an old 128MB CF Kingston card and installed the live CD onto this card. I started defining a Raid 5 volume on UFS, giving me a total of 715426MB of storage, around 700GB which was more than I need as a backup server. When finalizing the Raid setup, I was asked the option to Initialize And Format the Raid ‘device’, and I said ‘hell yeah!, go for it. make it even faster to start’
The initialization started, but for some reason, it always stopped working after a couple of minutes/hours, and the system just froze up. Nothing to check, nothing to see. Keyboard and screen just locked, No info… Giving the machine a simple Reset didn’t do anything: the Raid volume began Checking and Verification, and after about half an hour, freeze was there again.
At a local computer fair, I acquired a new Nvidia nForce-with-geforce-onboard mbo, and a new LE2xxx series AMD cpu, the total package costing me no more than €60. The board has the needed 4 SATA-II ports, so this one looked to be the perfect match.
Once the board was set in to the case, the CPU installed, heatsink clicked on, Ram and disks installed in place, I got the system installing into this new hardware setup. No network card was found, so I had to put in a PCI Realtek 8139 based card to enable the network connection. A restart later I was able to finish the Disk and Raid setup.
After clicking the ‘Initialize Raid Volume’ again, same things happened: system froze up after a while, nothing to be done… No access at all. A simple Reset didn’t help either. It just took 20 minutes and the system locked up again.
I was panicing now. I had the system up and running on a single disk before, no problems. I checked all four disks, no issues. I checked the power supply (450W should be enough, PSU was brand new), nothing there either. I checked the CF card, and replaced it twice. No change… What now?
I completely started over, as there is still no data on it I cannot miss. A full reinstall, reconfig, although this time, I didn’t set the mark for the automatic ‘Create and Initialize RAID’. I let the Raid array build itself, verify and waited until I got the Completed message (which took about a night… 700GB is a lot of diskspace to be verified and initialised). After that, I did the last steps: mounting and exporting the Raid volume through NFS and SMB/Cifs. It looked as if it worked without any error.
The machine had been running for two days since, and no lock ups, no errors.
I started putting in backup schedules (an rsync batch job scheduled through cron from 2 servers holding important data, and an ntbackup job backing up the recovered Exchange database and Active Directory info needed to restore all email stuff if Murphy would strike me again…) and let it run for a few nights, and that too looked pretty good.
As the newly bought mbo/cpu could come in more handy for another machine, I decided to replace this combo with the originally chosen hardware. I labelled the disks and SATA connection cables 1-4, and set them in what I thought would be the right order on the PCI SATA cards. The disk id’s chowed ad4-ad6-ad8-ad10 instead of what used to be (on the new nForce mbo with onboard SATA-II) ad4-ad5-ad6-ad7 and I noticed I switched disk 3 and 4 because of the cable for number 4 wasn’t long enough to reach the card. However, the Raid volume came back up after completing the boot, and even better: all testing data was still there and available. I guess FreeNAS built in some disk-id recognition into its Software Raid setup and config. The system was up and running in 5 minutes, without the need of a complete reconfiguration. How sweet!!!
Now, let’s see if this hardware keeps up, so I can use the newly bought AMD board for some other stuff.
Backups are set, in place and running. Let’s hope I don’t need them very soon!!!