Internet-Based Offsite Backup: Check Out CloudBerry Backup
July 29th, 2010 by Paul Sterley | 1 Comment | Filed in Backup and RestoreThere are, of course, many ways to send your data into the cloud. More of them appear every day. Today, I’m going to talk to you about one combination of backup software, cloud storage, and cloud storage management software that I am putting in place for one of my customers.
The local backup software of choice in this case is StorageCraft ShadowProtect. It is writing a continuous incremental backup to disk, and it works well. That just leaves the problem of offsite backup.
This customer has about 1.5 TB of data. That narrows down the playing field of internet-based offsite storage considerably, due to pricing. With that much data, it becomes necessary to use inexpensive storage. I selected Amazon S3 as the most likely affordable option, and chose a NAS device which supports Amazon S3 to store the backups.
Then I discovered the flaw in my plan. The NAS device has a very limited interface for Amazon S3, and does not warn me when it fails to upload a file because that file exceeds the 5 GB limit imposed by Amazon. I had to start looking around for other options.
I became aware of Cloudberry Backup, by Cloudberry Lab. I’m putting it through its paces now, and so far it seems to be a pretty good tool for this job. In addition to Amazon S3, it supports several other cloud-based storage systems, but I’m working with Amazon right now, so that’s what I’ll talk about in this article.
CloudBerry Backup has a small footprint, is easy to install and configure, and does not use a lot of resources on the server. It has a lot of configuration options. The most recent version allows you to upload files that are located at network paths, so I can run this on a utility server and use it to upload data stored on a NAS device.
Most importantly, it seems to be a robust utility for sending the data to the cloud, making sure it gets there, and making sure it gets there my way. That last bit is the most important. When I first cranked up the NAS and told it to start uploading, the users got pretty upset because suddenly all of their bandwidth was being used up by the NAS. There was no throttling capability. I couldn’t pause the backup and wait for the evening, either. It was an all or nothing proposition.
CloudBerry Backup has bandwidth throttling built in, and a Pause Backup button, right where you might go looking for one (Quick note about the Pause function – when the backup was paused, the Speed indicator kept jumping around as if it was still uploading data and calculating speed – but the “Files Uploaded” counter did not increment. That was creepy. Not sure what that was about.)
There are also has more advanced options for setting the process priority, and number of simultaneous threads to use per backup plan. You can create a single backup plan with all of your data in it, or several backup plans with different schedules.
Wait, what about the 5 GB file size? In “Advanced Mode”, CloudBerry Backup splits large files into “Chunks” of whatever size you specify. Unfortunately, this means you can’t download and re-assemble your data from the cloud without the CloudBerry Backup software, so you’re locked into a single tool – but that’s often the case anyway. The software does not expire, and CloudBerry Lab is talking about releasing a standalone tool for chunk re-assembly anyway.
I figured the chunking thing would make the file structure on Amazon completely incomprehensible, and that I would need to set up multiple buckets for the different servers I was uploading backups from, each with its own folder pair and schedule, etc – but I was wrong. I could have done that, but it wasn’t necessary. CBB does a great job of making the folder structure on Amazon understandable, even with the chunking going on. The best part of the chunking is, if your file is smaller than the chunk size you have specified, it doesn’t get chunked. That makes it a lot easier for verifying files if there aren’t many that are larger than your specified chunk size. The down side is that even though Amazon supports 5 GB files, CBB currently has a maximum chunk size of 1 GB. Oh well, you win some and you lose some.
Other cool things it can do include e-mail notifications on success or failure, deleting files from the destination that no longer exist on the source (including a number of days to keep them), keeping versions of files on the destination, encrypting the data, compressing the data, and it even includes an option for Amazon’s new “Reduced Redundancy Storage” for an additional price break.
So what happens when bad things happen?
You can choose from several logging levels, and where to put the log files, so that’s a good start.
After setting a large backup in motion, I decided to be mean to the software. I pulled the plug on the internet connection. It was down for maybe 2-3 minutes as the cable modem booted and re-trained. When the connection came back up, CBB resumed uploading as if nothing had happened. Not bad.
Then I rebooted the server. When the server came back up, things were not quite as I expected. I opened the software, and it showed a backup running, but it didn’t seem to be getting anywhere. I gave it quite a bit of time, but it did not progress. The Pause button was available but not functional. Eventually, I clicked the Stop Backup button, which did work, and then the Start Backup button, at which point it started recalculating things and uploading files again. This bears more testing, as I wouldn’t want to have to remember to check the CBB software after every reboot.
For my next mean trick, I told the CloudBerry Backup service to restart (not using the app interface, I went into services.msc). It restarted in about a half second, and the console didn’t even seem to notice, it just kept plugging away. Hmm. Then I stopped the service and left it off. The software kept running. OK, that’s weird. I guess the service is just a function for starting the software when the user is not logged in.
I went into Task Manager and ended task on CBBackupPlan.exe. That brought the console up short. At that point it seemed to be in pretty much the same state as when I rebooted. I stopped the backup again, started it again, and it once again counted up the files and started transferring them.
OK, so there’s a bit of streamlining needed in this product – but hey, for the great low price, it’s a pretty good setup. It’s cheap, it’s easy to use, it has a lot of configuration options, and it’s somewhat resilient to abuse.
There’s another module called CloudBerry Explorer which has some good features for managing your offsite storage. At the moment, it doesn’t seem to have inherited the Network Shares feature, so I can’t really use it, but I’m sure that’s coming soon.
Tags: Amazon S3, Offsite Backup

