Depends on what you want. If it is important for you that there is no downtime when one of your disks fails (and a disk fail has a much higher probability than an electronics/controller fail), use a RAID.
If it is more important that your data is safe (i. e. a backup), use separate disks and sync them manually (maybe even keep multiple generations as you can always accidentally delete important files).
A RAID is not a backup (in the rare case that the electronics fail). You will need a backup (at least if you think your data is somewhat important), but you will only need a RAID if uptime is important to you.
What happens if I have to change the underlying hardware behind the
zfs pool? Like the mobo / processor, what happens if that dies on me
in a year or two; can I port my zfs pool somehow?
A ZFS pool is not hardware dependent. Just make sure your HBA (Host Bus Adapter) isn't doing something like encrypting your data at the hardware level. ZFS works best with a HBA like an LSI 9211-8i or an IBM m1015 cross-flashed to use the 9211-8i firmware, not a full blow "hardware" RAID card.
I've got quite the set of different sized drives, and i'm trying to
get the most storage space out of it with redundancy. What is the best
setup for this config, and how much space will I be losing by using
these different size drives. I am not creating this for any speed
requirements, I just want a file server for multiple HTPCs. My
currently available drives for this are: 1x 500GB 'Hybrid' Drive 1x
1TB Drive 1x 3TB Drive 1x 4TB Drive (will be added to the pool later,
currently holding all the data from the the drives listed above)
If I were you I would sell the smaller drives and put the money towards larger drives of all the same size. It will make your life a lot easier. Also, you cannot just add drives to a ZFS pool. There are constraints. Read here.
Will adding the 4TB drive to the pool later be a problem of any kind?
Possibly. I am in a similar position. At some time in the future I will have to increase my storage capacity. At that time I plan on purchasing a second HBA and a new array of larger drives. I will then transfer all the data from my existing drives to my new drives then sell my existing drives. There may be other (cheaper) ways around this, but doing it this way:
- Keeps all of my drives the same size
- Only has the additional cost of an extra HBA, which isn't a bad thing to have laying around anyhow
- Does not require me to replace my drives one at a time, re-silvering after each replacement.
Any recommendations on a Linux OS to run this all on, and should I use
a separate drive for the OS? I'm familiar with Ubuntu, RHEL, and
OpenSUSE / SLES.
Don't use Linux, it does not have native ZFS support. Linux support of ZFS comes from ZFS on Linux and zfs-fuse. The current state of ZFS is in flux as Oracle tries their best to ruin it. ZFS will likely branch at version 28 in the very near future, so don't make your ZFS pool with any version greater than 28 unless you are 100% certain you want to stick with an Oracle solution. Currently FreeBSD and its spinoffs support ZFS version 28.
Since you are a self proclaimed ZFS noob I would recommend FreeNAS. I have been using it for awhile now and I'm pretty happy with it. It will definitely allow the most straight forward setup for you.
Additional Thoughts:
Make sure you choose the correct level of parity for your particular use case. Specifically, make sure you plan around URE. Basically you don't want to use RAID 5 (RAID Z1) if you are using anything larger than 2TB drives. There are some other factors to consider that may prompt you to increase your level of parity data as well. Here is a good article on the subject.
Update:
It has been 1.5 years since I posted this answer and in that time I have been giving ZFS on Linux (Ubuntu server specifically) another chance. It has come a long way since I tried first tried it and I'm pretty happy so far. My reason for switching was the installation restrictions on FreeNAS and the jailing system. I wanted to use my server for more than just a NAS server and FreeNAS makes that hard. The jailing system is good and very secure, but I didn't really need that level of security in my home and I didn't want to deal with logging into a jail every time I wanted to unzip a file. I think FreeNAS is still a good choice if you are just getting started with ZFS (because of the web interface) or if you just want a NAS appliance (i.e. no other server functionality needed).
Best Answer
A method some people use, but that is not recommended:
The number of "devices" that a vdev is created with, is permanent after creation. Therefore, you must have three "disks" at the time of creation. This method gives a way to create a degraded RAID-Z1 pool and later restore the absent redundancy.
This method is not recommended, because while copying and while the 3rd drive is absent, there is no redundancy/parity. But it can work.