My question is, if I purchase a standard Windows external hard drive with a USB connection, will I be able to copy the files from the Linux cluster's files server to the external drive?
Yes, there is no technical problem to this, however:
The hardware us not a "standard windows hard drive with USB connection". Please scrap the windows part from that sentence. And external USB HDD will work equally well with or without windows as the OS.
I am assuming that the Linux cluster has a USB port, but this is something that I will need to verify.
For a large amount of data (and 1TB is a lot) connecting the drive locally is probably a lot faster. However with USB2 you are still limited to 35-ish MB/sec. That means that copying 1TB over USB2 takes about 8-9 hours.*
You can speed that up a lot if the drive is locally mounted (via plain SATA), if the cluster and your drive have eSATA, if both have USB3 or if both have firewire.
Alternatively you can connect the drive to your own desktop and copy the files. In this case the network might be the speed limit. You also risk an angry administrator asking why you are making the network so slow for other users. :-)
It looks like many standard Windows external hard drives are formatted in either NTFS or FAT32, whereas our Ubuntu Linux file server uses NFS.
uhm, no.
The hard disk does not care which filesystem is used. It may come pre-formatted with NTFS (which is a sensible choice for most people who buy them), but nothing stop you from changing the filesystem and reformatting. That should only take a few minutes.
Also, your file server does not use NFS on its hard disks. It is probably using ext2, ext4 or ZFS. Neither of which you need to worry about. As long as you can read the data you can write it in any format.
(Consider the analogy: You copy the text written in a notebook. Do not worry about the form or the colour of the original notebook. As long as you can read it and have a large enough notebook of your own you can copy the content from one notebook to another).
*: 8-9 hours estimated based on this:
35 MiB/second
100 MiB per 3 seconds.
1000 MiB per 30 seconds, which is the same as 1GiB per 30 seconds.
1GiB per 30 seconds
1000GiB per 30000 seconds
1TiB per 30000 seconds. 30000/3600=8.3 (3600 seconds per hour)
Best Answer
There's no difference between an external drive and an internal drive in terms of the filesystem stored on it. The owner & group of the filesystem's root directory are stored in its root directory, the same way your root filesystem's owner & group are stored.
A corollary of this is that because UIDs and GIDs are stored only numerically, if you mount an external drive on a system with different users in
/etc/passwd
, you'll see that the owner & group have changed to whatever that UID & GID map to on the other system. (e.g. if on your system userme
is UID 1000, and you mount the drive on a system where UID 1000 isrms
, you'll see the directory owned byrms
.)On the other hand, if you use a filesystem that doesn't store UID/GID information (like FAT), then the UID/GID of every file on the filesystem is taken from the parameters you gave to the
mount
command (either directly or through/etc/fstab
).A second corollary is that it doesn't matter what owner or permissions
/mount/directory
had on your root filesystem. Once a filesystem is mounted there, that filesystem's permissions are the ones that matter. That's why I like tochmod a-rwx /mount/directory
before mounting the filesystem. It prevents me from accidentally writing to/mount/directory
when the filesystem is not mounted there.