Linux – What are the differences between Linux and Windows .txt files (Unicode encoding)

asciilinuxwindows

I am only using the 128 character set defined in the original ANSI standard.

But as a whole how are the files implmeneted differently.

I am not concerned with the display, i.e. if a tab is displayed with 6 or 8 characters but the actual internal representation in memory

One difference I've heard is the use of \r\n (Windows) vs. \n for line termination (Linux).

Best Answer

"Unicode" on Windows is UTF-16LE, and each character is 2 or 4 bytes. Linux uses UTF-8, and each character is between 1 and 4 bytes.

"The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"