Text Manipulation – Remove Duplicate Text Lines from Files Larger Than 4GB

64-bit, text manipulation, text-editors, windows

I am looking for an open source (possibly 64-bit) Windows text editor that will allow me to remove duplicate lines from an extremely large (4GB+) text file.

What do you use to remove duplicate lines from your large text files?

Best Answer

sort -u file > outfile

A handy native Win32 port of sort is included in UnxUtils.

For more complicated meanings of "remove duplicates" there is Perl (et al.).
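As one illustration of a "more complicated meaning", here is a minimal Python sketch (Python standing in for the "et al."; it is not part of the original answer) that keeps the first occurrence of each line in its original order instead of sorting. The function name `dedupe_lines` and its `key` parameter are hypothetical names chosen for this example. To keep memory lower on a very large file, it remembers a 16-byte digest of each distinct line rather than the line itself. That is an assumption-laden trade-off: memory still grows with the number of distinct lines, and a digest collision, while extremely unlikely, would drop a non-duplicate line.

```python
import hashlib
from typing import BinaryIO, Callable


def dedupe_lines(
    src: BinaryIO,
    dst: BinaryIO,
    key: Callable[[bytes], bytes] = lambda body: body,
) -> int:
    """Copy src to dst, keeping only the first line seen for each key.

    `key` maps a line (without its line ending) to the bytes used for
    comparison, e.g. bytes.lower for case-insensitive matching.
    Returns the number of lines written.
    """
    seen = set()  # 16-byte digests of the keys seen so far
    written = 0
    for line in src:  # streams the file line by line
        # Compare without the line ending, so a final line that lacks
        # a trailing newline still matches earlier copies of itself.
        body = line.rstrip(b"\r\n")
        digest = hashlib.blake2b(key(body), digest_size=16).digest()
        if digest not in seen:
            seen.add(digest)
            dst.write(line)  # write the original bytes unchanged
            written += 1
    return written
```

To use it on real files, open both in binary mode, for example `open("file", "rb")` and `open("outfile", "wb")`, and pass them as `src` and `dst`. Passing `key=bytes.lower` treats lines that differ only in ASCII case as duplicates. If the number of distinct lines is too large even for the digests, the sort-based approach above remains the safer choice.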
