Ubuntu – Encoding and decoding error in non-English name of compressed file

encoding

I have a rar file. After extraction, it generates a file with Chinese name which is shown in Nautilus as:

��������ѧ.������.����������ѧ������.2008.djvu (invalid encoding)

In terminal it is shown as:

<BD><FC><B4><FA><D7><E9><BA><CF>ѧ.<CD><F5><CC><EC><C3><F7>.<B4><F3><C1><AC><C0><U+DE64><B4><F3>ѧ<B3><F6><B0><E6><C9><E7>.2008.djvu

The content of the rar file listed by unrar is correct:

$ unrar l 近代组合学.王天明.大连理工大学出版 社.2008.rar    
UNRAR 3.93 freeware      Copyright (c) 1993-2010 Alexander Roshal    
Archive 近代组合学.王天明.大连理工大学出版社.2008.rar    
 Name             Size   Packed Ratio  Date   Time     Attr      CRC   Meth Ver
-------------------------------------------------------------------------------
 近代组合学.王天明.大连理工大学出版社.2008.djvu  6190416  6187189  99% 03-06-11 10:33  .....A.   98320D40 m3g 2.9
-------------------------------------------------------------------------------
    1          6190416  6187189  99%

The file cannot be open unless I change its name to be something like 1.djvu.

I was wondering why the characters are not shown properly with Chinese name of compressed file, while I can create a directory or file with Chinese name?

How shall I do?

Thanks and regards!

Best Answer

Perhaps the graphical archive program does not understand Chinese. Try extracting the archive using the command-line:

  1. Open a terminal.
  2. Navigate to the directory containing the file:

    cd /path/to/directory/
    

    You can use the Tab key to complete filenames and directory names. Double press Tab to get a list of possible completions in case there is more than one option.

  3. Run the unrar program to unpack filename.rar:

    unrar x filename.rar
    

    Here, you can use tab-completion too for the filename.

  4. The contents of the archive will be visible in the current directory.