View previous topic :: View next topic |
Author |
Message |
Lebkoungcity Apprentice
Joined: 16 Nov 2008 Posts: 212 Location: near Lebkoungcity (='Gingerbreadcity' =Nuremberg)
|
Posted: Thu Nov 19, 2009 11:22 am Post subject: [solved] KDE3->KDE4: some umlauts aren't recognized anymo |
|
|
Hi,
after fiddling a few days with some minor problems I switched with my main account to KDE4.3.1.
But now I have another issue I didn't think of before:
I have files with umlauts in their file-names.
Some are displayed normally and can be opened with any application. --> everything's fine
Some files are displayed with a ? (on console) or with a black rombus with a white ? (in KDE4) in their filename and can't be accessed in anyway (it says the file doesn't exist or the file is corrupted) [umlauts in some plain text-files aren't displayed too]
What makes it mysterious to me is that in KDE3.5.10 this problem doesn't seem to exist...
/etc/locale.gen
Code: |
en_US ISO-8859-1
en_US.UTF-8 UTF-8
de_DE ISO-8859-1
de_DE@euro ISO-8859-15
de_DE.UTF-8 UTF-8
de_DE.UTF-8@euro UTF-8
|
/etc/env.d/02locale
Code: |
LANG="de_DE.UTF-8@euro"
LC_COLLATE="C"
LC_CTYPE="de_DE.UTF-8@euro"
LC_MESSAGES="de_DE.UTF-8@euro"
LC_MONETARY="de_DE.UTF-8@euro"
LC_NUMERIC="de_DE.UTF-8@euro"
LC_TIME="de_DE.UTF-8@euro"
LC_PAPER="de_DE.UTF-8@euro"
KDM_LANG="de_DE.UTF-8@euro"
|
Output of "locale" is:
Code: |
LANG=de_DE.UTF-8@euro
LC_CTYPE=de_DE.UTF-8@euro
LC_NUMERIC=de_DE.UTF-8@euro
LC_TIME=de_DE.UTF-8@euro
LC_COLLATE=C
LC_MONETARY=de_DE.UTF-8@euro
LC_MESSAGES=de_DE.UTF-8@euro
LC_PAPER=de_DE.UTF-8@euro
LC_NAME="de_DE.UTF-8@euro"
LC_ADDRESS="de_DE.UTF-8@euro"
LC_TELEPHONE="de_DE.UTF-8@euro"
LC_MEASUREMENT="de_DE.UTF-8@euro"
LC_IDENTIFICATION="de_DE.UTF-8@euro"
LC_ALL=
|
Output of "locale -a" is:
Code: |
C
POSIX
de_DE
de_DE.iso88591
de_DE.iso885915@euro
de_DE.utf8
de_DE.utf8@euro
de_DE@euro
deutsch
en_US
en_US.iso88591
en_US.utf8
german
|
in /etc/make.conf I set a long time ago:
Code: |
...
USE="... nls ... unicode ..."
...
LINGUAS="de"
...
|
The KDE internationalization packages are installed
Code: |
kde-base/kde-l10n-4.3.1
kde-base/kde-i18n-3.5.10
|
_________________ "The most dangerous world view is the world view of the ones who haven't viewed the world."
Alexander von Humboldt (1769-1859)
Last edited by Lebkoungcity on Mon Nov 23, 2009 9:35 pm; edited 1 time in total |
|
Back to top |
|
|
Apheus Guru
Joined: 12 Jul 2008 Posts: 422
|
Posted: Thu Nov 19, 2009 12:36 pm Post subject: |
|
|
I had the same problem, but I was able to rename the files using konsole (and TAB completion). But I have no idea what causes this, and why only some files are affected. |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Thu Nov 19, 2009 5:02 pm Post subject: |
|
|
Lets start with a simple question: what is the filesystem encoding
of those files ? |
|
Back to top |
|
|
Lebkoungcity Apprentice
Joined: 16 Nov 2008 Posts: 212 Location: near Lebkoungcity (='Gingerbreadcity' =Nuremberg)
|
Posted: Fri Nov 20, 2009 6:31 am Post subject: |
|
|
Thanks for the hint!
1. Some text-file examples:
Code: | utrac -p Playlist.txt
ISO-8859-1 |
Code: | utrac -p Adressen\ Moskau-Karten
ISO-8859-1 |
and one with working umlauts:
Code: | utrac -p Mopped/MC-KW/Homepage_Nuernberg/TERMINE\ und\ Bilder\ aktualisieren
UTF-8 |
==> so these text-files aren't displayed with umlauts because they're saved with ISO-8859-1 instead of ISO-8859-15 or UTF-8? OK, but how could that have happened? Back then when I wrote these files they were displayed with umlauts (I wrote them with SuSE 10.0 or 9.something).
2. What about binary files?
example:
Code: | utrac -p Documents/Kulturgeschichte-Ann�herungsversuche.pdf
utrac: Binary data (error 310) |
_________________ "The most dangerous world view is the world view of the ones who haven't viewed the world."
Alexander von Humboldt (1769-1859) |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Fri Nov 20, 2009 5:29 pm Post subject: |
|
|
That's not filesystem encoding.
If that pdf is one of those incorrectly displayed files,
paste what does 'ls -b' prints on it, if not, do the same
with a file that is. And paste the same result for a file that's
displayed correctly (one with a name outside ASCII range). |
|
Back to top |
|
|
Lebkoungcity Apprentice
Joined: 16 Nov 2008 Posts: 212 Location: near Lebkoungcity (='Gingerbreadcity' =Nuremberg)
|
Posted: Mon Nov 23, 2009 7:30 am Post subject: |
|
|
Oh... sorry about this!
Code: |
ls -b Documents/Kulturgeschichte-Ann�herungsversuche.pdf
Documents/Kulturgeschichte-Ann\344herungsversuche.pdf
|
Code: |
ls -b Documents/umlaut-test_ä_ü_ö.odt
Documents/umlaut-test_ä_ü_ö.odt
|
_________________ "The most dangerous world view is the world view of the ones who haven't viewed the world."
Alexander von Humboldt (1769-1859) |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Mon Nov 23, 2009 1:07 pm Post subject: |
|
|
Well, there's your problem: \344 is 0xe4 - that file name is encoded
as latin1. That's a common pitfall of unix filesystems (at least such as ext2/3, not 100% about other)
- due to legacy reasons, filenames were created in system locale encoding,
which, till you've switched to utf8 locale, was most probably latin1.
You'll need a tool like convmv to fix such files in a convenient way. |
|
Back to top |
|
|
Lebkoungcity Apprentice
Joined: 16 Nov 2008 Posts: 212 Location: near Lebkoungcity (='Gingerbreadcity' =Nuremberg)
|
Posted: Mon Nov 23, 2009 7:12 pm Post subject: |
|
|
Hey thanks a lot! That tiny tool is exactly what I needed - right now it is crawling in a test-run through my home-directory _________________ "The most dangerous world view is the world view of the ones who haven't viewed the world."
Alexander von Humboldt (1769-1859) |
|
Back to top |
|
|
|