View previous topic :: View next topic |
Author |
Message |
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
Posted: Fri Jun 03, 2011 4:38 pm Post subject: Incorrect diacritic in system enviroment |
|
|
I have problem with diacritics of other languages than slovak, english in my system. Name of songs, movies... everything what has some special character.
http://img7.imagebanana.com/img/hl4k026z/Selection_001.png
it should be there: Tцck's Taunt
Last edited by rado3105 on Fri Jun 03, 2011 7:36 pm; edited 1 time in total |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Fri Jun 03, 2011 6:21 pm Post subject: |
|
|
Code: | identify Selection_001.png
Selection_001.png PNG 301x176 301x176+0+0 8-bit DirectClass 8.93KBB 0.000u 0:00.000 |
...and that's about all that can be said about that picture.
Now, you didn't say what program did produce the image, that screenshot shows, nor what encoding/filesystem the filename uses, nor even what's your locale. |
|
Back to top |
|
|
John R. Graham Administrator
Joined: 08 Mar 2005 Posts: 10590 Location: Somewhere over Atlanta, Georgia
|
Posted: Fri Jun 03, 2011 6:36 pm Post subject: |
|
|
Well, yes, but he did provide the text, which contains the Unicode character that he's looking for, namely ц, which is D186 (Cyrillic Small Letter Tse). This shows up just fine in my standard en_US.UTF-8 locale.
- John _________________ I can confirm that I have received between 0 and 499 National Security Letters. |
|
Back to top |
|
|
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
Posted: Fri Jun 03, 2011 6:52 pm Post subject: |
|
|
make.conf:
LINGUAS="en sk cz"
/etc/locale.gen:
Code: |
en_US ISO-8859-1
en_US.UTF-8 UTF-8
#ja_JP.EUC-JP EUC-JP
#ja_JP.UTF-8 UTF-8
#ja_JP EUC-JP
#en_HK ISO-8859-1
#en_PH ISO-8859-1
#de_DE ISO-8859-1
#de_DE@euro ISO-8859-15
#es_MX ISO-8859-1
#fa_IR UTF-8
#fr_FR ISO-8859-1
#fr_FR@euro ISO-8859-15
#it_IT ISO-8859-1
cs_CZ ISO-8859-2
cs_CZ.UTF-8 UTF-8
sk_SK ISO-8859-2
sk_SK.UTF-8 UTF-8 |
|
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Fri Jun 03, 2011 7:26 pm Post subject: |
|
|
What was pasted into a forum post doesn't need to reflect anything on the system in question - browser/forum software may have converted it a few times.
Not caring much about locale.gen, output of 'locale' much more on topic.
Still no info about filesystem/filename encoding used.
What does i.e. 'LANG=C ls -b' print for that file ? |
|
Back to top |
|
|
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Tue Jun 07, 2011 2:57 pm Post subject: |
|
|
Do you want another 'identify' snippet for that ?
As I don't know the language, all I can say is that the output looks correct...if somebody got a bright idea to run a program in an utf8 locale and redirect the output to a terminal running in an 8bit locale. |
|
Back to top |
|
|
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
Posted: Tue Jun 07, 2011 4:43 pm Post subject: |
|
|
Sorry, but I dont understand what you mean. It is slovak language, it is incorrect. I wrote that all other languages displays with bad diacrytics. If I go to that server, or I list files on my hardisk from other systems(maemo, ubuntu) I dont have any problem.
It must be some problem with gentoo language configuration. |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Tue Jun 07, 2011 5:26 pm Post subject: |
|
|
You must mean "it's problem of my own configuration".
IIRC, (in regard of filesystems) there's not such thing as filesystem encoding - filenames are simply 'const char*' written as-is in the current locale. I tell you one thing - mixing filenames with different encodings on one partition leads to major headaches.
In regard to that output, I suspect if the output went through '| iconv -f utf8 -t latin2' pipe, it would 'suddenly' become readable.
So once again, what's the output of 'locale' ? |
|
Back to top |
|
|
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
Posted: Tue Jun 07, 2011 5:31 pm Post subject: |
|
|
Code: | r-c@gentoo-rc-desktop ~ $ locale
LANG=
LC_CTYPE="POSIX"
LC_NUMERIC="POSIX"
LC_TIME="POSIX"
LC_COLLATE="POSIX"
LC_MONETARY="POSIX"
LC_MESSAGES="POSIX"
LC_PAPER="POSIX"
LC_NAME="POSIX"
LC_ADDRESS="POSIX"
LC_TELEPHONE="POSIX"
LC_MEASUREMENT="POSIX"
LC_IDENTIFICATION="POSIX"
LC_ALL=
|
The files are mainly in windows enkoding(cp-1250), because 95% of people uses windows and majority of them create that files. What can I do with that. I would like to have everything in UTF-8, but its ... micorsoft problem. |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Tue Jun 07, 2011 10:25 pm Post subject: |
|
|
...and we're back to my initial question: what's that filesystem type on that partition ?
If it's not ntfs/vfat family, then your ramblings about Windows encodings just don't make sense - unless you're talking about file content, which is a completely different topic.
Setting LANG to sk_SK.UTF-8 would help sanitizing the situation. |
|
Back to top |
|
|
rado3105 Apprentice
Joined: 14 Jul 2007 Posts: 293
|
Posted: Wed Jun 08, 2011 1:01 pm Post subject: |
|
|
I added
LC_CTYPE=sk_SK.UTF-8
in
/etc/env.d/02locale
then env-update && source /etc/profile
nothing has change |
|
Back to top |
|
|
VoidMage Watchman
Joined: 14 Oct 2006 Posts: 6196
|
Posted: Wed Jun 08, 2011 3:54 pm Post subject: |
|
|
Well, third time's the charm:
- 'LANG=C ls -b' on the files in question
- filesystem type and other mount options of the partition in question
You do know, that changes in /etc/profile apply only to the apps run from the shell, that sourced that file after the changes, right ?
Also, IIRC sudo by default won't source it.
Read docs of convmv a bit. |
|
Back to top |
|
|
|