#1 |
Jun 2010
Pennsylvania
947 Posts |
I just noticed that Top Producers listings that include characters with accent marks are not showing up properly, although they did until very recently. Has something changed in how accented characters are displayed? This is happening on every PC I've checked, so I know it's not a settings change at the user end.
To give a couple of examples: #485 in the TF Top Producers, André Jordi, is showing up as "Andr� Jordi", with a small box where the "é" should be. And #713, Jean-François Nies, is rendered as "Jean-Fran�ois Nies." You get the idea. What happened to change this, and can it get fixed?
R�drig� (just kidding there) |
#2 |
"/X\(‘-‘)/X\"
Jan 2013
5×7²×13 Posts |
It looks like the pages are all setting their character set to UTF-8, but I bet nobody converted the existing names in the database, which are probably ISO-8859-1 encoded.
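A minimal Python sketch of that guess, assuming the stored name really is ISO-8859-1 bytes and the page decodes it as UTF-8 (the site itself is not Python; this only illustrates the byte-level effect):

```python
# "André Jordi" as it would sit in an ISO-8859-1 column: é is the single byte 0xE9.
stored = "André Jordi".encode("iso-8859-1")

# A page declared as UTF-8 makes the browser decode those bytes as UTF-8.
# 0xE9 followed by a space is not a valid UTF-8 sequence, so the decoder
# substitutes U+FFFD, the replacement character seen in the listings.
shown = stored.decode("utf-8", errors="replace")
print(shown)

# Re-encoding the stored text as UTF-8 before output avoids the problem.
fixed = stored.decode("iso-8859-1").encode("utf-8")
print(fixed.decode("utf-8"))
```

If the assumption holds, the first line printed contains the replacement character and the second shows the name intact.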
|
#3 |
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
I am seeing the '�' character under my normal Unicode encoding setting in Firefox. Changing the encoding to Western changes what is shown in that position (see attachment). It displays yet other sets of characters under other settings like Central European Windows or ISO.
So this is just a bit of cleanup on the migration, as I understand your answer, Mark.
Last fiddled with by kladner on 2014-09-07 at 06:03 Reason: forgot attachment |
#4 | |
Aug 2002
3·43·67 Posts |
Quote:
|
|
#5 |
Jun 2010
Pennsylvania
947 Posts |
FWIW, when I manually typed in the special characters to show what they were supposed to look like on the producers list, I entered the é as ALT-130, while the ç was ALT-135 (in Windows).
Rodrigo |
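For what it's worth, those ALT codes can be checked in Python. This sketch assumes the Windows OEM code page is 437, which is what ALT codes typed without a leading zero use on a US-English system; other locales may differ:

```python
# ALT-130 and ALT-135 are byte values in the OEM code page (assumed here: cp437).
alt_130 = bytes([130]).decode("cp437")
alt_135 = bytes([135]).decode("cp437")
print(alt_130, alt_135)

# The same letter has different byte values in the encodings discussed above.
latin1_e = "é".encode("iso-8859-1")  # one byte
utf8_e = "é".encode("utf-8")         # two bytes
print(latin1_e.hex(), utf8_e.hex())
```

So the number typed on the keyboard says nothing about how the character ends up stored; that depends on the encoding used downstream.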
#6 |
Aug 2002
3×43×67 Posts |
It is interesting to see how the special characters appear when "viewed" by od:
Code:
$ echo "To give a couple of examples: #485 in the TF Top Producers, André Jordi, is showing up as "Andr� Jordi", with a small box where the "é" should be. And #713, Jean-François Nies, is rendered as "Jean-Fran�ois Nies." You get the idea." | od -c
0000000 T o g i v e a c o u p l e
0000020 o f e x a m p l e s : # 4
0000040 8 5 i n t h e T F T o p
0000060 P r o d u c e r s , A n d r
0000100 303 251 J o r d i , i s s h o
0000120 w i n g u p a s A n d r 357
0000140 277 275 J o r d i , w i t h a
0000160 s m a l l b o x w h e r e
0000200 t h e 303 251 s h o u l d b
0000220 e . A n d # 7 1 3 , J e a
0000240 n - F r a n 303 247 o i s N i e s
0000260 , i s r e n d e r e d a s
0000300 J e a n - F r a n 357 277 275 o i s
0000320 N i e s . Y o u g e t t
0000340 h e i d e a . \n
0000351
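The octal escapes in that dump can be turned back into characters with a short Python sketch (the helper name `octal_to_bytes` is made up for illustration):

```python
def octal_to_bytes(*octal_values: str) -> bytes:
    """Convert od-style octal byte values such as "303" into a bytes object."""
    return bytes(int(value, 8) for value in octal_values)

e_acute = octal_to_bytes("303", "251")             # bytes C3 A9
c_cedilla = octal_to_bytes("303", "247")           # bytes C3 A7
replacement = octal_to_bytes("357", "277", "275")  # bytes EF BF BD

for raw in (e_acute, c_cedilla, replacement):
    text = raw.decode("utf-8")
    print(raw.hex(), text, f"U+{ord(text):04X}")
```

The bytes `357 277 275` are the UTF-8 encoding of U+FFFD itself. In other words, the text pasted into the post already contained the replacement character, so the original byte behind it cannot be recovered from the post alone.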
#7 |
Jun 2010
Pennsylvania
1663₈ Posts |
I guess that the questions now are, is it possible to fix this, and what would the fix involve?
Maybe somebody has a way to convert the characters in some automated fashion. Or maybe somebody could volunteer to take guesses at what the special characters are supposed to be, and then feed them to the right person. I'd have to verify, but I'm confident I could get most of them right (though not nearly all without research, about 13/21 from the Top TF list).
Curiously, not all special characters got messed up this way. Check out #1951 on that list:
Code:
Ś�ṇȩł
Rodrigo
Last fiddled with by Rodrigo on 2014-09-08 at 22:48 Reason: typo |
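One way an automated conversion could work, sketched in Python under the assumption that every stored name is either valid UTF-8 or ISO-8859-1 (`decode_name` is a hypothetical helper, not anything the site is known to use):

```python
def decode_name(raw: bytes) -> str:
    """Decode a stored name, trying UTF-8 first and falling back to ISO-8859-1."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Every byte value maps to a character in ISO-8859-1, so this cannot fail.
        return raw.decode("iso-8859-1")

print(decode_name(b"Andr\xe9 Jordi"))              # stored as ISO-8859-1
print(decode_name("André Jordi".encode("utf-8")))  # stored as UTF-8
```

Trying UTF-8 first would leave alone any names that were already stored correctly, which might be why some entries were not affected. The fallback is a guess, though: a name in some third encoding would decode without error and still come out wrong, so a manual check of the results would be wise.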
#8 |
Jun 2010
Pennsylvania
3B3₁₆ Posts |
I see now that the special characters have been fixed -- fabulous!
Rodrigo |
#9 | |
Serpentine Vermin Jar
Jul 2014
D4E₁₆ Posts |
Quote:
The database itself is using varchar for some columns where ideally it should be nvarchar. After looking at the whole picture, though, changing those column definitions and everything else that touches them would be a larger task, so we're hoping to "fix" it for now on the PHP side of things.
As it is, accented characters are being stored and output in ANSI. I wondered what would happen if a Japanese user tried to set their public username to something in Hiragana ... the system probably wouldn't allow it or, if it did, the mix of encodings would produce some bizarre results. So yeah, it'd be good to use proper fields long term. For now I'm just happy that the hourly reports are showing accented western characters properly. |
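To illustrate the Hiragana concern, here is a rough Python sketch. It assumes "ANSI" means Windows-1252, which is a guess about the server's code page, and `fits_in_ansi` is a made-up helper:

```python
ANSI_CODEC = "cp1252"  # assumption: the varchar columns use Windows-1252

def fits_in_ansi(name: str) -> bool:
    """Return True if every character of the name exists in the ANSI code page."""
    try:
        name.encode(ANSI_CODEC)
        return True
    except UnicodeEncodeError:
        return False

print(fits_in_ansi("Jean-François Nies"))  # accented Western letters
print(fits_in_ansi("ひらがな"))             # Hiragana

# A lossy conversion substitutes "?" for each character that has no mapping.
print("ひらがな".encode(ANSI_CODEC, errors="replace"))
```

Under that assumption, a single-byte column has no way to represent the Hiragana name, so it would have to be either rejected or stored with data loss. A Unicode column type avoids the choice.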
|