[Chinese Sorting] Sort by name is messed up

Found a bug in "Everything"? report it here
Post Reply
775405984
Posts: 3
Joined: Tue May 30, 2017 5:32 pm

[Chinese Sorting] Sort by name is messed up

Post by 775405984 » Mon Apr 23, 2018 3:52 am

Hi, my name is Matt and I'm Chinese.

I find this software very useful. But when I type a random word into the search bar, the sorting method is quite messed up.

The files and folders with Chinese names were not sorted alphabetically(Pinyin), instead, they use an outdated method called Stroke order.

No one uses this kind of ancient technique, except someone with zero knowledge of Pinyin.

Some might say you can change the sorting method in Control Panal on Windows. I did, and nothing happens.

So I was wondering if you guys fix it, please!!! Thank you!!!

https://en.wikipedia.org/wiki/Stroke_order
https://en.wikipedia.org/wiki/Pinyin

775405984
Posts: 3
Joined: Tue May 30, 2017 5:32 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by 775405984 » Thu Sep 20, 2018 12:32 pm

were you ever gonna fix this ever? Come on, please.

NotNull
Posts: 919
Joined: Wed May 24, 2017 9:22 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by NotNull » Thu Sep 20, 2018 12:59 pm

775405984 wrote:were you ever gonna fix this ever? Come on, please.
It is number 227 on the to do list

Cut @void some slack ....

Debugger
Posts: 208
Joined: Thu Jan 26, 2017 11:56 am

Re: [Chinese Sorting] Sort by name is messed up

Post by Debugger » Mon Oct 22, 2018 7:37 am

NotNull - such a long list will be realized but in the next century :lol:

Debugger
Posts: 208
Joined: Thu Jan 26, 2017 11:56 am

Re: [Chinese Sorting] Sort by name is messed up

Post by Debugger » Mon Oct 22, 2018 7:40 am

I am most annoyed with the need to introduce the preceding expressions, it is a waste of time for me and not everyone remembers all these.

void
Site Admin
Posts: 4207
Joined: Fri Oct 16, 2009 11:31 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by void » Tue Oct 23, 2018 6:54 am

Currently, Everything sorts filenames by unicode code points, which is completely wrong, but fast!

I have added to my TODO list to support Unicode Collation Algorithm (UCA). Hopefully this will be available in the next release of Everything.

While this is not pinyin, it might be 'good enough'.
Implementing Pinyin at this stage will be unfeasible. The sorting rules are too complex, I can't use third party sorting or the Windows API to sort as these could change at any time and it is critical the Everything database is sorted in a specific way.

I also have concerns about UCA as storing these collation lookup tables will require quite a bit of data.. (128K+)
There is also a small performance hit with adding collation lookup tables.

https://unicode.org/faq/collation.html

Post Reply