This is a new version that allocates a buffer before to sort, using VirtualAlloc.
When using this buffer I don't have to call malloc and to initialize the structure for each call.
Sorting 14130 words takes between 15/20 seconds.
If you have some ideas to be quicker I accept the suggestion.
The words to sort be in UTF-8.