Home Forums Software ELAN Text encoding importing Toolbox lexicon (ELAN 5.4 Mac)

Text encoding importing Toolbox lexicon (ELAN 5.4 Mac)

This topic contains 2 replies, has 2 voices, and was last updated by  Weijian 2 months, 2 weeks ago.

Viewing 3 posts - 1 through 3 (of 3 total)
Author Posts
Author Posts
February 6, 2019 at 01:52 #12733

Weijian

Hi all,

I was importing a toolbox lexicon (.txt, utf-8), but the IPA characters weren’t converted correctly in the .xml produced by ELAN. Converting the source lexicon to utf-16 prior to importing didn’t work either. Has anyone experienced this issue? Any workarounds?

Thanks!

February 6, 2019 at 10:59 #12734

Han

Hi,
Thanks for reporting this, I can reproduce the problem. It appears that in the release the import assumes a system default encoding. The source code for the import function allows for the encoding to be set or selected by the user, but the current import window doesn’t have a drop down menu to specify the encoding yet. And instead of “utf-8” as a default, the releases version has no encoding set (leaving it to the system’s default encoding).

I’ve uploaded a “jar” library with a quick fix, setting utf-8 as the default. Here is the link to this lexiconcomponent-1.5.jar library. If you download it, you can replace the original file with the same name by this one. It is located in a subfolder of the ELAN app folder. Choose Show Package Content from the context menu of the .app folder and then navigate to Contents/Java.
I hope this works.

-Han

February 6, 2019 at 23:54 #12736

Weijian

Works perfectly thank you!

Viewing 3 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic.