Page 1 of 1

[Solved] Getting Thesaurus content

Posted: Sat Aug 08, 2015 8:10 pm
by RedPanda89
Hi everyone,

I am looking for a Thesaurus I can put into my SQL database. Since Open Office has a pretty good thesaurus and is open source I was wondering if there is any possibility to use the content of the thesaurus outside from oo. Does anyone know how I can do that?

Thanks for any ideas.

Re: Getting Thesaurus content

Posted: Sat Aug 08, 2015 8:18 pm
by RoryOF
If you download dict-en.oxt from the OpenOffice extensions repository and open it with an archive manager, you will find files th_en_US_v2.dat and th_en_US_v2.idx inside. You may need to manipulate them slightly put the keywords and suggestions in a form that suits you.

Re: Getting Thesaurus content

Posted: Sat Aug 08, 2015 8:33 pm
by RedPanda89
Thank you so much that's great! I actually just need the synonymes so i have to figure out how I can get the word-synonyme-pair. So I have to take a closer look to the syntax of this file. Do you know if there is a documentation of this?

Re: Getting Thesaurus content

Posted: Sat Aug 08, 2015 8:35 pm
by RoryOF
Perhaps on the Hunspell site as OpenOffice dictionaries are in Hunspell format.

Re: Getting Thesaurus content

Posted: Sat Aug 08, 2015 8:40 pm
by RedPanda89
Thank you again :)