[Solved] Getting Thesaurus content

Discussions about using 3rd party extension with OpenOffice.org
RedPanda89
Posts: 3
Joined: Sat Aug 08, 2015 7:59 pm

[Solved] Getting Thesaurus content

Post by RedPanda89 »

Hi everyone,

I am looking for a thesaurus I can put into my SQL database. Since OpenOffice has a pretty good thesaurus and is open source, I was wondering whether it is possible to use the thesaurus content outside of OpenOffice. Does anyone know how I can do that?

Thanks for any ideas.
Last edited by RedPanda89 on Sat Aug 08, 2015 8:40 pm, edited 1 time in total.
OpenOffice 4.1.1 on Windows 10
RoryOF
Moderator
Posts: 34612
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Getting Thesaurus content

Post by RoryOF »

If you download dict-en.oxt from the OpenOffice extensions repository and open it with an archive manager, you will find the files th_en_US_v2.dat and th_en_US_v2.idx inside. You may need to manipulate them slightly to put the keywords and suggestions into a form that suits you.
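
For anyone who would rather script this step than use an archive manager, here is a minimal sketch. It relies on an .oxt extension being an ordinary zip archive. The function name extract_thesaurus_files and the output directory are made up for illustration, and the th_ prefix filter is simply taken from the two file names mentioned above.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_thesaurus_files(oxt_path, out_dir):
    """Copy every th_*.dat / th_*.idx member of an .oxt archive into out_dir.

    Hypothetical helper: it only assumes the .oxt is a zip archive and that
    thesaurus files follow the th_<locale>... naming seen in dict-en.oxt.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(oxt_path) as archive:
        for name in archive.namelist():
            # Zip member names always use forward slashes.
            base = PurePosixPath(name).name
            if base.startswith("th_") and base.endswith((".dat", ".idx")):
                target = out_dir / base
                target.write_bytes(archive.read(name))
                extracted.append(target)
    return sorted(extracted)
```

Calling extract_thesaurus_files("dict-en.oxt", "thesaurus") should then leave the .dat and .idx files in a thesaurus folder next to the script, assuming the download sits in the current directory.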
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
RedPanda89

Re: Getting Thesaurus content

Post by RedPanda89 »

Thank you so much, that's great! I actually just need the synonyms, so I have to figure out how to get the word-synonym pairs. I will have to take a closer look at the syntax of this file. Do you know if there is any documentation for it?
OpenOffice 4.1.1 on Windows 10
RoryOF

Re: Getting Thesaurus content

Post by RoryOF »

Perhaps on the Hunspell site, as OpenOffice dictionaries are in Hunspell format.
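
In case it helps, here is a rough sketch of pulling word-synonym pairs out of the .dat file and loading them into SQLite. It assumes the layout usually described for these thesaurus files: the first line names the character encoding, and each entry is a "word|N" line followed by N lines of the form "(pos)|syn1|syn2|...". Please check that against the real file before relying on it. The function names and the synonyms table are invented for this example, and the encoding name on the first line must be one Python recognises.

```python
import sqlite3


def read_dat_lines(path):
    """Read a .dat file using the encoding named on its first line (assumed)."""
    with open(path, "rb") as fh:
        encoding = fh.readline().decode("ascii").strip()
    with open(path, encoding=encoding) as fh:
        return fh.readlines()


def parse_thesaurus(lines):
    """Yield (word, part_of_speech, synonym) tuples from the assumed layout."""
    it = iter(lines)
    next(it, None)  # skip the encoding line
    for header in it:
        header = header.rstrip("\r\n")
        if not header:
            continue
        # Entry header: "word|N", where N is the number of meaning lines.
        word, _, count = header.rpartition("|")
        for _ in range(int(count)):
            fields = next(it, "").rstrip("\r\n").split("|")
            pos = fields[0].strip("()")
            for synonym in fields[1:]:
                if synonym:
                    yield word, pos, synonym


def load_pairs(conn, pairs):
    """Insert the tuples into a simple three-column table (hypothetical schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS synonyms (word TEXT, pos TEXT, synonym TEXT)"
    )
    conn.executemany("INSERT INTO synonyms VALUES (?, ?, ?)", pairs)
    conn.commit()
```

Typical use would be something like load_pairs(sqlite3.connect("thesaurus.db"), parse_thesaurus(read_dat_lines("th_en_US_v2.dat"))). Synonym strings are stored exactly as they appear in the file, so any annotations in them would need stripping separately. The .idx file is not needed for a one-off import like this, since the .dat file is read from start to finish.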
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
RedPanda89

Re: Getting Thesaurus content

Post by RedPanda89 »

Thank you again :)
OpenOffice 4.1.1 on Windows 10