พจนานุกรม
/poht ja naa nuu grohm/
Translation: dictionary
Elizabeth writes: Hello everyone! Payap University has officially re-opened, which means Robin is back to work. This month, we thought we’d share with you a project that he worked on recently.
Here in Thailand, one of our next-door neighbors is the nation of Cambodia. Cambodia is a beautiful place, filled with rice paddie and ancient temples, and even though it’s a small country (about the size of Missouri), it has more than 20 different languages! Recently, one of our members in Cambodia was approached by a major university in Phnom Penh, the capital of Cambodia. This university plans to publish an updated and expanded dictionary of the Khmer (Cambodian) language, and they wanted to know if the Language Forge website could help. Language Forge is a website that our organization developed to help create a dictionary for any language — and Robin has done a lot of work on it. Making a dictionary is a long process, but Language Forge has made it a lot easier! The Khmer language already has a Bible translation, but we wanted to help out anyway in order to foster a good relationship with one of the countries where we work. Our partnership with universities in Cambodia gives us access to work with the minority languages of Cambodia which need Bible translation.
To make a dictionary of a written language, you need a lot of text - the more the better. Lexicographers collect books, letters, emails, speeches, stories, sermons, and more, and they make lists of all the words they find. If you have a large variety of sources, you will find a larger variety of words, and you will get a better idea of all the different ways these words can be used. For example, you might already have the word “season” in your dictionary, defined as “one of the four divisions of the year”, but looking at a cookbook might remind you that “season” can also mean to add flavoring to food. Dictionaries also need lots of example sentences to illustrate how a word is used, and they use this collection of texts to find these examples.
The Royal Academy of Cambodia had dozens of Word documents, each with a list of Khmer words. They also had collected 85,000 articles to glean words from - roughly five and a half million words of text in Khmer! But in order to use Language Forge to organize it all into a dictionary, first they had to upload it all to the website. The website is designed to import data from our linguistic analysis software, but not from Word documents — and they didn’t want to have to retype everything into the website!
Fortunately, Robin could help. He quickly wrote a custom program to take the text from all these different Word documents and change it to a format that could be instantly uploaded to the website. It saved them weeks of work! And now the Khmer dictionary team can use Language Forge to create a quality dictionary.
|