We focus on FREE translation memory tools. While CAT tools such as TRADOS and Wordfast are widely used in the translation industry, free open source softwareTM can be used as an alternative to mainstream CAT tools.
We select two software called OmegaT TM and Anaphraseus TM and analyze their advantages and disadvantages.
OmegaT is a CAT tool whose features include user-customizable segmentation using regular expressions, translation memory, approximate matching, match propagation, glossary matching, translation memory context search, and keyword search.
in reference materials.
OmegaT uses the international standard TMX (Translation Memory eXchange) as its native translation memory format that can be imported and exported by most CAT tools. TMX is an open XML standard that allows easier exchange of MT between translators with different application tools.
Although it lacks features compared to other CAT tools, OmegaT is fast at segmenting a source file, compiling target files (known as “cleaning”), and approximate matching them. In its previous version, OmegaT segments the source file by paragraph, instead of sentences. You can now select segmentation at the paragraph or sentence level. By using the feature called Edit Behavior in Options, you can have the option to display in the source text or the best fuzzy match. This is a CAT tool worth trying.
OmegaT includes the following features:
– Approximate match
– Party spread
– Simultaneous processing of projects from multiple files
– Simultaneous use of multiple translation memories
– External glossaries
– Document file formats include:
XHTML and HTML
Microsoft Office 2007
– Unicode support (UTF-8): can be used with non-Latin alphabets
– Support for languages from right to left
– Compatible with other translation memory applications (TMX)
Some third-party programs that can be used with OmegaT are OpenOffice.org, Rainbow, bitext2tmx, and Samuel Murray’s scripts and how-tos.
Download site: http://www.omegat.org/en/downloads.html
Anaphraseus is a CAT tool to create, manage and use bilingual translation memories. Some of its main features include text segmentation, terminology recognition, plain text TM (Unicode UTF-16), TM fuzzy search, UTF-16 TMX export/import, user glossary, and OpenOffice. organization extension.
Anaphraseus works in OpenOffice.org as an extension and works seamlessly with TM created in Wordfast. You can install it using the Extension Manager under Tools on OpenOffice.org. During the translation process, Anaphraseus breaks the translated text into segments or sentences. When a segment is selected, Anaphraseus displays the closest match found in the TM.
As an extension to OpenOffice.org, it allows you to use all the functions of the word processor “on the fly”. When it comes to performance, it is slower than Wordfast, especially when handling large files. Anaphraseus works with translation memory in TMX format, which allows it to work with the majority of CAT tools on the market.
Anaphraseus includes the following features:
– Text segmentation
– Recognition of terminology
– Plain TextTM (Unicode UTF-16)
– Fuzzy search in the translation memory
– Unicode UTF-16 TMX export/import
– User Glossary
Since Anaphraseus is still in beta testing, it currently only works with plain text, omitting all formulas, images, and similar objects. If you need a CAT tool that costs nothing and works with OpenOffice.org, Anaphraseus is worth a try.
Download site: http://anaphraseus.sourceforge.net/
Professional translators talk about their experience with translation memories in the translator column of our monthly newsletter at http://www.synergy-focus.com
*List of translation jobs available in the newsletter edition