WORDFAST CLASSIC

USER MANUAL


Version 7.xx ~ All rights reserved
Ⓒ 1999-2019, Yves Champollion

Quick start

Double-click wordfast.dot to install WFC, or see the installation section. This is done only once.

Click the icon to expand the toolbar, then click that same icon again to open the main Wordfast dialog box:

Wordfast (c:\wordfast\wordfast.ini)
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT

This TM is active

Current TM: C:\wordfast\NewTM-EN2FR.txt
Number of TUs: 234
File size: 28 Kbytes
Date: 2016-01-15 at 11:10:26
Source language: EN-US (English, Latin-1)
Target language: FR-FR (French, Latin-1)

Click the "New TM" button to create a new translation memory. You will be prompted for the TMX-compliant codes of the source and target languages used in your TM. Once WFC has created the TM (which is an Ms-Word document in text-only format), you will be prompted to name and save it. Close the WFC dialog box; you are ready.

A basic translation session consists of two steps.

1. Translation & proofreading

Open the document to be translated, then press Alt+Down or click the Next Segment icon. The first source segment appears against a blue background. Type your translation in the target segment, which is the lower (green, yellow, or grey) box:

Document1.docx - Microsoft Word
File Edit View Insert Format Tools Table ?

This is my first translation.

Voici ma première traduction.

Press Alt+Down, or click the Next Segment icon, to validate/close the current segment and open the next one, then translate it. Translate the entire document that way. To pause or end translation, press Alt+End (or click the corresponding icon), save the document, and close Ms-Word if needed. To resume translation, click anywhere in the document and press Alt+Down.

Proofreading: to edit a segment, place the cursor anywhere in that segment, press Alt+Down to open it, edit it, then move to the next or previous segment (Alt+Down or Alt+Up), or close the segment (Alt+End).

2. Clean-up

When proofreading is complete, click the Cleanup icon.

You're done! The translated document can be delivered. There is more to learn, but the essentials are covered.

Introduction

Wordfast Classic (WFC) is a Computer-Aided Translation (CAT) program designed as a Microsoft Word™ (hereafter written Ms-Word) add-on. Its primary purpose is to assist professional translators dealing with Ms-Word documents. WFC combines three core technologies: Segmentation, Translation Memory (TM), and Terminology Recognition (TR). Readers who are not familiar with these concepts can read Appendix I for an overview.

WFC offers advanced terminology functions: three simultaneous glossaries, concordance search in unlimited numbers of TMs, reference search in various documents formats, links to various terminology databases, etc. The client's critical terminology can easily be entered in a WFC glossary, usually by copy-pasting; all segments will be checked for terminology consistency during the translation process.

WFC includes real-time Quality Assurance functions that include a typography checker, a terminology compliance checker, etc. Documents can be verified in batch mode so that project managers can have a detailed report on the typography/terminology quality of the documents they receive after translation.

WFC's TM format is open - it can be viewed or edited with Ms-Word™, Excel™, Access™ and many other popular programs. Furthermore, WFC can read from or write into the TMX format used by other translation tools like TWB (Trados Translator's Workbench™), DéjàVu™, Star Transit™, SDL Suite™, MemoQ™, etc.

All this power is packed into a compact Ms-Word template, for Ms-Word versions 2000 to 2016, as well as Ms-Word for Mac 2011. An unlimited number of users can share the same translation memory and/or background memory over a local area network. WFC can also be linked to a Machine translation (MT) program or server (locally or through a network) to provide MT when no match is found in the TM.

Translation managers can develop project-specific extensions to meet specific requirements thanks to Ms-Office's programming platform (VBA) used by WFC.

Our hope is that this tool will help you increase productivity and provide a better work environment.

The Wordfast Team
www.wordfast.net

Disclaimer (must read)

If you start using WFC, take time to get acquainted with it before engaging in large or complex projects. Make sure you test WFC on your system and are aware of its limitations. Not all projects or documents can be translated with WFC. There is a level of document complexity (layout, formatting, embedded objects, dynamic documents linked to sophisticated templates, documents with very large tables, etc.) beyond which WFC, and any CAT tool, will give up. The more a document acts like a program (questionnaires, surveys, documents with fields and automation), the less it is fit for translation using WFC. Ms-Word documents that are formatted to mimic another application, for example by using many textboxes to look like an Excel spreadsheet or a PowerPoint presentation, will be difficult to handle. Those include PDF conversions, OCRed material, documents that contain numerous textboxes, forms and input fields, so-called "Wordart" shapes, callouts, etc.

Size matters. Very large or complex documents with over 100 pages of text can be prepared for translation, for example, by splitting them into smaller, more manageable sub-documents, removing large graphics, etc.

When you consider starting a translation project where there are formats or layouts you have not handled before, test-drive WFC on sample paragraphs to make sure it will behave correctly before accepting the job. Proceed with caution when there are technicalities you are not comfortable with.

WFC does not replace the translator's ability to handle documents. Like all tools, WFC requires skills - it does not replace them.

The download page of the WFC website (www.wordfast.net) has illustrated, step-by-step training guides for beginners. Do not attempt to use WFC unless you have learned the basics with the level 1 training guide. The short instructions for use further below assume you're already comfortable with basic Ms-Word operation.

Do not print this manual

As with most electronic documents, you can quickly find information in this manual's 100+ pages using the standard Ctrl+F (Find) shortcut.

This is why printing the manual is not a good idea: you will find information in an electronic document much faster than by flipping through a hundred pages. This manual is also kept up to date continuously, unlike a paper copy.

We are reluctant to answer hotline calls if the answer is easily and obviously found in the manual, or if the problem is strictly related to the use of the Operating System and/or Ms-Word rather than WFC.

Technical specifications

Translation memory (TM)

Size: up to 1,000,000 Translation Units (TU) per single TM. A TM is a simple Text file. You can create as many TMs as you want, in as many languages as you want, then enable/disable them as required. Note that two TMs can be used simultaneously: the main TM in read-write mode, and a Background TM (BTM), in read-only mode. WFC offers a tool called the Data Editor to create, split, merge, join, edit, maintain TMs.

Format: WFC uses TMs in either plain text format (ANSI), or Unicode text format (UTF-16 only, on Mac or PC). The Unicode format is the default, and preferred, format. The WFC TM format is open, straightforward, easy to read, maintain, share, and store. It is fully described in this manual in the WFC TM format section. Most text editors (the deceptively diminutive Notepad in Windows, or TextEdit in OSX) can open a WFC TM for viewing, editing, merging, proof-reading etc. Ms-Word can be used to manage TMs, but WFC itself offers tools to edit TMs.
This format guarantees robust data, superior versatility and compatibility. It is probably the most compact format in the industry: a WFC TM is typically three to four times less bulky than its competitors. This is important when vast collections of TMs are considered.
The WFC TM format has never changed: WFC versions 1, 2, 3, 4, 5, 6 all share the same format, while other TM solution editors keep changing their formats, regularly breaking compatibility. WFC's format is extensible for future features without destroying compatibility with previous versions. This is rarely seen in the TM industry.
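As a rough illustration of the two text encodings named above, the following Python sketch guesses whether the raw bytes of a TM file are UTF-16 or ANSI by looking for a byte-order mark. The function names are hypothetical, and treating "ANSI" as Windows code page 1252 is an assumption; neither is part of WFC.

```python
import codecs


def detect_tm_encoding(raw: bytes) -> str:
    """Guess the text encoding of raw TM file bytes."""
    # A UTF-16 text file normally starts with a byte-order mark (BOM).
    if raw.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"
    # Assumption: anything without a UTF-16 BOM is a legacy "ANSI" file,
    # decoded here as Windows code page 1252 (Western European).
    return "cp1252"


def read_tm_text(raw: bytes) -> str:
    """Decode raw TM file bytes into a Python string."""
    return raw.decode(detect_tm_encoding(raw))


# Example: the same sentence stored in both formats decodes to the same text.
unicode_bytes = "Voici ma première traduction.".encode("utf-16")
ansi_bytes = "Voici ma première traduction.".encode("cp1252")
print(detect_tm_encoding(unicode_bytes), detect_tm_encoding(ansi_bytes))
print(read_tm_text(unicode_bytes) == read_tm_text(ansi_bytes))
```

A tool that processes TMs outside Ms-Word would typically run a check of this kind before reading the file, so that accented characters survive intact.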

Compatibility: WFC can read and write (import from, export to) the TMX format with minimal losses.

TM engine performance: The WFC TM engine is built to spot exact and/or fuzzy matches in less than half a second in most cases.
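To make the idea of exact and fuzzy matches concrete, here is a minimal Python sketch that scores a new segment against stored source segments. It uses the standard library's difflib similarity ratio, which is an assumption chosen for illustration and not WFC's actual matching algorithm; the function name and the 0.75 threshold are likewise hypothetical.

```python
from difflib import SequenceMatcher


def best_match(segment, tm_sources, threshold=0.75):
    """Return (score, source) for the closest stored source segment.

    A score of 1.0 is an exact match; lower scores are fuzzy matches.
    Returns None when nothing reaches the threshold.
    """
    best = None
    for source in tm_sources:
        score = SequenceMatcher(None, segment.lower(), source.lower()).ratio()
        if score >= threshold and (best is None or score > best[0]):
            best = (score, source)
    return best


tm_sources = [
    "This is my first translation.",
    "Here is a segment with a red frame.",
]
print(best_match("This is my first translation.", tm_sources))   # exact match
print(best_match("This is my second translation.", tm_sources))  # fuzzy match
print(best_match("Completely unrelated text", tm_sources))       # no match
```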

Integration: The WFC TM engine is totally integrated in Ms-Word: you don't need to run another application.

Networking: An unlimited number of users can share the same TM over a LAN (Local Area Network) or over the web using Wordfast Server.

Glossaries

WFC can use up to three simultaneous glossaries.

Size: the size of a glossary in WFC has been deliberately limited to 250,000 entries. Most project-specific glossaries supplied by clients have far fewer than 10,000 entries - closer to 1,000 for most. Important note: glossaries are meant to assist a translator with uncommon terminology, like technical jargon, that (s)he arguably may not know. Filling up glossaries with common words is a bad idea: it has consistently proved to decrease the tool's efficiency.

Format: like the TM format, the glossary format is plain text (Unicode or not), tab-delimited. It is therefore easy to feed terminology into a WFC glossary by simply copy-pasting it from a client's glossary, combine glossaries, etc.
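Because the glossary is tab-delimited plain text, it can be read or combined with a few lines of code. The Python sketch below assumes that each line holds a source term, a tab, and a target term; that column layout, and the function names, are assumptions made for illustration.

```python
def load_glossary(text):
    """Parse tab-delimited glossary text into a {source_term: target_term} dict."""
    glossary = {}
    for line in text.splitlines():
        fields = line.split("\t")
        # Skip blank or incomplete lines.
        if len(fields) >= 2 and fields[0].strip():
            glossary[fields[0].strip()] = fields[1].strip()
    return glossary


def merge_glossaries(*glossaries):
    """Combine glossaries; entries from later glossaries override earlier ones."""
    merged = {}
    for glossary in glossaries:
        merged.update(glossary)
    return merged


client_terms = "balancer shaft\tarbre compensateur\nlanding gear\ttrain d'atterrissage\n"
own_terms = "translation memory\tmémoire de traduction\n"
combined = merge_glossaries(load_glossary(client_terms), load_glossary(own_terms))
print(len(combined), combined["landing gear"])
```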

Features: WFC glossaries offer a full range of services, from querying a term or expression, to full-fledged terminology recognition that highlights known terms in the source segment in real-time.

Fuzzy terminology recognition: WFC can recognize exact or fuzzy terminology in glossaries. Glossaries can be used as they are, or fine-tuned with the use of wildcards to meet special requirements.

Integration: WFC glossaries are totally integrated in Ms-Word - you don't need to run another application.

Networking: An unlimited number of users can share the same glossaries over a LAN (Local Area Network) or over the web using Wordfast Server.

Supported languages & scripts

WFC can be used to translate any of the languages supported by Ms-Word. Scripts, rather than languages, are actually what WFC and Ms-Word handle - supported scripts include European, Latin-based scripts, Chinese/Japanese/Korean, right-to-left scripts (Arabic, Hebrew, etc.), Cyrillic, in addition to Central European, Greek, various forms of Hindi, etc.

Document format

WFC uses Ms-Word as translation editor, thereby taking all formats recognised by Ms-Word. WFC can handle files that have been tagged. WFC is compatible with the "tagged" format produced by RWS Rainbow, Trados Stagger etc., so WFC can easily be integrated in a Trados-based architecture to translate tagged files for FrameMaker, SGML, Quark Xpress, PageMaker, InDesign, etc. Note that the translation of tagged files requires great skills, and attention to minute details.

System requirements

WFC operates smoothly on any system that comfortably runs Ms-Word 2000 or higher (for Windows or Linux+Microsoft Office) or Ms-Word 2011 (Mac OSX).
Supported Ms-Word versions are: Word 2000, Word 2002 (a.k.a. Word XP), Word 2003, Word 2007, Word 2010, Word 2013, Word 2016. The only supported Mac version is Word 2011. The better Ms-Word works (in speed and reliability), the better WFC works. Even a modest 120-MHz PC/Windows running Ms-Word 2000 will do fine.

Installation & maintenance

Automatic installation

In Ms-Word's File/Options/Security Center or Tools/Macros/Security dialog box, set security to "low". If you see a "Trusted sources" pane, check all options present in it. Then close Ms-Word and open it again before automatic installation can be done.

Mac Word 2011: Try the automatic installation by double-clicking wordfast.dot. If automatic installation fails, drop Wordfast.dot in a folder of your choice (preferably not the desktop -- create a new folder from the root where wordfast.dot will be dropped; make sure that neither the path name nor the file name uses accented characters or Unicode characters: stick to the 26 unaccented Latin letters for the hard disk name and folder name). In Word 2011, use the Tools menu then the Templates & Add-Ins dialog box to open (select) Wordfast.dot.

Users of systems using Unicode, like Chinese, Japanese, Korean, Russian, Central European languages, Arabic, Hebrew, etc., should read the note on using Latin-character path (folder) names and file names.
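The advice to keep path and file names free of accented or non-Latin characters can be checked mechanically before installing. This Python sketch treats "safe" as "plain ASCII only", which is a slightly broader rule than unaccented letters alone (it also allows digits and punctuation); the function names are hypothetical.

```python
def has_only_plain_characters(path: str) -> bool:
    """True if the path contains no accented or non-Latin (non-ASCII) characters."""
    return path.isascii()


def offending_characters(path: str) -> list:
    """List the characters that would make the path unsafe, in order of appearance."""
    return [ch for ch in path if ord(ch) > 127]


for candidate in (r"C:\wordfast\Wordfast.dot", r"C:\Téléchargements\Wordfast.dot"):
    print(candidate, has_only_plain_characters(candidate), offending_characters(candidate))
```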

Windows: To perform an automatic installation, start Ms-Word, open the Wordfast.dot template using Ms-Word's File/Open dialog box (as when opening regular documents), enable macros if prompted to do so, and press Ctrl+Alt+W.

Users of systems using Unicode, like Chinese, Japanese, Korean, Russian, Central European languages, Arabic, Hebrew, etc., should read the note on using Latin-character path (folder) names and file names.

Automatic installation is the only case when you actually open Wordfast.dot as a document. After installation, Wordfast.dot has been added as a startup template and resides in Ms-Word's Files/Options/Word Templates dialog box. Wordfast.dot does not need to be opened as a document again.

Manual installation

Manual installation should be used if the automatic installation fails. When performing a manual installation, Wordfast.dot should not be opened as a document, but added to Ms-Word's list of templates, as follows:

Close Ms-Word. Copy the file Wordfast.dot into your Ms-Word Startup folder. Here are the typical locations for such folders (yours may be different):

Ms-Word 97 (all systems):
 \Program files\Microsoft Office\Office\Startup

Ms-Word 2000 and above:
 Windows 9x: \Windows\Application Data\Microsoft\Word\Startup
 Windows NT: \WinNt\Profiles\User name\Application data\Microsoft\Startup
 Windows 2k, XP: \Documents and settings\User name\Application data\Microsoft\Word\Startup

Windows Vista, Seven, 8, 10:
 \User name\AppData\Roaming\Microsoft\Word\Startup

MacIntosh:
 Applications:Microsoft Office X:Office:Startup:Word

Note that the exact location of your Startup folder is given by Ms-Word in the Tools/Options/Default folders (or Preferences/Default folders on Mac versions) dialog box. If no startup folder is specified in this dialog box, please specify one. Ms-Word must have a startup folder for add-ons to be loaded at startup time. If you cannot see your Ms-Word Startup folder in your hard disk, see the note below on hidden folders.
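For the Windows Vista and later layout listed above, the typical Startup folder can be derived from the user's roaming application-data folder. The Python sketch below builds that path; the function name is hypothetical, and the example assumes the roaming folder sits under C:\Users, which may differ on your system. The location reported by Ms-Word itself remains the authoritative one.

```python
from pathlib import PureWindowsPath


def typical_word_startup_folder(roaming_appdata: str) -> PureWindowsPath:
    """Build the typical Ms-Word Startup folder path below the roaming AppData folder."""
    return PureWindowsPath(roaming_appdata) / "Microsoft" / "Word" / "Startup"


# On a real Windows system, the roaming folder is usually available as the
# APPDATA environment variable, e.g. os.environ["APPDATA"].
startup = typical_word_startup_folder(r"C:\Users\User name\AppData\Roaming")
print(startup)
print(startup / "Wordfast.dot")  # where the template would be copied
```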

For systems using Unicode, like Chinese, Japanese, Korean, Russian, CE, Arabic, Hebrew, etc., users should read the note on using latin-character path (folder) names and file names.

Open Ms-Word. If you do not see the WFC icon, use Word's View/Toolbars menu to check the WFC toolbar. If this menu does not have a "WFC" option, see below.

Some systems have drastic Read/Write restrictions on system folders. Some anti-virus software, or strict network administrators, may impose such restrictions, making it impossible to add startup templates and add-ins to Ms-Word, for fear of macro-viruses. Although this fear and restriction may be legitimate (most network administrators and antivirus packages don't use such restrictions and live happily), it makes WFC operation impossible. To solve this problem, create a folder in an unprotected part of your hard disk (anywhere you can create a folder is unprotected). Then, in Ms-Word, use the Tools/Options menu, then "Default folders", to assign the folder you just created as startup folder. Copy Wordfast.dot into this folder, close and restart Ms-Word. The bottom line is: WFC is entirely written in Ms-Word macro language, and if your network administrator, or your antivirus, refuses the installation of any macro-based program, then running WFC is impossible.

Another manual installation

If the above method fails, start Ms-Word. Use Ms-Word's Tools/Templates & Add-Ins dialog box. Click the "Add" button, then find and add Wordfast.dot. Note that every time you start Ms-Word, you will have to use the same dialog box and check the "WFC" template. Manual installation is used on Macs.

If neither automatic nor manual installation works
1. Create a new folder at the root of your hard disk, for example C:\WFC or C:\Program files\Wordfast. Copy Wordfast.dot into that newly-created folder.
2. Start Ms-Word. Use Tools/Options/Default folders, click the Startup folder, then click the Change button and specify the newly-created folder so it becomes the new Startup folder. Close and re-start Ms-Word. If the WFC toolbar does not appear, use the View/Toolbars menu to activate it.
3. If step 2 still does not work, in Ms-Word, use the Tools/Templates & Add-Ins dialog box and click the "Add" button to add the Wordfast.dot template just copied in step 1. If the WFC toolbar does not appear, use the View/Toolbars menu to activate it.

Note 1: If you have difficulty locating Ms-Word's Startup folder: start Ms-Word, see Tools/Options then Default folders. Make a note of the startup folder's full name.
Note 2: If, at any time, Ms-Word asks you whether you want to "save" changes made to the WFC template, answer no. The WFC template should stay unchanged.
Note 3: Having Wordfast.dot in Startup will activate WFC every time Ms-Word is started. If Wordfast.dot is copied into Templates, you will have to open the Tools/Templates dialog box, click the Add button, select Wordfast.dot and press OK. You should never open Wordfast.dot as a document.
Note 4: Ms-Word 2000 or above: use the Tools/Macro/Security menu to set the security level to low, then restart Ms-Word.
Note 5: Ms-Word 97 users: see the troubleshooting section on Ms-Word 97.
Note 6: If you have two different versions of Ms-Word on the same hard disk, have two copies of Wordfast, one in each "Startup" or "Templates" folder for each version of Ms-Word. This way, each Wordfast.dot will have its own INI file, where its own license number will be kept. Apply twice to receive a license number for each version of Ms-Word, since each version of Ms-Word will make WFC produce a different Install number.

Click the multi-coloured WFC icon. If the multi-coloured WFC icon does not appear, use the Tools/Templates & Add-ins menu. In the Templates dialog box, click the Add button, find Wordfast.dot on your hard disk and open it. Close the Templates dialog box.

A toolbar should expand:

Note that the default toolbar, as it appears after a fresh installation, only has a few icons. More icons can be enabled using the WFC > Setup > UI dialog box.

Removing WFC

Automatic removal

Start the main WFC window by clicking the last icon in the toolbar, or press Ctrl+Alt+W. In the last tab (the "About Wordfast" tab, with a question mark ? ), click the Remove WFC button. Only the program (wordfast.dot) is removed. Ancillary files (setup, etc.) are removed on request.

Manual removal

Close Ms-Word. Using your system's file search utility (Windows: "Windows" key + F) search for WORDFAST.* then delete all WFC files that appear. You're done.

WFC does not modify your system in any way, does not add/remove entries to your registry base, does not add/remove fonts, does not create hidden files for protection or for hidden purposes, does not add/delete folders, does not add/remove DLLs etc. Thus, all it takes to un-install WFC is to delete wordfast.dot.
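The WORDFAST.* search used for manual removal can be sketched in Python as follows. This version only lists candidate files and deletes nothing; the function names are hypothetical, and you should review the list before removing anything by hand.

```python
import fnmatch
import os


def is_wfc_file(name: str) -> bool:
    """True if a file name matches WORDFAST.* regardless of letter case."""
    return fnmatch.fnmatchcase(name.lower(), "wordfast.*")


def find_wfc_files(root: str) -> list:
    """Walk the folder tree below root and return the matching file paths, sorted."""
    matches = []
    for folder, _subfolders, files in os.walk(root):
        for name in files:
            if is_wfc_file(name):
                matches.append(os.path.join(folder, name))
    return sorted(matches)


print(is_wfc_file("Wordfast.dot"), is_wfc_file("wordfast.ini"), is_wfc_file("NewTM-EN2FR.txt"))
```

Calling find_wfc_files on the folder that contains Ms-Word's Startup folder would list the template and any ancillary files stored alongside it.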

Important note
Operating systems have hidden or system folders, and Ms-Word's Startup folder may be located in a hidden folder (perhaps like C:\Documents and Settings\...). If this is the case, set your Windows Explorer or your File search utility to browse and display hidden or system folders. To do so in Windows Explorer or in Windows' File search utility, use the Tools/Folder Options menu, then View then Hidden files and folders and make hidden files and folders visible. Other systems may have different methods for making hidden files and folders visible to the file explorer.

Upgrading WFC

Download the most recent version of WFC from www.wordfast.net. Proceed as if it were a first installation.

Manual upgrade

Repeat the manual installation procedure, or:

You may wish to actually rename your previous Wordfast.dot (to Wordfast.old, for example) so that you may fall back on it if you need.

Regularly visit www.wordfast.net or www.wordfast.com to make sure you are using the latest version. Upgrade to newer versions of WFC preferably between jobs, when you are not under pressure, unless you really need a new feature found only in a newer version.

Buying a license

An unlicensed copy of WFC is randomly limited to approximately 5,000 Translation Units. Note that without a valid license, WFC accepts larger translation memories, but it will only index the last 5,000 TUs.
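The effect of the unlicensed limit on a large TM can be pictured as keeping only the tail of the list of TUs. The Python sketch below uses a fixed limit of exactly 5,000, whereas the manual describes the real limit as approximate; the names are hypothetical and this is not WFC's own code.

```python
UNLICENSED_TU_LIMIT = 5000  # assumption: the real limit is only approximately 5,000


def indexed_units(translation_units, licensed: bool) -> list:
    """Return the TUs that would be indexed: all of them, or only the last 5,000."""
    units = list(translation_units)
    if licensed:
        return units
    return units[-UNLICENSED_TU_LIMIT:]


tm = [f"TU {number}" for number in range(1, 6001)]  # a TM with 6,000 TUs
trial = indexed_units(tm, licensed=False)
print(len(trial), trial[0], trial[-1])
```

In this example the oldest 1,000 TUs remain in the file but are not searched until a license is present.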

You will find FAQs on this topic here.
You will find the WFC End User License Agreement here.

You must have downloaded and installed WFC before buying a license. You must have tried WFC on your system before you decide to buy it.

After buying a license to use WFC, you receive a set of credentials (an email address and a password). With those, log in at www.wordfast.net, then proceed to the WFC download page, and download a registered version of WFC.

If your computer is connected to the Internet, a registered WFC should automatically recognize whether it is licensed or not. This automatic licensing method works even if your internet connection is intermittent. WFC needs to connect to the centralized license server at www.wordfast.net once every fortnight.

If automatic licensing is successful, the ? (About Wordfast) tab in WFC's setup window displays an obfuscated view of the email address you provided when buying a license. After four different installations or re-installations in the same year, we require a simple email communication to the WFC hotline. Simply state why there were four installations in the same year on four different disk drives. The reasons can be that you moved to four new or different computers in the same year, or that you had to format your hard disk frequently. Your counter will be reset, and you will be able to license WFC again for other drives. The End User License Agreement can be consulted at http://www.wordfast.net/?go=agreement. Licenses can be used only by the buyer, and cannot be given, sold, or transferred to any third party.

During the automatic internet licensing, WFC sends the license server an encrypted string of characters that contains WFC's unique serial number, as well as a so-called "Install number" which reflects a few non-confidential, anonymous characteristics of the local hard disk. Here is an example of what that encrypted string looks like:

09876543azKm�%*�;hgs(=q+Q?,wjk.m=@

The license server decrypts the identifier, verifies the corresponding license validity, and sends back a yes/no reply. No confidential or personal information (such as names, email address, location, etc.) is sent over the internet. If no internet connection is found, WFC runs in full mode to make sure you can work.

If you intend to use WFC for professional activity, do not wait until the last minute to buy a license, as this process may take a few hours to complete with a credit card, or a week by bank wire, cash transfer, check, etc.

The entire WFC application is contained in one single template (Wordfast.dot) and this file is the same for all platforms (PC/Windows, Mac, Linux, etc). You can check www.wordfast.net from time to time or join a mailing list (see the community page in the website) to see if an upgrade has been released.

Disclaimer

The author or distributor(s) of WFC do not accept any liability for the use or misuse of WFC. When buying a license, users recognise they had sufficient time to try and test WFC on their particular system and are willing to use it as it is, however imperfect WFC may be. Specifications outlined in this manual may be changed at any time without prior warning, and are not binding.

Detailed instructions for use

Essential icons and shortcuts:

Expand (Alt+PgDn) expands a segment, when the sentence actually extends beyond a final punctuation mark. Note that a segment cannot be extended beyond a paragraph mark, page break, tabulator or table cell.
Shrink (Alt+PgUp) reverses any use of the Expand segment command.
Copy Source (Alt+Ins) copies the source segment over the target segment.
Translate Translates until a non-exact match is found.
Concordance (Ctrl+Alt+C) scans the BTM & TM and displays all TUs containing a specific word. By default, the search for concordance is done in the TMs source segments. However, if, during a translation session, the selected expression is in the target segment, WFC will search for concordance in the TMs target segments.
Reference (Ctrl+Alt+N) scans the files located in the folder specified with Terminology/Reference search folders to retrieve and display reference material.
Dictionary1 (Ctrl+Alt+D) looks up a word/expression in the currently active external dictionary#1.
Dictionary2 (Ctrl+Alt+F) looks up a word/expression in the currently active external dictionary#2.
Glossary (Ctrl+Alt+G) looks up a word/expression in glossaries.
Memory (Ctrl+Alt+M) displays the contents of the relevant TU above a proposed segment.
Quality assurance (Shift+Ctrl+Q) toggles real-time QA on/off during translation.
Quick-clean (Ctrl+Alt+Q) cleans up a document without updating the memory (the real, full clean-up is performed from WFC's Tools tab). Quick-clean can be used if you revised the document by re-opening segments, so that changes are recorded in the TM. If WFC proposes to process bookmarks without cleaning up the document, see the note on Bookmarks.
Data editor lets you browse, edit, and maintain your TM, BTM, glossaries, and a few other resources used by WFC.
Ctrl+Alt+X Deletes the contents of the target segment.
Ctrl+Alt+Ins Copies the source segment's text attributes/style to the target segment. This is useful if, on an opened segment, you have pasted text that has a different font or style.
Shift+Alt+Down Forces WFC to either segment the text you selected (a selection of text is made), or re-segment from the cursor position (no selection of text is done). The selected text, or the cursor can be outside (usually after) the current segment, or inside the source segment. If the cursor is inside the target segment when you press Shift+Alt+Down, WFC will close (and submit, or "save") the current segment as if you pressed Alt+Down, skip the next segment, and open the second segment it finds. If Shift+Alt+Down is double-pressed, the behaviour is the same, except that WFC will skip the next paragraph rather than just the next segment.
Shift+Ctrl+G Loads the glossaries into the toolbar, if their size is less than 200 Kbytes. That's practical only for pre-2007 Word versions.
Alt+Up Can be used to return to the previous segment.
Alt+Right/Left If more than one match was found in the TM, this shortcut will cycle through all TUs, inserting the target text.
Ctrl+Alt+Left/Right/Down Selects the next/previous placeable or glossary term (in the source segment); Ctrl+Alt+Down pastes the selected placeable at the position of the cursor (in the target segment). A placeable (an untranslatable element) is simply copied "as is"; with a glossary pair, the target term is pasted.
F10 Marks a segment as provisional. Read the note on provisional segments for this important feature.
Ctrl+Comma Toggles hidden text on/off. This lets you "preview" the final translation, then get back to full view. All editing, spell-checking, revision, etc should be done in "full view", i.e. with hidden text visible.
Alt+F12 Copies any selection of text (from any Ms-Word document) into the current target segment, if a session is opened. If, in the target segment, the selection has a zero length (it's just an insertion point), the selected text will be pasted at the insertion point. If the selection has any length, or if the selection (or insertion point) is outside the target segment, the text will be pasted at the end of the target segment. If the newly pasted text has a format or style that is different from the target segment's general style, remember that the Ctrl+Alt+Ins shortcut can copy the source segment's style and format to the target segment.

Back up your original (source) document before translating it.

Alt+Home can be used to resume a translation session. It will re-open the segment that was last closed. If Alt+Home is used when the last document is not yet opened (as when you simply launch Ms-Word), it will re-open the last closed document, and resume translation at the last closed segment.

Beside Alt+End (validate + close the current segment, and end session), there are two other ways of closing the current segment and ending a session:

Shift+Alt+End Closes the current segment without writing (committing) it into the TM.
Alt+Delete When a segment is opened (the source segment appears against a pale blue background), this shortcut deletes the contents of the target segment, then closes the segment (and the session) and restores the source segment as it was before segmentation.
When no segment is opened, unsegments the entire document (returns the document to a source-text-only state).

Notes
If you wish to exclude some portions of the document from the translation process: create a new style named "tw4winExternal". Apply that style to untranslatable portions of the document. Another, simpler way, is to choose one text attribute in Ms-Word's Format/Font dialog box (either Font > DoubleStrikeThrough, Gray highlight or, on older versions of Ms-Word, Animation/Marching Red Ants), apply it to the untranslatable text, then check the corresponding option in Wordfast > Setup > Segments > "Use ... as untranslatable attribute" option. DoubleStrikeThrough is recommended.

Segments with a red frame

When opening an already-segmented segment, where the content of the target segment does not correspond 100% to the one in the TM, the target segment will be framed in red. This can occur if you translate a document, then manually edit typos in some target segments (without opening the segments with WFC for editing - manually here means you directly edit the text in Ms-Word). The red frame around the target segment indicates that the target text has been changed since the time the segment was originally written into the TM.

Document1.docx - Microsoft Word
X

Here is a segment with a red frame.

Voici un segment encadré en rouge.

Pressing Ctrl+Alt+M and inspecting the target segment thus displayed against a gray background should bring up the difference. In the example below, the French "entouré" in the TM does not match "encadré" in the target segment.

Document1.docx - Microsoft Word
X
TM (100%) - 2016-03-14 11:23:18 by YC ~client: EU-DEP ~ domain: IT
Here is a segment with a red frame.
Voici un segment entouré en rouge.
 

Here is a segment with a red frame.

Voici un segment encadré en rouge.

Committing the red-frame segment (moving out of it with Alt+Down or properly ending the session with Alt+End) will stamp the corrected version of the entire TU into the TM, and erase the existing TU which caused the red frame. When re-opening the segment, the red frame should be gone.
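The red-frame rule amounts to comparing the target text in the document with the target text stored in the TM for the same source segment. The Python sketch below models the TM as a simple dictionary, which is an assumption for illustration only; the function names are hypothetical.

```python
def needs_red_frame(source: str, target_in_document: str, tm: dict) -> bool:
    """True if the TM holds a target for this source that differs from the document's."""
    tm_target = tm.get(source)
    return tm_target is not None and tm_target != target_in_document


def commit_segment(source: str, target_in_document: str, tm: dict) -> None:
    """Committing the segment overwrites the stored TU with the document's version."""
    tm[source] = target_in_document


tm = {"Here is a segment with a red frame.": "Voici un segment entouré en rouge."}
source = "Here is a segment with a red frame."
edited_target = "Voici un segment encadré en rouge."

print(needs_red_frame(source, edited_target, tm))  # target was edited outside WFC
commit_segment(source, edited_target, tm)
print(needs_red_frame(source, edited_target, tm))  # TM now agrees with the document
```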

Provisional segments

If you want to leave a segment in a temporary, "provisional" state because it has not been completely translated (because its translation requires knowledge you will receive only later, or because you're missing some specific terminology), press F10 on the segment while it is opened. This will mark the current segment as provisional with a fluorescent yellow marker, and move to the next segment:

Document1.docx - Microsoft Word
X
  The balancer shaft must be properly mounted with the modifier spindle (part# B-4534).<}0{>L'arbre compensateur doit être correctement installé avec le $$$ (pièce# B-4534).<0}
 

The aircraft will not take off if the landing gear is not properly mounted.

L'avion ne décollera pas si le train d'atterrissage n'est pas correctement monté.

Later (the translation session being closed, i.e., no segment being opened), pressing F10 again will take you back to the first provisional segment in the current document and open it again so you can finalize it. When you close (validate, or commit) the segment by pressing Alt+Down (Next segment) or Alt+End (End translation session), the segment will lose its provisional segment status, and the yellow marker will be removed.

A provisional segment can be finalized (its translation completed) at any time, even days after you marked it with F10.

! Cleaning up a document is impossible as long as the document still contains provisional segments (marked in yellow).

If you deliver uncleaned (segmented, or bilingual) documents, make sure they do not contain provisional segments. Simply press F10 on a document to see if it contains any provisional segment.

This method is safer and faster than leaving marks like $$$ or XXX, which can be forgotten.

Functionalities

Translation Memory

This section lets you select a TM or create a new one, define TM attributes, set TM rules, set up a Background TM, set up a remote TM with Wordfast Server or Wordfast Anywhere, and set up machine translation.

TM

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT
 

This TM is active






 

Current TM:
C:\wordfast\NewTM-EN2FR.txt

Number of TUs: 234
File size: 28 Kbytes
Date: 2016-01-15 at 11:10:26

Source language: EN-US (English, Latin-1)
Target language: FR-FR (French, Latin-1)
 

When creating a new TM, WFC will prompt you for TMX-compliant language codes for the source and target languages. These codes consist of 5 characters (2 characters for the language, a dash, and 2 characters for the country or local variant). See the important remark 3 below for TMX interchange with other translation tools, like Trados.
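The 5-character shape described above can be checked mechanically before a TM is exchanged with another tool. The following Python sketch is only an illustration and is not part of WFC: the function name is hypothetical, and the pattern assumes that the second part may contain letters or digits, since the list in this section includes codes such as ar-01. It checks the shape of a code only, not whether a given tool recognizes it.

```python
import re

# Assumed shape: 2 letters, a dash, then 2 letters or digits (e.g. "en-US", "ar-01").
LANG_CODE = re.compile(r"[A-Za-z]{2}-[A-Za-z0-9]{2}")


def is_valid_lang_code(code: str) -> bool:
    """Return True if 'code' has the 5-character shape described in the manual."""
    return LANG_CODE.fullmatch(code) is not None


if __name__ == "__main__":
    for code in ("EN-US", "fr-FR", "ar-01", "english", "en_US"):
        print(code, is_valid_lang_code(code))
```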

Here are a few language codes. A larger list of TMX-compliant language codes can be found here.

af-ZA (Afrikaans)             fa-01 (Farsi)                  no-NY (Norwegian)
ar-01 (Arabic, Egypt)         fi-FI (Finnish)                pl-PL (Polish)
be-01 (Byelorussian)          fr-CA (French, Canada)         pt-BR (Portuguese, Brazil)
bg-BG (Bulgarian)             fr-FR (French, France)         pt-PT (Portuguese, Portugal)
ca-ES (Catalan)               hr-01 (Croatian)               ro-RO (Romanian)
cs-CZ (Czech)                 hu-HU (Hungarian)              ru-RU (Russian)
da-DA (Danish)                in-01 (Indonesian)             sh-01 (Serbo-Croatian)
de-AT (German, Austria)       is-01 (Icelandic)              sk-01 (Slovak)
de-CH (German, Switzerland)   it-CH (Italian, Switzerland)   sl-01 (Slovenian)
de-DE (German, Germany)       it-IT (Italian, Italy)         so-01 (Sorbian)
el-GR (Greek)                 iw-IL (Hebrew)                 sq-01 (Albanian)
en-CA (English, Canada)       ja-JP (Japanese)               sv-SE (Swedish)
en-GB (English, UK)           ko-KR (Korean)                 tr-01 (Turkish)
en-US (English, USA)          lt-LT (Lithuanian)             uk-UA (Ukrainian)
es-AR (Spanish, Argentina)    lv-LV (Latvian)                vi-VN (Vietnamese)
es-CL (Spanish, Chile)        mk-01 (Macedonian)             zh-CN (Chinese, PRC)
es-ES (Spanish, Spain)        mt-01 (Maltese)                zh-SG (Chinese, Sing. simpl.)
et-01 (Estonian)              nl-BE (Dutch, Belgium)         zh-TW (Chinese, Taiwan, simpl.)
eu-01 (Basque)                nl-NL (Dutch, Netherlands)

Beside its own native format, WFC can open TMX translation memories. TMX is the standard format for Translation Memory eXchange. If your client supplies you with TM data, ask for a TMX export.

For example, to re-use a WFC TM with Trados Translator's Workbench (TWB):

Reorganize. The Reorganize button reorganizes ("indexes") a TM. Since this will usually reduce the size of the TM by permanently erasing TUs that were marked for deletion, it is advised to perform this reorganization before e-mailing or archiving a TM, or before sharing it with another translator. Reorganising a TM should also be done if the TM seems to return no matches.

Note: When exporting or importing a WFC-generated TMX TM to or from another tool, the usual reason for failure is an inconsistency in TMX language codes between the two tools.

Working in network mode (sharing a WFC TM over a LAN)

The same translation memory can be shared by an unlimited number of users over a LAN (Local Area Network). Every WFC user that shares a TM over a LAN should simply open the shared translation memory through the network.

Windows users: use mapped networked drives/folders rather than long network drive/folder names. To map a network drive, use Windows Explorer's Tools/Map Network Drive menu and assign a volume letter to the drive (or even to the drive + folder) where the shared TM is located. As a result, the TM's path would be perhaps Q:\MyFolder\MyTm.Txt rather than \\BillysMachine\MyFolder\MyTm.Txt.

Every user should have a different set of user initials.

Glossaries can be shared over a LAN. Proceed as with TMs.

Do not index a glossary when it is shared (WFC will prevent you from doing so anyway).
As with TMs (see above), use mapped networked drives.

Translation Memory Attributes

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT

Attribute #1   User ID    MAW Mary Ann White
Attribute #2   Subject    PH Pharmacy
Attribute #3   Client     MSC Master Sec Corp.
Attribute #4   Document   {doc}
Attribute #5
Values for Attribute #1 (Ins/Del, Enter to add/delete, edit values)
JD John Doe

Enable penalties



The TM Attributes tab displays five attributes, four of which can be customised, the first attribute being reserved for the User ID (User initials and name). I recommend reserving attribute #2 for Subject, and attribute #3 for Client, as in the example provided in WFC, to facilitate the interchange of TMs. You remain free, however, to define attributes according to your own needs. Use the Sample button to load a set of typical attributes, which you can then customise.

Click in the desired "Attribute #" list to customize an attribute's name, then use the Enter key to change values.

Then click in the lower drop-down list to add attribute items (also called attribute values) using the following keys:

Insert or +, to add an entry;
Enter, to edit an entry;
Delete or -, to delete an entry.

The "active attribute value" (the one that will be recorded from now on in the Translation Units in the TM) is the one currently displayed in the lower drop-down list.

Entries consist of a mnemonic (an abbreviation, made of 2, 3 or 4 letters) followed by a space, then the narrative. WFC will record only the mnemonic in the individual TUs, to optimize space.

! Note: The first attribute is always the "User ID" attribute. By default (if you don't specify a User ID or name), the value for this attribute is the current Ms-Word user initials and name, as they are found in Ms-Word/Tools/Options or File/Options/User info. You can, however, customize this User ID as you wish. If the TM was used by other users, the drop-down list will show all the translators who have used the TM in the past (maximum number of translators: 60). If you work in a group, this feature lets you see the TM's pedigree.

Attributes are stored in the current WFC setup - the INI file. When working in a translation session, WFC will record the mnemonics of the set of the currently active attribute values into any new, or updated, TU. If you stop the translation session, open WFC and change active attributes values, the TUs generated in the next translation session(s) will receive the new set of attributes values, but the attribute values of the previously existing TUs are not affected.

Applying penalties based on attributes.
Penalties are numbers entered between parentheses (see the sample attributes for examples). A penalty lowers the match rate of a TU when it is found in the TM: if WFC finds a 100% match in the TM, but one of the TU's attribute values has a penalty of 5, the match rate will be lowered to 95%.

There are two types of penalties: absolute penalties and relative penalties.

Absolute penalties

Those are defined for attribute values (i.e., items in the drop-down list). When WFC proposes a TU which has that attribute value, it will receive the corresponding penalty.
Example: your translator ID is JTB John T. Bisham. You import, in your TM, 200 TUs coming from another translator whose ID is MAT Mark A. Tweed. You wish to unconditionally apply a penalty of 5 to propositions coming from TUs created by Mark Tweed.

Create or edit the

MAT Mark A. Tweed

attribute entry so it reads

MAT Mark A. Tweed (5)

From then on, every time a proposition comes from a TU created by Mark Tweed, it will have a penalty of 5. As a result, a Mark A. Tweed TU will never appear green, it will appear as 95% match at best.

Relative penalties:

Those are defined per attribute (in the attribute caption). These penalties will be applied if the particular TU's attribute value is different from the attribute value of the current session (as you defined it in WFC's TM Attributes section).

Example: you apply a relative penalty of 8 to the User ID attribute. Edit the User ID caption so it reads User ID (8) . From then on, if a TU's User ID is different from the one currently defined - supposedly your ID - then the TU will receive a penalty of 8, regardless of which translator it is.

Absolute and relative penalties are cumulative. So, if Mark A. Tweed already has an absolute penalty of 5, and the entire User ID category has a relative penalty of 8, then a TU with Mark A. Tweed will receive a total penalty of 13.
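The arithmetic above can be sketched in a few lines of Python. This is an illustration of the rule as described in this section, not WFC's actual code: the function names are hypothetical, and the sketch assumes that a penalty is always written as a number in parentheses at the end of the attribute entry or caption, and that the mnemonic is the first word of the entry.

```python
import re

# A penalty is assumed to be a number in parentheses at the end of the text.
PENALTY = re.compile(r"\((\d+)\)\s*$")


def parse_penalty(text: str) -> int:
    """Return the trailing '(n)' penalty of an entry or caption, or 0 if none."""
    match = PENALTY.search(text)
    return int(match.group(1)) if match else 0


def total_penalty(tu_entry: str, caption: str, active_entry: str) -> int:
    """Add the absolute penalty of the TU's attribute value to the relative
    penalty of the attribute, which applies only when the TU's mnemonic
    differs from the currently active one."""
    absolute = parse_penalty(tu_entry)
    tu_mnemonic = tu_entry.split()[0]
    active_mnemonic = active_entry.split()[0]
    relative = parse_penalty(caption) if tu_mnemonic != active_mnemonic else 0
    return absolute + relative


if __name__ == "__main__":
    penalty = total_penalty("MAT Mark A. Tweed (5)", "User ID (8)", "JTB John T. Bisham")
    print(penalty, 100 - penalty)  # total penalty, then the lowered match rate
```

With the entries used in the examples above, a 100% match from Mark A. Tweed's TU is lowered by 5 + 8 = 13 points, while a TU created under the active User ID receives no penalty.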

The basic purpose of penalties is that a TU which would otherwise appear green (an exact match) appears yellow instead, so that the translator's attention is drawn to that point. Penalties should be modest (a penalty of 2 is enough to prevent a TU from appearing green) because, if they are combined, they may actually bring the match rate below the fuzzy threshold. Penalties for TUs created by machine translation, however, are traditionally strong (10 to 15).

Another purpose of the attribute system is to manage TMs (extract, merge, classify, etc.) with the TM/glossary editor utility, taking into account the individual attributes of their TUs.

Translation Memory Rules

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT

Penalty on TM=0
Penalty on BTM=0 Penalty on WFA/WFS=0
Penalty for case difference=0
Penalty for different numbers=0
Penalty for whitespace difference=0
Penalty for different quotes/apostrophes/dashes=0
Definition level for creating In-context matches=2
When editing (changing) a 100% match=0 (Overwrite TU)
When re-using an existing TU, update it if attributes are different=0 (No)

Preset profiles
 

Translation Memory (TM) rules are used to fine-tune WFC's TM engine. The TM engine's task is to find the best suitable match for the source segment you are currently translating when a segment is opened. Unfortunately, in many instances, there is no "perfect match", or objective identity between the source segment in your document, and the closest candidate in the TM. In this situation, the TM engine has to draw a subjective match through a process that uses artificial intelligence to "figure out" whether the degree of fuzziness makes the candidate TU a good choice. In some cases, WFC uses a substitution algorithm to update the proposed segment and bring it closer to an exact match. The elements that are updated or substituted are typically untranslatable items (like numbers, fields, tags), also called placeables. The goal is to relieve the translator from the chore of spotting and updating placeables.

This is obvious when numbers are involved. WFC will consider the following two sentences to be "exact" matches:

The net weight is 1,000 Kg.
The net weight is 2,000 Kg.

This is so because WFC can easily detect numbers and carry out a substitution. In this situation, numbers like 1,000 or 2,000 are considered placeables by the TM engine, and they are updated to reflect the document's reality rather than the TM.
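The idea of treating numbers as placeables can be sketched as follows. This Python fragment is a simplified illustration, not WFC's substitution algorithm: the function names and the French target sentence are made up for the example, and the sketch assumes that each number appears in the target segment written exactly as in the source.

```python
import re

# A number is assumed to be a run of digits with optional , or . groupings.
NUMBER = re.compile(r"\d+(?:[.,]\d+)*")


def mask_numbers(segment: str) -> str:
    """Replace every number with a neutral placeholder."""
    return NUMBER.sub("{n}", segment)


def substitute_numbers(doc_source: str, tm_source: str, tm_target: str):
    """If the two source segments differ only by their numbers, return the
    TM target with the document's numbers; otherwise return None."""
    if mask_numbers(doc_source) != mask_numbers(tm_source):
        return None
    mapping = dict(zip(NUMBER.findall(tm_source), NUMBER.findall(doc_source)))
    # Single pass over the target, so a substituted number is never replaced again.
    return NUMBER.sub(lambda m: mapping.get(m.group(0), m.group(0)), tm_target)


if __name__ == "__main__":
    print(substitute_numbers(
        "The net weight is 2,000 Kg.",
        "The net weight is 1,000 Kg.",
        "Le poids net est de 1,000 Kg.",  # hypothetical TM target
    ))
```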

The method is a great help and time-saver in most situations. "Most" here is so overwhelming that, by default, most translation tools are set to automatically substitute placeables like numbers, or fields.

This method can fail when the placeable substitution requires a grammatical or syntactical update of the target segment - a task which WFC cannot perform. In the following example:

The process takes 2 years to complete.
The process takes 8 years to complete.

The substitution process (replacing 2 with 8) would work flawlessly with most languages, but would produce a grammatically incorrect sentence in a few languages, like Russian.

The TM rules tab offers a high level of customization in this respect.

Some penalties apply only to exact (so-called 100%) matches; others apply to any match, exact or fuzzy.

!  Note: The three penalties below (on TM, BTM, and Remote TM) are made visible to the translator, and constitute a temporary penalty. The match rate (the small purple number between the two segments) will appear bold and red to warn the translator that a temporary penalty has been applied (as is the case with attribute-based penalties). Contrary to other penalties further below, those three penalties do not turn a 100% match into a "real" fuzzy match, which means that if a penalized 100% proposition is accepted as is by the translator, the translation unit is not written into the TM or VLTM.

Penalty on TM: (100% and fuzzies) this penalty is applied when a proposed match is drawn from the TM.

Penalty on BTM: (100% and fuzzies) this penalty is applied when a proposed match is drawn from the BTM.

Penalty on Remote TM: (100% and fuzzies) this penalty is applied when a proposed match is drawn from a remote TM, either through Wordfast Anywhere or through Wordfast Server.

!  Note: In all cases below, a penalty of 1 point or more would produce a so-called fuzzy match. If the translator accepts the translation as is, WFC will write the (now new) translation unit into the TM, therefore adding an additional version of the previously existing TU, this time with a different case. It is important to note that although penalties produce a more strict TM engine, they tend to populate TMs with more translation units.

Penalty for case difference: (100% only) this penalty is applied when an exact match is found in the TM, but case is the only difference. Example:

Meet us at the ATA!
MEET US AT THE ATA!

Penalty for different numbers: (100% only) This penalty is applied when different numbers are found in a segment. Example:

The process takes 2 years to complete.
The process takes 8 years to complete.

The last two items apply when an existing TU is re-used, or edited, after WFC has proposed it as a 100% match. A TU is re-used if you validate a proposed 100% (green) TU without editing (modifying) the target segment (the translation). A TU is edited if you edit (modify) the target segment. The following rules apply immediately after you validate such "100% match" TUs, to control the way they are stored into the TM.

In-Context Matches: This feature enables In-Context Matches (ICM). ICMs are matches where the previous and the following segments also match at 100%. The idea is that if a segment is embedded in a series of three exact matches, the trustworthiness of that segment greatly increases. ICMs have a score of 101 so they are picked first in case there are other competing 100% matches. Remember that match scoring, in TMs, carries little linguistic sense.
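As a rough illustration of the principle, the Python sketch below gives a score of 101 to a segment whose neighbours are also exact matches. It is deliberately simplified and is not WFC's implementation: the function name is hypothetical, the TM is reduced to a set of source segments, and the sketch ignores both the order of TUs inside the TM and the "definition level" setting.

```python
def score_segments(doc_segments, tm_sources):
    """Return one score per document segment: 101 for an in-context match,
    100 for a plain exact match, 0 when there is no exact match."""
    exact = [segment in tm_sources for segment in doc_segments]
    scores = []
    for i, is_exact in enumerate(exact):
        if not is_exact:
            scores.append(0)
            continue
        previous_ok = i > 0 and exact[i - 1]
        next_ok = i + 1 < len(exact) and exact[i + 1]
        scores.append(101 if previous_ok and next_ok else 100)
    return scores


if __name__ == "__main__":
    tm = {"First sentence.", "Second sentence.", "Third sentence."}
    doc = ["First sentence.", "Second sentence.", "Third sentence.", "New text."]
    print(score_segments(doc, tm))
```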

If your TM had no previous ICM detection, you can reprocess it to enable ICM matches:

Enable a level of ICM support in WFC's TM rules tab.
If the TM has been previously sorted or shuffled, segments may not be in their original sequence any more. If that is the case, use WFC's Data editor (one icon before the last in the WFC toolbar), click "Tools", then sort the TM on date. That will restore a decent level of historical sequence, which is important for ICMs.
Back in the WFC setup dialog box, in the Translation Memory pane, click the "Reorganise" button. Wordfast will create the necessary indexes for ICMs.

Penalty for whitespace difference: (100% only) This penalty is applied when an exact match is found in the TM, but the only difference is in spaces found at either beginning or end of the segment, or where there is a different number of repeated spaces within the segment. Example:

Meet us at the ATA!
Meet   us   at   the   ATA!

Penalty for different quotes/apostrophes/dashes: (100% only) this penalty is applied when an exact match is found in the TM, but the types of Quotes, Apostrophes, or Dashes (QADs), are different.

Note that ’ is sometimes used as a closing quote, sometimes as an apostrophe. WFC assumes ’ is a closing quote when the same segment contains ‘ before ’.

WFC is blind to QADs when a 100% match is found, and when, in the TM's segment, the only difference is made of different QADs which WFC can substitute without any ambiguity, as in:

This is a "quoted sentence".
This is a « quoted sentence ».

This penalty will force WFC's TM engine to consider the two segments above as not being 100% matches.
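The three "100% only" differences described above (case, whitespace, and QADs) can be told apart with simple normalization. The Python sketch below is an illustration only: the function names and the list of characters treated as QADs are assumptions made for this example, not WFC's own rules.

```python
import re

# Typographic quotes, apostrophes and dashes mapped to plain ASCII (assumed list).
QAD_MAP = str.maketrans({
    "\u201c": '"', "\u201d": '"',  # curly double quotes
    "\u2018": "'", "\u2019": "'",  # curly single quotes / apostrophe
    "\u2013": "-", "\u2014": "-",  # en dash, em dash
})


def normalize_space(segment: str) -> str:
    """Collapse repeated whitespace and trim both ends."""
    return re.sub(r"\s+", " ", segment).strip()


def normalize_qad(segment: str) -> str:
    """Reduce quotes, apostrophes and dashes to one plain form."""
    segment = re.sub(r"\u00ab\s*", '"', segment)  # opening guillemet and inner space
    segment = re.sub(r"\s*\u00bb", '"', segment)  # closing guillemet and inner space
    return segment.translate(QAD_MAP)


def difference_kind(doc_segment: str, tm_segment: str) -> str:
    """Name the only difference between two segments, if it is a simple one."""
    if doc_segment == tm_segment:
        return "identical"
    if doc_segment.lower() == tm_segment.lower():
        return "case"
    if normalize_space(doc_segment) == normalize_space(tm_segment):
        return "whitespace"
    if normalize_qad(doc_segment) == normalize_qad(tm_segment):
        return "qad"
    return "other"


if __name__ == "__main__":
    print(difference_kind("Meet us at the ATA!", "MEET US AT THE ATA!"))
    print(difference_kind('This is a "quoted sentence".',
                          "This is a \u00ab quoted sentence \u00bb."))
```

A TM engine could then apply the penalty that corresponds to the returned kind; with all three penalties set to 0, each of these pairs would still count as a 100% match.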

Editing an existing TU

This feature offers 4 choices:

When WFC finds more than one possible translation for a source segment, the Alt+Right shortcut will let you cycle through all the possible translations.

In case there are many identical translation units in the TM, the first match proposed by WFC should be the most recent one, based on its date stamp.

When re-using an existing TU, update if attributes are different: if the currently active attributes (as set in WFC > Translation Memory > Attributes) are different from the candidate TU's own attributes (as found in the TM), you may choose to update the TU in the TM with the new set of attributes (the TU will be rewritten "as is", but the current set of attributes will replace the existing ones). Check the "Update existing TU if attributes are different" checkbox. The usage counter will be incremented, and the new set of attributes will replace the TU's existing attributes; source and target text remain the same.

Background Translation Memory (BTM)

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT
 

This BTM is active






 

Current TM:
C:\wordfast\BackTM-EN2FR.txt

Number of TUs: 8234
File size: 208 Kbytes
Date: 2015-01-15 at 10:22:16

Source language: EN-US (English, Latin-1)
Target language: FR-FR (French, Latin-1)
 

Select BTM: A background translation memory (BTM) is a read-only translation memory which WFC will scan for exact or fuzzy matches after scanning the current TM. If a match is found in the BTM, Ms-Word's status bar and a beep sound will inform the translator that the proposition comes from the BTM. In the AutoSuggest drop-down list, BTM propositions have a match score in green, rather than in blue.

Make sure the "This BTM is active" checkbox is checked for the BTM to be used.

Which TM comes first?
By default, if the BTM is used, the BTM is preferred over the TM. If the BTM and the TM (and WFS or WFA, when applicable) yield Translation Units (TUs or "matches") with the same match rate, the BTM's TU will be displayed first. The Alt+right/left shortcuts can be used to display other TUs.
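The tie-breaking rule can be pictured as a sort on two keys: match rate first, then origin. The Python sketch below is only an illustration; the function name, the tuple layout and the origin labels are hypothetical, and the default order (BTM, then TM, then remote TM) is inferred from this manual's description rather than taken from WFC's code.

```python
# Assumed default preference when match rates are equal.
DEFAULT_ORDER = ["BTM", "TM", "REMOTE"]


def rank(candidates, order=DEFAULT_ORDER):
    """Sort (origin, match_rate, target_text) tuples: highest match rate
    first, then by the position of the origin in 'order'."""
    return sorted(candidates, key=lambda c: (-c[1], order.index(c[0])))


if __name__ == "__main__":
    proposals = [
        ("TM", 100, "Voici ma traduction."),
        ("REMOTE", 100, "Voici la traduction."),
        ("BTM", 100, "Voici ma première traduction."),
    ]
    for origin, rate, text in rank(proposals):
        print(origin, rate, text)
```

Passing a different list as order mimics a change of preference, for example putting the TM before the BTM.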

Note that there is a setting in the AutoSuggest (AS) setup that decides which proposition (from the TM, BTM, WFA, WFS, MT1, MT2, etc.) is pasted in the empty target segments when the segment opens.

Remote TM

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT

  
Use a remote TM & glossary
VLTM
 
WFS admin password (optional)
 
 
Note: remote terminology appears as Glossary #3
 
Enable on already-translated segments     Use only for concordance
 

This section of Wordfast offers a connection to TMs from a Wordfast TM Server.

Wordfast Anywhere (WFA) accounts are, as of going to press (Summer 2018), free for all. In WFA, get the URL from "TMs & Glossaries > Setup > View/Edit". It looks like this:

wf://CznlO71SE7.EN2FR:@207.223.244.237:47110/14+?$yCv1S

That way, you can connect to the TM and glossary that are currently active in your WFA account. WFA TM connection is read-write. You can even set your TM as "shared" in your WFA account, if you wish so, and invite others to share it.

Wordfast Anywhere is found at http://www.freetm.com/

Wordfast Server
This option is reserved for organizations that have set up a Wordfast TM server. A free version of Wordfast Server in demo mode for up to three simultaneous connections can be downloaded from http://www.wordfast.net/zip/WfServer.zip. It can be used locally by translators who run very large TMs (over one million TUs), or who want to share their TM with one or two other translators.

Which TM comes first?
If a remote TM and the TM (and the BTM as well, when applicable) yield Translation Units (TUs or "matches") with the same match rate, the TM's TU will be displayed. The Alt+right/left shortcuts can be used to display other TUs. The AS (AutoSuggest) section in the setup can be used to set the order of preference, for example, having the VLTM, or the BTM, come before the TM.

Machine Translation (MT)

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
TM TM Attributes TM Rules BTM Remote TM MT

Ms-Word-based machine translation (see manual)   
 
 
Web-based machine translation:
No MT 
No MT 
No MT 

Enable on already-translated segments   
 
Using a local (installed in Ms-Word) MT resource.

During a translation session, when no match is found in the translation memory, WFC can request an on-the-fly translation from an installed translation program, such as Systran™, Power Translator Pro™, PROMT Reverso™, etc. After purchase and installation, these MT programs act as Ms-Word add-ons, just like WFC. There are four typical ways your Ms-Word MT add-on operates to have the document, or a selection of text, or the current paragraph, machine-translated:

For each method, you will need to provide two parameters that tell WFC how to request the translation. These two parameters are entered with a comma as separator. Here are the parameters you will have to provide for each situation:

The menu name, then the sub-menu that triggers the translation of the selection or the current paragraph (not the entire document's translation). This could be "Systran,Selection" or "Translate,Selection" for example, with Systran 4, or Power Translator Pro.
The toolbar name and the icon name. Your MT add-on's toolbar name is found in Ms-Word's "View/Toolbars" menu. You don't need to quote the entire toolbar's name, just a keyword that is special to this toolbar's name (maybe like "PROMT" or "Systran"). The icon name appears as "tip" when the mouse hovers over the icon. Note this icon name. This could be "Translate paragraph" for example. So the entire parameter could be "PROMT,Translate paragraph".
Select a portion of text and right-click on it. Note the name of the contextual menu that's used to translate the current paragraph (this could be "Translate paragraph", for example). The parameter to enter would then be "Contextual,Translate paragraph".
Note the macro's exact name (like "MTMacro"). The parameter to enter would be "Macro,MTMacro".

To set up MT activation:

Go to WFC's Translation memory/MT tab. Check the "Menu, sub-menu for MT" checkbox.
In the text box immediately after the checkbox, enter the parameter as defined above. If you work on tagged files with an MT package that does not support tags, check the "Remove tags" option (if you are not sure what this means, check "Remove tags").
Close WFC. In Ms-Word, test your translation package on a short sentence to see if it is correctly set up and running.

This is the normal procedure, and it works with Systran, Power Translator Pro, PROMT Reverso on all versions, and most other packages. Some trial-and-error may be required to have it run.

On systems running Systran4, the Systran add-on that links Ms-Word to the Systran engine must be in Ms-Word's "Startup" folder (as is the case after Systran's regular installation procedure is carried out), so that it is loaded on startup. Systran may not work if its add-on is simply activated after startup.

Using a remote, Web-based MT resource

WFC can connect to various Machine Translation sources. Note that many of those sources are subscription-based, and you will need one or two secret keys (aka "API ID") to connect to them.

WFC has a special deal with WorldLingo, which is why WorldLingo MT is free and unlimited for WFC users. WL covers a wide array of languages. If your language pair is covered, as the saying goes - go for it!

Multiple MT sources can be enabled at the same time. It's an exciting way to have giants like Microsoft, Google, or deepL compete to offer you their best MT. Oh well, your skill will probably be an order of magnitude better than those of the multi-billion-dollar giants.

Note that alternate settings for other MT sources can be set up, provided those sources can be queried via URL and are using the REST protocol.

! Note on confidentiality, secrecy, NDA compliance, etc. As is obvious, using a remote MT resource means that each source segment, as you translate, travels over the web to a service that will machine-translate it, then send back the proposed translation, usually of questionable quality. There are two important things to note:

Creating a custom MT connector

Note: this section is DIY (Do It Yourself). Our hotline cannot assist in the customization of an MT engine, because that requires knowledge of the remote provider's specifications. However, public discussion groups may offer help.

If your remote Machine Translation provider is not listed, it is possible to create a custom connector for it. This is only possible if your MT provider's API is using a REST standard, and returns results in a JSON, or similar, format. That is the case with major MT providers currently available with WFC (Google, Microsoft, WorldLingo, deepL, MyMemory, etc.).

Let's assume your preferred MT provider is WorldLingo and we create a custom engine for it. You explore WorldLingo's API documentation. It essentially boils down to a query URL, with parameters.

In WFC's Machine Translation setup, select a "Custom" MT engine, then enter the following as API key:

url=http://www.worldlingo.com/S000.1/api?wl_data={ss}&wl_srclang={sl}&wl_trglang={tl}&wl_password=secret{jsonkey=}

Note the various elements:

url= tells WFC what URL is used.

Inside the URL:

Testing: In the following example, the raw URL was customized for an English-to-French language pair, to translate "Hello World". Your real URL will look different, the following is an example based on WorldLingo. Open a browser.

http://www.worldlingo.com/S000.1/api?wl_data=Hello%20world&wl_srclang=en&wl_trglang=fr&wl_password=secret

Copy-paste the above URL into the browser's address bar. The browser should display:

Bonjour le monde

You can try with other MT providers, and they may use a more complex JSON reply. In that case, the URL in WFC's setting must specify the JSON key so that WFC can identify the result. Here the JSON key is "translation" so you would use {jsonkey=translation}:

{"responseData":{"translation":"Bonjour le monde","match":1} }

Your MT provider may require more parameters, such as a secret ID key (aka an API key), or other elements, in which case, you should hard-code those in the URL.
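Before entering a custom URL in WFC, it can help to prototype the query outside WFC. The Python sketch below shows one way to fill in a URL template and read a reply; it is not WFC's code. It assumes that {ss}, {sl} and {tl} stand for the source segment, source language and target language (as the test URL above suggests), the function names are hypothetical, and no request is actually sent.

```python
import json
from urllib.parse import quote

# Template taken from the example above, without the {jsonkey=} part.
TEMPLATE = ("http://www.worldlingo.com/S000.1/api?wl_data={ss}"
            "&wl_srclang={sl}&wl_trglang={tl}&wl_password=secret")


def build_url(template, source_segment, source_lang, target_lang):
    """Fill in the placeholders; the segment is percent-encoded for use in a URL."""
    return (template.replace("{ss}", quote(source_segment))
                    .replace("{sl}", source_lang)
                    .replace("{tl}", target_lang))


def find_key(data, key):
    """Search nested JSON data (dicts and lists) for the first value stored under 'key'."""
    if isinstance(data, dict):
        if key in data:
            return data[key]
        children = list(data.values())
    elif isinstance(data, list):
        children = data
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def extract_translation(reply, json_key=""):
    """Return the raw reply when no JSON key is given, else the value under that key."""
    if not json_key:
        return reply.strip()
    return find_key(json.loads(reply), json_key)


if __name__ == "__main__":
    print(build_url(TEMPLATE, "Hello world", "en", "fr"))
    reply = '{"responseData":{"translation":"Bonjour le monde","match":1} }'
    print(extract_translation(reply, "translation"))
```

The generated URL can be pasted into a browser as in the test described above; if the reply is plain text, the JSON key is left empty.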

Terminology

Glossaries

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference
 

This glossary is active






 

Current glossary:
C:\wordfast\myGLO-EN2FR.txt

Number of entries: 834
File size: 22 Kbytes
Date: 2017-09-15 at 08:09:26

Highlight colour:     Turquoise    ▼

Use fuzzy terminology recognition
Lock target case on entire glossary
 
 
Getting started
In Ms-Word, create a new document. In this new document, type a short series of source terms followed by a tabulator (press the tabulator key, shown here below as  ↦ ), followed by their translation, then Enter, as in the following example:

work visa↦ visa de travail
country↦ pays
country of birth↦ pays de naissance

Name and Save the new document as "Text-only" (preferably Unicode or Encoded Text). Congratulations, you have created a WFC glossary. Close the glossary document.
In WFC, go to the dialog box shown above (Terminology/Glossary X). Click the "Select glossary" button, find and open the glossary you just created (in the "File type" list, select "Text", or "All files").

Click the "Reorganise" button. This will make WFC sort the glossary on source terms, and index all entries.

Make sure the "This glossary is active" checkbox is checked, so WFC performs terminology recognition using this glossary during translation sessions. If you uncheck this checkbox, terminology recognition is suspended.

Close WFC.

In a new document with some text that includes any of the source terms listed above (like "work visa", "country", etc.), start a translation session. Normally, these terms should be highlighted in light blue when a source segment includes them. This means that WFC has recognised that these terms are present in glossary #1. You can select blue-highlighted terms with the Ctrl+Alt+Left/Right shortcuts and see their translation in the status bar, or copy their translation at the insertion point in the target segment with Ctrl+Alt+Down. If you place the cursor on a blue-highlighted term and press Ctrl+Alt+G, the glossary drop-down list will open and show the glossary entry. This same toolbar also enables you to open the glossary editor window.
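The glossary format described above (source term, tabulator, target term, one entry per line) is easy to read with a short script, for example to check a glossary before loading it in WFC. The Python sketch below is an illustration with hypothetical function names; its term recognition is a plain case-insensitive substring search, which is much cruder than WFC's own recognition (it would, for instance, find "country" inside "countryside").

```python
def parse_glossary(text):
    """Read 'source<TAB>target' lines into a dict keyed on the lower-cased source term."""
    entries = {}
    for line in text.splitlines():
        parts = line.split("\t")
        if len(parts) >= 2 and parts[0].strip():
            entries[parts[0].strip().lower()] = parts[1].strip()
    return entries


def recognize_terms(segment, glossary):
    """Return (source, target) pairs whose source term occurs in the segment,
    longest source terms first."""
    lowered = segment.lower()
    hits = [(source, target) for source, target in glossary.items() if source in lowered]
    return sorted(hits, key=lambda pair: -len(pair[0]))


if __name__ == "__main__":
    sample = ("work visa\tvisa de travail\n"
              "country\tpays\n"
              "country of birth\tpays de naissance\n")
    glossary = parse_glossary(sample)
    print(recognize_terms("Please state your country of birth.", glossary))
```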

The "Lock target case" checkbox forces a glossary-wide case lock on target terms. See further below for an explanation of "Lock target case", a feature which also exists at the level of each individual glossary entry, which is the recommended option.

Adding terminology
In a document, during a translation session or at any time, select a source term and press Ctrl+Alt+T; then select a target term and press Ctrl+Alt+T again. This will display the dialog box below to finalize the pair of terms you want to add to any glossary:

Wordfast (c:\wordfast\wordfast.ini)
X

Source entry Lock source case
 double-furnace boiler

Target entry
Lock target case
 chaudière à double foyer

Comment
 a special type of boiler used in power plants.




F1 F2 F3
  
 
 
 

Lock source case
The Lock source case checkbox is unchecked by default. When checked, it renders terminology recognition case-sensitive. Terminology Recognition is WFC's ability to 1. spot source glossary terms, or entries, in the currently opened source segment, and 2. highlight those terms in the source segment. This is generally not needed, and source entries are best entered in lower-case, unless case is an integral part of the source term.

The rare situation where this setting is needed is when source terms have different translations according to case. The example below, although imperfect (the common noun wap could be written entirely in capital letters, although that is unlikely), will require two or three glossary entries, each with a checked Lock source case option:

Lock target case
The Lock target case checkbox is unchecked by default. When checked, it locks the target term's case when the target term is proposed in the course of translation.

In linguistics and dictionaries, the default appearance of terms is lower-case, except when case is a defining part of the term as in proper names, acronyms, etc. To summarize, a glossary, like a dictionary, generally contains lower case terms. With WFC, when you place terminology in the target segment, using the AutoSuggest feature or the Ctrl+Alt+Left/Right/Down set of shortcuts, WFC tries to replicate the source term's case to save time. This, of course, assumes that case generally follows the same principles in the source and target languages, which is not always the case.

Suppose your glossary contains a pair of source -> target terms such as summary -> résumé. You may have to translate various segments like:

Please send us a summary of your work.
Summary: see page 22.
3. SUMMARY

If the Lock target case checkbox is unchecked for that term, WFC will automatically adapt "résumé" to the source case, so it will correctly place "résumé" or "Résumé" or "RÉSUMÉ" depending on the source segment. WFC dresses up the target term's case in accordance with the source term's case.
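The case adaptation described above can be pictured with a short Python sketch. It is only an illustration of the principle, not WFC's actual code; the function name `adapt_case` and its parameters are hypothetical.

```python
def adapt_case(target_term, source_occurrence, lock_target_case=False):
    """Dress the target term in the case of the source term as found in the segment.

    Hypothetical helper: it only illustrates the behaviour described in the manual.
    """
    if lock_target_case:
        # Lock target case: the glossary spelling is used as is.
        return target_term
    if source_occurrence.isupper():
        # Source written entirely in capitals, as in "3. SUMMARY".
        return target_term.upper()
    if source_occurrence[:1].isupper():
        # Source starts with a capital letter, as in "Summary: see page 22."
        return target_term[:1].upper() + target_term[1:]
    return target_term
```

With the summary -> résumé pair, the three source forms "summary", "Summary" and "SUMMARY" would yield "résumé", "Résumé" and "RÉSUMÉ" respectively. With the lock set, a term like "janvier" stays in lower case even when the source reads "January".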

This feature is not recommended in some cases: with German, for example, or with terms whose case usually differs between the two languages. Month and day names in English have an upper-case first letter (January, February...), while that is generally not so in French (janvier, février, unless they start a sentence). This is when Lock target case comes in handy for a glossary entry.

You can check, then right-click the Lock target case checkbox in the glossary entry dialog box to make it the default setting when adding terminology. That is recommended only if most of your terminology has a different case in the target language (for example, when working into German).

You can also enforce a glossary-wide, unconditional "Lock target case" by checking that same checkbox in the Terminology > Glossary setup. That glossary-level setting supersedes the entry-level setting. However, it is not a recommended setting.

Fields
The Terminology addition dialog box has three Fields that are made to receive codes or special mentions that do not belong in "Source entry", "Target entry", or "Comment" fields.
Many translators add their own codes to glossary entries so they can later sort glossaries and extract selected terms.

For example, if you work on a project for a certain client, you may wish to add a client code to each glossary entry you create for this client, so that later you may distinguish them from other entries.

Since entering these codes is usually a repetitive task, two automatic features are added here:

There are many other ways to add terminology. One way is to open the glossary with Ms-Excel, then type, or even copy-paste, rows and columns of data. Do not forget to close the glossary in Ms-Excel before using Ms-Word and WFC, because Ms-Excel keeps the glossary locked for as long as it is open.

The Data Editor is also a way to manage and review glossaries.

Glossary format
A WFC glossary is a tab-delimited, text-only file containing 2 or 3 columns (source term, target term, optional comment). Additional columns can be present. Unicode text is accepted. "Columns" in a tab-delimited text-only file are items separated by tabulators. If opened with Excel, the items in such a tab-delimited TXT file will be neatly distributed into columns. If opened with Ms-Word, you would need to select the text and use the Table/Convert text to table menu to actually see items in a table format, with visible columns (but, before saving the text document, you would need to convert the table back to tab-delimited text).
Glossaries can be created or edited using Microsoft Excel. The first column (column A) should contain source terms, the second column (column B) should contain target terms, the third column should contain comments, if any. The Excel spreadsheet thus created should be saved as "Tab-delimited text" using Excel's File/Save as... menu.
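As an illustration of the format described above, here is a minimal Python sketch that reads tab-delimited glossary text into (source, target, comment) entries. The function name `parse_glossary` and the sample content are assumptions made for this example; additional columns are simply ignored here.

```python
def parse_glossary(text):
    """Split tab-delimited glossary text into (source, target, comment) tuples.

    Hypothetical helper, for illustration only.
    """
    entries = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip empty lines
        columns = line.split("\t")
        source = columns[0].strip()
        target = columns[1].strip() if len(columns) > 1 else ""
        comment = columns[2].strip() if len(columns) > 2 else ""  # the comment is optional
        entries.append((source, target, comment))
    return entries


# Two sample lines: one with a comment, one without.
sample = (
    "double-furnace boiler\tchaudière à double foyer\ta special type of boiler\n"
    "avocat\tlawyer\n"
)
entries = parse_glossary(sample)
```

Each line of the file is one entry, and each tabulator marks the boundary between two columns, which is why Excel can lay the same file out as a spreadsheet.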

Format when saving
If the glossary is a Ms-Word table, immediately before saving it, select the entire table (with the Table/Select table menu), use the Table/Convert to text menu and convert the table to text, with the tabulator set as delimiter. Save your document as Text-only, or Unicode text if needed.
If the glossary is an Excel spreadsheet, save it as Tab-delimited text with Excel's File/Save as... menu. The Tab-delimited Text format is selected in the "File type" drop-down option list.

Terminology format
Terms can use upper and/or lower case. Avoid unnecessary characters like brackets, quotes, slashes, dashes, etc. unless absolutely necessary.

The * (asterisk) wildcard can be used at the end of a term, if different endings of a term are possible (this is called MFTR and is described below). Here is a sample English-French glossary:

Maintenance*↦ Entretien*
Interview*↦ Entrevue*
minimum wage*↦ salaire* minim*

Do not place the * wildcard less than four characters from the beginning of an entry. So pa* the bill* is not valid; use three entries like pay the bill*, pays the bill* and paid the bill*. However, if necessary, the * wildcard can be placed at the beginning of a word, as the first character, for languages where the inflection is a prefix.

The combination of more than one wildcard in the same term is not recommended. It may produce unreliable results.

Multiple glossary entries
WFC accepts multiple glossary entries as follows:

avocat↦ attorney
avocat↦ barrister
avocat↦ lawyer
avocat↦ avocado

etc.

Add {preferred} to either the Comment field, or any of the three F1, F2, F3 fields to show WFC which entry is preferred when propagation is used.

Fuzzy Terminology Recognition (FTR)

FTR in WFC can be automatic (AFTR), or manual (MFTR).

MFTR is done by manually adding asterisk wildcards (*) at the end of words in the glossary so that most inflections of the glossary entry will be recognized in the document. For example, a glossary source entry like

Digital Analog* Converter*

will allow WFC to recognize, in the document, various approaching forms such as

Digital Analog Converters
Digital Analogic Converter

etc.

if they are found in the source segment.

The asterisk can be placed inside a word. For example, if the following entry is in the glossary:

Methyl*one

that entry will match methylisothiazolinone, methylprednisolone, etc, in the document, but it will not match methylisoline.

The pipe (|) can be placed inside a term, and is equivalent to an ending asterisk: anything after the pipe will be ignored.

The question mark (?) can replace any single character:

Methyl?one

will match Methyleone and Methylhone, but not Methylheone.

The sharp sign (#) can replace figures:

$#-fine

will match $200,000-fine and $200-fine.
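One way to picture how these wildcards behave is to translate a glossary entry into a regular expression. The Python sketch below is only an approximation made for illustration: it does not reproduce WFC's actual matching, the function names are hypothetical, and the pipe (|) is left out. It assumes that * stands for any run of letters, ? for one letter, and # for a figure that may contain separators.

```python
import re


def entry_to_regex(entry):
    """Translate a glossary entry with *, ? and # wildcards into a regular expression."""
    parts = []
    for char in entry:
        if char == "*":
            parts.append(r"\w*")        # any run of word characters (possibly empty)
        elif char == "?":
            parts.append(r"\w")         # exactly one word character
        elif char == "#":
            parts.append(r"\d[\d,.]*")  # a figure, separators allowed (assumption)
        else:
            parts.append(re.escape(char))
    # Recognition is not case-sensitive unless Lock source case is checked.
    return re.compile("".join(parts), re.IGNORECASE)


def entry_matches(entry, term):
    """Return True if the whole term is covered by the glossary entry."""
    return entry_to_regex(entry).fullmatch(term) is not None
```

Under these assumptions, `entry_matches("Methyl*one", "methylprednisolone")` is true while `entry_matches("Methyl*one", "methylisoline")` is false, in line with the examples given above.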

Do not overload glossaries with common source language terms. Glossaries are meant to assist the translator with technical jargon, not with common language. Glossaries are not a device to save typing.

AFTR is useful on raw glossaries, where the translator has no time to manually place asterisks as explained above. WFC uses various techniques that attempt to automatically make up for the possible inflections of terms found in the document's source text.

Note that glossaries can be hybrid: they can contain both AFTR (raw) and MFTR (asterisked) entries. If an entry has an asterisk, WFC will not attempt AFTR on that entry, but will make use of the asterisk. If two entries match the same queried term, the MFTR entry will be chosen rather than the match brought up by AFTR. However, if an un-asterisked glossary entry perfectly matches a queried term (neither AFTR nor MFTR needed), then of course this entry will prevail over all others.

WFC can use more than one glossary. This enables you to simultaneously use both client terminology, and your own terminology, in two distinct glossaries. You can even set color schemes to immediately spot from which glossary a term has been recognised.

Client terminology is usually rushed together with the job and, in some cases, can even be rushed after the job has started, by overworked project managers. Manually fuzzying-up a glossary takes time and is best done between jobs, in spare time. This is why AFTR is acceptable for rushed client terminology, in the heat of a live project.

AFTR attempts to recognize most inflections. AFTR is by nature an imprecise (fuzzy) process, and may bring up occasional mismatches, which should simply be ignored, or, if time permits, lead to manual fixing/fuzzying (MFTR) in the glossary. Here are a few observations:

The conclusion is that AFTR should not be attempted on large glossaries with many similar entries. And in no case can AFTR be used for saving typing time, "autoassembly" schemes, or be a substitute for machine translation.

Typical client-supplied terminology looks like this (target terms omitted):

two-way multiplexed autoresponder
double-furnace boiler
dichotomic search
DOS-based application

etc. This is where AFTR really helps (complex specialist jargon), and yields best results. Once the job is completed, and you have a spare hour, you may consider integrating client terminology into one of your existing glossaries, and manually add asterisks as follows:

two-way multiplexed autoresponder*
double-furnace boiler*
dichotomic search*
DOS-based application*

etc. This way, your homegrown glossary runs on MFTR rather than AFTR.

The essence of AFTR is to determine a word's stem by gradually stripping letters from the word's end. Note that we deal here with statistics: there are exceptions to this rule, and every language has its requirements. The verb go, for example, changes into went in the past tense, thereby defeating any AFTR attempt. Fortunately, client terminology is primarily made of technical words and expressions, where nouns outnumber verbs. And technical jargon (some of which is imported) is less prone to wild variations than literary language. Glossaries are primarily used for jargon, and more precisely, client jargon: the translator is supposed to understand common language.

Stripping is done gradually, by increments of one trailing letter, to a maximum of four letters. A word like applications (found in the source segment) will first be reduced to application, then to applicatio, then applicati etc. Obviously, the first attempt (producing application) would hit a match in the glossary, provided the glossary has an entry for application*.
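The gradual stripping described above can be sketched in a few lines of Python. This is a simplified illustration, not WFC's actual algorithm: the function names are hypothetical, and the lookup is done against plain glossary terms rather than asterisked entries.

```python
def stripped_forms(word, max_strip=4):
    """Return the word, then the forms obtained by stripping 1 to max_strip trailing letters."""
    forms = [word]
    for count in range(1, max_strip + 1):
        if len(word) - count < 1:
            break  # nothing left to strip
        forms.append(word[:-count])
    return forms


def find_stem_match(word, glossary_terms):
    """Return the first stripped form of the word found among the glossary terms, if any."""
    known = {term.lower() for term in glossary_terms}
    for form in stripped_forms(word.lower()):
        if form in known:
            return form
    return None
```

For applications, the forms tried are applications, application, applicatio, applicati and applicat, so a glossary containing application is hit on the first stripping attempt. A word like went never reaches go, which illustrates the limitation mentioned above.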

How to load a glossary
Three glossaries can be selected in the WFC/Terminology/Glossary tabs. Click the "Select glossary" button to find and specify the glossary you want to use (WFC glossaries have a TXT extension). Then click the "Reorganise" button to have the glossary sorted and indexed by WFC. You can view/edit the glossary with Ms-Word.

Using glossaries for QA
Check the appropriate option in the QA pane in WFC > Setup. From then on, during a translation session, when the translator validates a translation, WFC will look for each source term in the source segment. If a source term is found in the source segment, WFC will expect to find the corresponding target term in the target segment. If it fails to do so, it will warn the user, giving a choice of editing the translation or ignoring the warning.
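The logic of this check can be sketched as follows in Python. It is a simplified illustration under stated assumptions: the function name `check_terminology` is hypothetical, matching is a plain case-insensitive substring test, and wildcards and fuzzy recognition are ignored.

```python
def check_terminology(source_segment, target_segment, glossary):
    """Return the glossary pairs whose source term is present but whose target term is missing.

    glossary is a list of (source_term, target_term) pairs.
    """
    source_lower = source_segment.lower()
    target_lower = target_segment.lower()
    missing = []
    for source_term, target_term in glossary:
        found_in_source = source_term.lower() in source_lower
        found_in_target = target_term.lower() in target_lower
        if found_in_source and not found_in_target:
            missing.append((source_term, target_term))
    return missing
```

An empty result means the segment passes; any pair returned corresponds to a warning that the translator may act upon or ignore.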

Select/deselect glossaries
Use the "Select glossary" button to select a glossary.

If you want to keep a glossary selected, but don't want this glossary to be active, i.e., if you do not want WFC to perform terminology recognition on this glossary, uncheck the "This glossary is active" checkbox. Otherwise, keep this checkbox checked. This checkbox is automatically checked each time you use the "Select glossary" button.

For propagation to occur, the corresponding "Propagate" command must be activated in Pandora's box.

Thesaurus

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference

Autosuggest synonyms

Add antonyms to synonyms.
 
 
Show synonyms/antonyms only when I press Ctrl+Down.
 
 
I don't use the thesaurus
 

If your version of Ms-Word has a thesaurus function for the target language you use, WFC can make good use of it. To check whether you have a thesaurus for your target language:

The thesaurus function can be useful to translators who engage in transcreation, or any form of creative literary translation, and who like to have a broad choice of terminology in the target language. In literary translation, TMs and glossaries are of limited use, but the help of a thesaurus in the target language is appreciated.

One issue with Ms-Word's thesaurus is that it's a pull feature. In other words, you need to do an action (use a shortcut, or click an icon) to get to the list of synonyms for a selected word. WFC changes this "pull" behaviour to a more productive and comfortable "push" mode. Any time you finish typing a word in the target segment, with the cursor at the end of word, WFC will pop down a list of synonyms. You can then replace your last typed word with one of its synonyms very easily.

To ensure compatibility with older versions of WFC, it is possible to disable the thesaurus feature, and use a third glossary instead. This becomes possible only if both a first and a second glossary are already selected. To replace the thesaurus with a third glossary, click the "I don't use the thesaurus" checkbox and immediately select a third glossary. Note that even in that mode, pressing Ctrl+Down on a word brings up synonyms - in a "pull" and unobtrusive mode.

To re-enable the thesaurus feature, remove the third glossary.

Blacklist

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference
 

This blacklist is active

 


 


Current blacklist:
C:\wordfast\myBL-EN2FR.txt

Number of entries: 34
File size: 2 Kbytes
Date: 2016-01-15 at 18:19:06
 

WFC can check target segments for unwanted words or expressions. As with the glossary feature, the check is not case-sensitive and the * wildcard can be used to end a word. The format is Text-only, in two columns. The second column can be left empty; as an option, it can contain the recommended term that should replace the blacklisted term. There is no AFTR on blacklists.
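A blacklist check of this kind can be approximated with the Python sketch below. It is an illustration only: the function name `blacklist_hits` is hypothetical, and the way the * wildcard is turned into a regular expression is an assumption, not WFC's actual implementation.

```python
import re


def blacklist_hits(target_segment, blacklist):
    """Return the (banned, recommended) pairs whose banned term occurs in the target segment.

    blacklist is a list of (banned_term, recommended_term) pairs; the second item may be empty.
    """
    hits = []
    for banned, recommended in blacklist:
        # A * ending a word is read as "any further letters"; whole words are required otherwise.
        body = r"\w*".join(re.escape(part) for part in banned.split("*"))
        pattern = r"\b" + body + r"\b"
        if re.search(pattern, target_segment, re.IGNORECASE):  # the check is not case-sensitive
            hits.append((banned, recommended))
    return hits
```

With a blacklist containing utili* (recommended: use) and very (no recommendation), the segment "We Utilized every tool." raises one hit, for utili*; "every" is not confused with "very" because whole words are required.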

Find-Replace list

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference
 


 


 


Current blacklist:
C:\wordfast\myFR-list.txt

Number of entries: 20
File size: 1 Kbyte
Date: 2016-01-15 at 18:19:06
 

The FR (Find-Replace) list contains lines that make WFC execute a Find-replace action on the target segment before it is committed.

The first two fields of the list contain the text that one would enter in the "Find" and the "Replace with" fields of Ms-Word's own Find-Replace dialog box, to effect some find-replace action. The Find and Replacement actions are limited to the target segment.

If no replacement text is specified, only a Find action is performed, and the translator is warned if the Find is positive.

FR commands are executed only on new segments, or on text newly added to a target segment. FR actions are not performed on existing segments, unless the translator presses the Ctrl+Alt+H shortcut.

Note that only marked lines are executed. A line is marked if there is a check sign, or a # character appearing to the far left of the line, in the very first, thin column.

FR commands are useful for Quality Assurance purposes, verification, etc. The versatility of Ms-Word's Find-Replace facility makes this feature very powerful, and practically unmatched in other translation tools.

The Find and the Replace texts are exactly what you would write in Ms-Word's Find-Replace dialog box in the Find or Replace fields.
A note (comment) field is offered to comment the line. The next three columns are used to activate three standard Ms-Word Find/Replace switches: the /wc switch turns on the Use wildcards option, the /mc switch turns on the Match case option, the /ww switch turns on the Whole word option. Any text in those fields is taken as activating the corresponding switch. Leave those fields empty to disable the corresponding switch.

The /warn switch, if present in the Note field, prompts the translator for a confirmation before the replacement is done.
When no replacement is required (the Replace argument is empty), the translator is always warned if the command has found the desired text.

You can add basic formatting options to the Find or the Replace fields, such as +{tw4winInternal}, which will be interpreted as a "tw4winInternal" style in the Find or Replace argument. Likewise, +{<b>}, +{<i>}, +{<u>} will be interpreted as bold, italic, or underlined font attributes.

Refrain from having hundreds of active FR lines at any given time: do not use FR as a substitute for machine translation, text processing, etc. FR is offered as a last resort, for example, to convert financial formats, and make up for common typos.

Thoroughly test your FR parameters (using Ms-Word's Find-replace dialog box) on a test file. FR can backfire. The sample list provided when you create a new Find-replace list in the WFC user interface, under Terminology > Find-replace, contains a few examples. Here are a few more examples:

Replace <Tag1> with <Tag2> - but only if they have the tw4winInternal style:
Find: <Tag1>+{tw4winInternal} Repl: <Tag2>+{tw4winInternal}

How to be warned when the target segment reaches 100 signs or characters, including spaces:
Find: ?{100} Repl: [empty] Switch: /warn /wc

Reverse "John David" into "David John" in the target segment:
Find: (John) (David) Repl: \2 \1 Switch: /wc

Replace endashes (–) and emdashes (—) with simple dashes (minus signs, -) in the target segment:
Find: [^0150-^0151] Repl: - Switch: /wc

Force a non-breaking space before :;!? in the target segment (two passes):
Find: ([a-z,A-Z,0-9]) ([\:\;\!\?]) Repl: \1\2 Switch: /wc

Find: ([a-z,A-Z,0-9])([\:\;\!\?]) Repl: \1^s\2 Switch: /wc
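For readers more familiar with regular expressions than with Ms-Word wildcards, the last two examples can be approximated in Python. This is only a sketch for comparison: Python's `re` syntax differs from Ms-Word's wildcard syntax, the function names are hypothetical, and nothing here is executed by WFC.

```python
import re

NBSP = "\u00a0"  # non-breaking space, written ^s in Ms-Word's Replace field


def replace_dashes(text):
    """Replace en dashes and em dashes with simple dashes (minus signs)."""
    return re.sub("[\u2013\u2014]", "-", text)


def force_nbsp_before_punctuation(text):
    """Force a non-breaking space before : ; ! ? in two passes, as in the FR example."""
    # Pass 1: remove an ordinary space found between a letter or digit and the punctuation.
    text = re.sub(r"([A-Za-z0-9]) ([:;!?])", r"\1\2", text)
    # Pass 2: insert a non-breaking space between the letter or digit and the punctuation.
    return re.sub(r"([A-Za-z0-9])([:;!?])", r"\1" + NBSP + r"\2", text)
```

The two-pass approach means that both "Quoi ?" and "Quoi?" end up with exactly one non-breaking space before the question mark. As in the Ms-Word version, only unaccented letters and digits are taken into account before the punctuation sign.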

Reference

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference

External dictionaries:
     
Right-click on a source term brings up the following web glossary:
http://iate.europa.eu/SearchByQuery.do?method=search&query=
 
Reference search folders (Ins/+ to add, Del/- to delete)
C:\wordfast\reference 
 

Select Dictionary (Windows only): WFC can be linked to external dictionaries. You can select an external dictionary application (like Trados MultiTerm™, the Harrap's Shorter™, the Collins™ version 100, Microsoft Encarta™, etc). The Keys button is used to define the keystrokes used to interrogate the dictionary (see the Dictionary section below for details). During a translation session, or at any other time, place the cursor on a word, or select an expression, and press Ctrl+Alt+D or click the Dictionary icon.

Reference
A reference search is like a concordance search, but it is done on any sort of documents (not only TMs). The Ctrl+Alt+N shortcut or the 🔍 icon launches the Reference search from within the document you translate, just like Ctrl+Alt+C launches a concordance search.
The material is usually monolingual. The following formats can be searched by WFC: DOC, RTF, TXT, HTML, SGML, XML, MIF, CSV. Other formats need to be saved as (or converted to) a text format. For example, if you have PDF material, export it as text using PlusTools (a free utility distributed at www.wordfast.net), or copy-paste the contents of the PDF file into a Word document.

Rules for searches are the same as for Concordance search (see above). All Pandora's box commands concerning the behaviour of the Concordance window apply to the Reference window. WFC will run the reference search on all files present in the folder(s) specified for reference material. As with Concordance search, it is possible to use the Escape key (or the same shortcut, i.e. Ctrl+Alt+N) to cancel a search.
Use the Insert (or +) key or the Delete (or -) key to add or remove folder(s) where the "raw material" for Reference search is located.

Concordance

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Glossary 1 Glossary 2 Thesaurus Blacklist Concordance F-R Reference

Search Concordance in the TM.

Search in all sibling TMs
 
Search Concordance in the BTM
 
Search Concordance in the Remote TM
 
Hide Concordance headers (TU properties)
 
  Maximum Concordance results:    50 
 

You can specify which TMs are searched for Concordance.

Search in all sibling TMs. With this option, concordance searches extend to other TMs present in the same folder as the currently active TM.

Hide Concordance headers (TU properties). This option simplifies the display of results by hiding the header information (TU properties) above each TU. It lets you sift through data faster.

Maximum Concordance results: limits the number of hits that are returned. Avoid a large number, as WFC can be overwhelmed by too many hits. Values range from 10 to 500.

Tools

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
Docs (1/4) Clean-up Analyse Translate QA Extract

My Document
Another document.docx
Yet another document.docx
Deliverable document.docx

 

 


document    footnotes    headers/footer    frames

To browse files on disk: close all documents. Right-click: (un)select all.

Before tools are used, the files on which they are used must be selected.

When starting WFC, if documents are already opened in Ms-Word, they will appear in the "Selected files" list. Otherwise (no document opened in Ms-Word when you start WFC), the files present in the current folder are listed. Click the "browse" button to change folder if needed.

Checked files are processed; unchecked files are not processed.

Clean-up deletes all segmentation marks and source segments from the selected files, leaving only the translated text. This is the final step in a translation process, unless the client, usually a translation agency, specifically requires a segmented (bilingual) document. The TM is updated if the target segment has been manually edited after it was created. Manual editing means you edited the segment without actually opening it, or without WFC interaction, or using a different TM.

Note: the Quick-clean icon in the WFC toolbar lets you clean up a document much faster, but without updating the TM, and without producing a report.

Analyse gives an analysis of selected document(s) before translation, reporting the number of segments and words, with the match ratings of the segments in relation to the current TM.
If the document is already translated and segmented, Analyse will not be carried out.

The analysis report created after analysis details the following points:

ANALYSIS-REPORT.docx - Microsoft Word
X

File Edit View Insert Format Tools Table ?

ANALYSIS REPORT 16:48:49 11-22-2007
Scanned: document, footnotes, headers/footers, textboxes.
=========================================================
C:\My Documents\Doc-EN2FR.doc
Match rate     segments     words     char.       %
---------------------------------------------------------
Repetitions           1        10        72     34%
100%                  0         0         0      0%
95%-99%               0         0         0      0%
85%-94%               0         0         0      0%
75%-84%               0         0         0      0%
00%-74%               2        19       140     66%
Total                 3        29       212
=========================================================
Note: The character count includes spaces.

 

Repetitions refers to repetitions found within the document(s) that was/were analyzed (this does not concern the translation memory). For example, if the same sentence (segment) appears 3 times in the set of analyzed documents, the repetition counter will show 2 repetitions.
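The counting rule for repetitions can be expressed in a few lines of Python. This sketch only illustrates the arithmetic; the function name `count_repetitions` is hypothetical and segments are compared as exact strings.

```python
from collections import Counter


def count_repetitions(segments):
    """Count repetitions: every occurrence of a segment after its first one."""
    occurrences = Counter(segments)
    return sum(count - 1 for count in occurrences.values())
```

A segment appearing 3 times contributes 2 repetitions, since its first occurrence still has to be translated.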

Match rate per percentage: this is a comparison made between the segments that are found in the documents and any source segment found in the translation memory that WFC deems analogous.

Segments reported in the "100%" category are not always perfectly identical. In that case, a 100% match is "considered" as such by WFC. WFC may have to overlook case differences, differences in quotes/apostrophes styles, and more important, differences in tags or numbers. WFC computes a sophisticated substitution of numbers, or tags, or quotes, or apostrophes, in order to justify its claims. When the substitution is not totally reliable, WFC makes every effort to detect the ambiguity, and presents the purported "exact match" against a yellow background to raise the translator's attention.

TM rules can be used so that WFC enforces a stricter definition of what a 100% match is.

Note that all character counts include spaces.

Translate will pre-translate the selected document(s), with the use of the current translation memory. Unknown (no-match) segments will be copied over the target segment if you specified "CopySourceWhenNoMatch" in Pandora's box. However, if a link with a machine translation program is activated (see MT), unknown segments will be machine translated.
Once pre-translation is done, start a regular WFC session and translate your document(s) as usual. Work will be faster, because segmentation and matching have already been done. When cleaning up such a document, use the regular clean-up tool, and answer "yes" to the question "Update translation memory?".

Quality Assurance performs a quality assurance scan & report on all selected files; a detailed report is given for each file, with an overall summary of QA errors found on all files. Set up the required QA options in the WFC/Setup/QA check tab before running this tool.

Extract opens all selected documents and extracts all segments into a text document named "WfExtracted.txt". This document in text mode is presented to you so you can save it under a different name and/or folder if needed (save the document as Unicode if your language requires Unicode).

For example, when preparing the extracted text for alignment with PlusTools, this extraction process should be performed twice, once for each set of documents in each of the two languages. Each of the text documents should then be named and saved separately (like "source.txt" and "target.txt") so they can be specified in PlusTools.

The Extract tool also produces a second file named WfRepetitions.txt, located in the same folder as WfExtracted.txt, which contains all segments that were found repeated more than once. This allows a project manager to have repetitions translated before the project starts, and to add these translated repetitions to the TM being distributed to translators. This method ensures consistency across the project, and further cost-cutting.

Setup

General

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Fuzzy threshold=70
End-of-segment punctuation=: . ! ? ^t
Target segment font=
Source segment colour=9 (Dark blue)
Fuzzy target segment colour=0 (No colour)
100% target segment colour=0 (No colour)
Unknown target segment colour=0 (No colour)
Insert the following character(s) after segments=
Protect delimiters=1 (Regular)
Separators for thousands, decimals, date (target language)=

         wordfast.ini | ▼
 

Note: some options must simply be checked or unchecked. Some options must receive a value (a number or some text). This is the case if the option has an equal (=) sign. In this case, click the relevant line, then press Enter on the option to create/edit/delete the value.

Fuzzy threshold=75
This is the minimum percentage for a fuzzy match to be considered fuzzy, and under which it will be considered unknown (or "no-match"). The default value is 75. Values can range from 50 to 99%. Values lower than 75 are not recommended, because you may receive very fuzzy propositions. Remember that the Ctrl+Alt+X shortcut deletes the contents of the target segment (the proposed translation) quickly and safely.

End of Segment Punctuation=. : ? ! ^t ^l
Choose the punctuations that end a sentence. Default values are strongly recommended. The default setting is . : ? ! ^t ^l , where ^t means tabulator and ^l manual line break.

Target segment font
Defines the font used for target segments. This is particularly useful when the target segment cannot use the same font as the source document, like translating from English to Russian, French to Greek, Italian to Hebrew, Chinese, etc.

Colours
These values set the colours that will be applied to the segmented text at validation time. These colours will be reset to the default ("Auto") colour at clean-up time. Whoops: if you started to translate with colours set, and realized after a few segments that you should not have used colours at all (as is the case when the source text has colours that must be preserved in the translated text), please note that, at clean-up time, WFC will reset the cleaned target text to the "Auto" colour, which appears black on most systems. In such a case, enter the parameter "LeaveColours" in Pandora's box to instruct WFC not to reset colours after clean-up.

Insert the following characters(s) after segment=
This option sets the characters, or short text, which can be added right after every segment.
Note the following convention for specifying some special characters:

{space}   space
{tab}     tabulator
&'AA;     any character, where AA is the hexadecimal code of the character; example: &'AB; for ANSI 171
&#00;     any character, where 00 is the decimal code of the character; example: &#171; for ANSI 171
Unicode values are also accepted (ranging from 256 to 65535).
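To make the convention concrete, the Python sketch below expands such a specification into the actual characters. It is an illustration only: the function name `decode_special` is hypothetical, and character codes are read as Unicode code points, which coincide with ANSI (Latin-1) codes for values such as 171.

```python
import re


def decode_special(spec):
    """Expand {space}, {tab}, &'AA; (hexadecimal) and &#00; (decimal) notations."""
    spec = spec.replace("{space}", " ").replace("{tab}", "\t")
    # &'AB; -> character whose hexadecimal code is AB
    spec = re.sub(r"&'([0-9A-Fa-f]+);", lambda m: chr(int(m.group(1), 16)), spec)
    # &#171; -> character whose decimal code is 171
    spec = re.sub(r"&#(\d+);", lambda m: chr(int(m.group(1))), spec)
    return spec
```

Both `&'AB;` and `&#171;` designate the same character, the opening guillemet («), since AB in hexadecimal equals 171 in decimal.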

Protect delimiters
This option sets delimiter protection. The default is "On".

This option does the following:

Protection is designed to help beginners by preventing most common accidents. Note that Ctrl+Alt+F12 toggles protection on or off for the segment you are currently working on. The protection you have set will resume at the next segment. Thus, Ctrl+Alt+F12 allows you to temporarily edit delimiters or tags, which would otherwise be protected or blocked.

Reset
Will reset all settings to WFC's default values.

Reverse all
Reverses the Translation Memories (TM and BTM), as well as the glossaries, in one pass. Note that TMs and glossaries are reversed, keeping their own names: original files are rewritten. If you need to keep a version of those files before they are reversed, you must manually back them up.

Save setup as...
Saves the current setup to an INI file. INI files are saved in the folder where Wordfast.dot is located. This folder is usually Ms-Word's startup folder (if you cannot locate your Ms-Word Startup folder, see the note on hidden folders). Using the browse... option lets you open an INI file anywhere, including network folders or floppy disks.

Segments

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Set Fuzzy threshold=70
End-of-segment punctuation=: . ! ? ^t
Target segment font=
Set target language to TM's target language.
Set target segment language to Word's default language.
A number + an ESP end a segment.
An ESP without a trailing space ends a segment.
An ESP + a space + a lowercase end a segment.
Tag bookmarks before translation
Use Double Strikethrough as untranslatable attribute.
Use Highlight Gray 25% as untranslatable attribute.
Use Marching Red Ants as untranslatable attribute.
Apply AutoFormat to target segments.

Abbreviations
Corp.,Dec.,Dept.,Dr.,D.C.,e.g.,etc.,Jr.,Inc.,i.e.

Set target segment language to TM's target language
WFC's QA options can require that the target segment be spell-checked before validation. If you select this option (the default one), WFC will apply the current TM's target language to each target segment (just as if you were opening the Tools/language... menu and applying a language to the target segment yourself), so that, if spell checking is done, the right language is used.

Set target segment language to Word's default language
If your target language is not in WFC's list of languages, or if for some other reason WFC cannot recognise your specific language: select this option, get back to Ms-Word, set the target language as default language in Ms-Word (menu Tools/Language): WFC will apply that default language to target segments. If you select "leave unchanged", then WFC will not apply a language definition to the target segments during sessions. See Appendix II for a brief discussion on this subject.

A number + an ESP end a segment
Normally, WFC will not consider a number followed by an ESP as ending a sentence. Checking this option will disable this rule.

An ESP without a trailing space ends a segment
Normally, WFC will consider an ESP as ending a sentence only if it is followed by at least one space. Checking this option will disable this rule.

An ESP + a space + a lowercase end a segment
Normally, WFC will consider that an ESP followed by a space and a lower-case letter does not end a sentence. Checking this box will disable this rule.

Tag bookmarks before translation
When a translation session begins on a document that contains bookmarks, this will pop up a reminder that special steps must be taken to have bookmarks tagged before the translation process and that bookmarks must be transferred to the target segment during the translation process.

Use DoubleStrikeThrough as untranslatable attribute
This is a DoubleStrikeThrough text example.
Rather than defining an external style that excludes text from the segmentation/translation process, you can choose a font attribute. Choose one font attribute that defines text not to be translated. Remember to uncheck this feature after use, otherwise, it may remain active and produce unexpected results. This feature slightly reduces segmentation speed.

Use Highlight Gray 25% as untranslatable attribute
This is a Highlight Gray 25% text example.
Same as above. This effect is visible only if Tools/Options/View/Highlights is checked. Important: the use of this text attribute (Highlight Gray 25%) should be limited to documents that have no highlighted text at all before translation.

Use Marching Red Ants as untranslatable attribute
This is a Marching Red Ants text example.
Same as above. This effect is visible only if Tools/Options/View/Animations is checked in Ms-Word.

Protect segment delimiters.
Protects from misuse of Enter, Delete and Backspace. When checked, this option re-routes the deletion keys to a routine that prevents accidental deletion of segment delimiters. This feature is active only when the WFC toolbar is expanded. One limitation is that, while it is activated, using Delete or Backspace inside some of Ms-Word's dialog boxes can cause a problem. This feature does not protect segment delimiters from being overwritten by other means, so one should remain careful anyway.

Apply AutoFormat to target segments.
WFC can perform an AutoFormat series of corrections to any text that WFC introduces in your target segment (rather than text you type). The AutoFormat corrections are set up in Ms-Word's Options > AutoCorrect section, under AutoFormat. They are limited to quotes, dashes, and apostrophes. They are applied to TM propositions, glossary items, or placeables, at the moment WFC places them in the target segment.
Please note that this feature refers to the AutoFormat feature in Ms-Word's options, not to AutoCorrect. AutoCorrect can be activated in Ms-Word, but it applies to what you type, not to what WFC pastes in your target segment.
Special case: If you wish WFC to execute Ms-Word's list of AutoCorrect's "Search for / Replace with" substitutions, you only need to have the replacement's text begin with the two lower-case letters wf.

! Note: Ms-Word's AutoCorrect and AutoFormat settings are defined per language. Whenever you set AutoCorrect, ensure that the current selection has the proper language prior to opening Ms-Word's AutoCorrect dialog box. A text's language is an attribute just like Bold or Italic, except that it is invisible, although it appears in Ms-Word's status bar. While most translators open text in their source language, AutoCorrect should be set on a document, or a selection, with text marked as target language.

Abbreviations
Enter the most common abbreviations in your language. WFC will not end a sentence at a word belonging to this list. Separate abbreviations with a comma:

D.,Dr.,M.,Mr.,Mrs.,P.,Pr.,Pres.

Remember that the Expand function can expand a segment to fit the actual sentence, even if an unknown abbreviation ends the segment too soon, and that Shift+Alt+Down will force WFC to segment the text you selected.
An abbreviation must have fewer than 16 characters.
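
The segmentation rules described in this section can be pictured with a rough sketch. The Python below is only an illustration of the rules as stated here, not WFC's actual routine: the function name split_segments and its parameters are hypothetical, and the tab (^t) of the default ESP list is left out for brevity.

```python
# Illustrative sketch only: names and structure are assumptions, not WFC code.
ESP = set(":.!?")  # end-of-segment punctuation (the tab is omitted here)
DEFAULT_ABBREVIATIONS = {"Corp.", "Dec.", "Dept.", "Dr.", "D.C.",
                         "e.g.", "etc.", "Jr.", "Inc.", "i.e."}


def split_segments(text, abbreviations=DEFAULT_ABBREVIATIONS,
                   number_esp_ends=False,
                   esp_without_space_ends=False,
                   esp_space_lowercase_ends=False):
    """Split text into segments; each flag mirrors one Segments checkbox."""
    segments = []
    start = 0
    for i, ch in enumerate(text):
        if ch not in ESP:
            continue
        rest = text[i + 1:]
        if rest == "":
            break  # the last segment is flushed after the loop
        # Default rule: an ESP must be followed by at least one space.
        if not rest[0].isspace() and not esp_without_space_ends:
            continue
        # Default rule: a number followed by an ESP does not end a segment.
        if i > 0 and text[i - 1].isdigit() and not number_esp_ends:
            continue
        # Default rule: ESP + space + lowercase does not end a segment.
        if (rest[0].isspace() and rest.lstrip()[:1].islower()
                and not esp_space_lowercase_ends):
            continue
        # A word found in the abbreviation list never ends a segment.
        words = text[start:i + 1].split()
        if words and words[-1] in abbreviations:
            continue
        segments.append(text[start:i + 1].strip())
        start = i + 1
    tail = text[start:].strip()
    if tail:
        segments.append(tail)
    return segments
```

With the default flags, "Ask Dr. Smith. He knows." is split after "Smith." but not after "Dr.", because "Dr." is in the abbreviation list.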

Quality Assurance

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Check terminology from Glossary#1
Check terminology from Glossary#2
Check terminology from Glossary#3
Check blacklisted terms
Run the Find-replace list on translations
Find typos in proper names
Warn if character count in target>80
Warn if ending punctuation is different in target
Warn if source/target ratio is >=2

Enforce a strict numeric 'Untranslatable' definition
Enable QA on already-translated segments

This part of WFC is used to set up the actions performed during quality assurance.
If QA is activated during translation, target segments are QA'ed before validation (this is the real-time mode), i.e., immediately after the user has pressed "NextSegment", but immediately before the segment is stored in the TM.

Remember that you can associate your own macro to QA, by entering it in Pandora's Box MacroQualityCheck command. See Appendix III for examples.

If Quality Assurance is started outside a translation session:

In both cases: if the cursor is in the first sentence of the document, WFC will ask whether you want to QA the entire document and produce a report, or QA one segment at a time, stopping at every problem so that you may correct errors step-by-step. If the cursor is not in the first sentence of the document, the second option (step-by-step QA) will be assumed.

Spell and grammar checks are available only in real-time Quality Assurance mode (during translation sessions), not in batch mode, when a report has to be produced.

Check terminology from Glossary #1, 2, 3
This option lets WFC monitor the use of terminology in the translation process. If glossary terms are found in the source segment, the corresponding target term (or at least one of the target terms when one source glossary entry has multiple translations) must be found in the target segment.
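
As a rough illustration of such a terminology check, the sketch below reports glossary source terms that occur in the source segment without any of their translations occurring in the target segment. The function name missing_terms, the dictionary-based glossary and the case-insensitive substring matching are assumptions made for the example; WFC's own matching is not described here.

```python
# Illustrative sketch only: the glossary layout and matching are assumptions.
def missing_terms(source, target, glossary):
    """Return source terms whose translations are all absent from target.

    glossary maps a source term to the list of its accepted target terms.
    """
    src = source.casefold()
    tgt = target.casefold()
    return [term for term, translations in glossary.items()
            if term.casefold() in src
            and not any(t.casefold() in tgt for t in translations)]


glossary = {"translation memory": ["mémoire de traduction", "MT"],
            "glossary": ["glossaire"]}
problems = missing_terms("Open the translation memory and the glossary.",
                         "Ouvrez la mémoire de traduction et le lexique.",
                         glossary)
# problems lists "glossary": none of its translations is in the target.
```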

Check blacklisted terms
This option lets WFC monitor the target segment to make sure no blacklisted terms are used.

Run the Find-Replace list on translations
WFC can maintain a list of find-replace actions to be performed on target segments when translations are committed, usually right before moving to the next segment. Find-replaces are equivalent to a manual use of Ms-Word's own "Find-replace" dialog box.

The Find-Replace operation is executed on:

Find typos in proper names
This feature attempts to spot proper names in the source segment, and verifies whether they are found "as is" in the target segment. False positives are rare but possible. This feature should not be used with languages that inflect or modify proper names.

Warn if character count >=N
WFC can ensure that the length (in characters, including spaces) of the target segment is not greater than a given quantity.

Warn if ending punctuation is different in target
WFC can ensure that both source and target segments have the same ending punctuation. This is useful to verify that the ending punctuation is not missing in target segments - a common typo.

Warn if source/target ratio >=N
This verification will statistically detect possible cases where the target segment is empty, or nearly non-existent.
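
The three length-related warnings above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name length_warnings is hypothetical, the ratio is assumed to be the source length divided by the target length (in characters), and an empty target is assumed to trigger the ratio warning.

```python
# Illustrative sketch only: thresholds follow the dialog defaults shown above.
ENDING_PUNCTUATION = set(":.!?")


def length_warnings(source, target, max_chars=80, max_ratio=2.0):
    """Return the list of warnings raised for one source/target pair."""
    warnings = []
    # Character count of the target, spaces included.
    if len(target) > max_chars:
        warnings.append("character count")
    # Compare the final punctuation mark, if any, of both segments.
    src, tgt = source.strip(), target.strip()
    src_end = src[-1] if src and src[-1] in ENDING_PUNCTUATION else ""
    tgt_end = tgt[-1] if tgt and tgt[-1] in ENDING_PUNCTUATION else ""
    if src_end != tgt_end:
        warnings.append("ending punctuation")
    # A target that is empty, or much shorter than the source, is suspect.
    if not tgt or len(src) / len(tgt) >= max_ratio:
        warnings.append("source/target ratio")
    return warnings
```

For instance, a target reduced to a single word for a full source sentence would raise both the ending-punctuation and the ratio warnings.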

Warn if first source/target letters have different case
This verification will statistically detect cases where the first letters of the source and target segments have a different case. It is mostly used to detect a missing uppercase initial letter when one should be expected. False positives are rare, because the source segment is taken as a clue to the expected case of the target's initial letter.

Source untranslatables must be found in target
Untranslatables are figures (any combination of figures and letters, or any contiguous series of figures and letters, are considered as untranslatables). URLs and email addresses, as well as fields (with the exception of index fields and hyperlinks), are also considered as being untranslatable. WFC can ensure that all source untranslatables are found in the target segment.

Target untranslatables must be found in source
Same as above - but this time, the other way.

Source tags must be found in target
WFC can ensure that source internal tags are found in the target segment.

Target tags must be found in source
WFC can ensure that target internal tags are found in the source segment.

Identical bookmarks
WFC can check whether there is the same number of bookmark markers (brackets like [ or []) in source and target segments. This is useful when the client wants source bookmarks to be transferred into the translated text. If bookmarks must be preserved during translation, please refer to the special section on bookmarks.

Enforce a strict numeric "Untranslatable" definition
QA attempts to spot discrepancies in source/target numbers. A "number" or "numeric placeable" here means any series of characters that contains at least one figure. The problem is that translation may alter those numbers (23rd turning into 23ème, or 23ste), so that QA will bring up "false positive" warnings. Without this option, both 23 and 23rd will match 23, 23ème, and 23ste. With this option checked, 23 will only match 23, and 23rd will only match 23rd.
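
The difference between the two modes can be sketched as below. This is only an illustration of the behaviour described above, with hypothetical function names: in the lenient mode only the figures of each numeric token are compared, while in the strict mode the whole token must be identical.

```python
import re


# Illustrative sketch only: tokenization by \w+ is an assumption.
def numeric_tokens(segment, strict):
    """Collect tokens containing at least one figure, sorted for comparison."""
    tokens = [t for t in re.findall(r"\w+", segment)
              if any(c.isdigit() for c in t)]
    if strict:
        return sorted(tokens)
    # Lenient mode: keep only the figures, so 23rd and 23ème both become 23.
    return sorted("".join(c for c in t if c.isdigit()) for t in tokens)


def numbers_match(source, target, strict=False):
    """True if source and target carry the same numeric untranslatables."""
    return numeric_tokens(source, strict) == numeric_tokens(target, strict)
```

With strict=False, "the 23rd floor" and "le 23ème étage" are considered a match; with strict=True they are not.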

Enable QA on already-translated segments
This option will force WFC to QA existing segments. That may be necessary when working on a document that has already been segmented by the client. In other cases, systematically running QA when reopening an existing segment (which has already been translated and QA'ed) can slow down WFC.

View

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Zoom the Ms-Word window
Use 'normal' view
Zoom the document window
Hide ruler
Wrap text to window (only if 'Use normal view' is checked)
Keep only Standard, Formatting, & Wordfast toolbars
Large Wordfast dialog boxes
Remember Wordfast setup tab position
Document's zoom factor=150

This dialog box is used to optimize Ms-Word's view and display parameters when translation begins, to ensure a comfortable visual environment. Do not underestimate this part. Visual strain coming from a mediocre work setup takes its toll on translators. Unfortunately, it can take years before one realizes the strain he/she has put on his/her eyes. The following parameters will set up your display and view environment every time you start a translation session. However, if the "When translation starts..." checkbox is unchecked, WFC will leave the view and display setup unchanged, except for hidden text, which needs to be visible.

Zoom the Ms-Word window
It is recommended to zoom (maximise, or enlarge) the Ms-Word window for resolutions up to 800x600 (i.e., VGA & SVGA). For higher resolutions (XGA, UXGA), you should decide what's best for your eyes, based on physical screen size (15, 16, 17 inches etc).

Zoom the document window
Recommended at all times with Word 97, but then again, you may need to override this function if you have to use multiple documents.

Text zoom=
WFC will propose a zoom factor of 120 (for resolutions up to SVGA) and 140 for higher resolutions, for optimum visibility. Of course, this is based on a normal text sized 10 to 12. You may have to adjust this parameter for other text sizes.

Do not show spaces
Since segmentation requires all hidden characters to be shown, I found that not displaying spaces is quite a relief, because those little dots are really tiring. But then again, if you have to pay special attention to unbreakable spaces, for instance, you may need to switch this off (i.e., show spaces).

Use normal view
If you have a high resolution (say SXGA, 1280x1024) and a 17" monitor, plus a fast machine, the Page view can be considered although it is not recommended. Page view forces constant repagination, offers a hectic scrolling from page to page and quickly exhausts your system's resources. It is better to occasionally use Page view for those rare documents where the page layout is of prime importance and falls in the translator's responsibility. In all other cases (and even with a fast machine and a big screen), normal view is still, by far, much more comfortable, especially when jumping from page to page, scrolling through long documents etc. Normal view offers a much smoother scrolling, and a jumpy scrolling really damages eyesight and causes migraines. However, turn this switch off (i.e., leave view mode unchanged) if page layout and design is a must.

Wrap text to window
This feature is essential (but available only in normal view) to avoid scrolling horizontally every time a line is wider than the screen.

Hide ruler
In most cases, the ruler is not essential, but takes up space, which is a problem on small screens. Override this if required.

Keep only Standard, Formatting & WFC toolbars
Use this function if you wish to automatically hide unwanted toolbars taking up space.

Remember that you can manually modify your display options during a translation session using Ms-Word's own Tools/Options/View (Or Preferences/View with most Macs).

Note that WFC forces the display of hidden text, because using WFC without hidden text visible can be dangerous, since delimiters would be invisible and easily deleted.

AS (AutoSuggest)

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Segments from Translation Memory
Segments from Machine Translation
Subsegments from Translation Memory
Subsegments from Machine Translation
Placeables =Case,CASE,123,{[()]},wo@#rd
Terminology from glossaries: trigger level =2
Terminology from Machine translation
Conversions US > Metric
Synonyms
Escape triggers AS
Order matches in AS:
Paste top-rated segment from RT,MT,BT,TM  

AutoSuggest (AS) suggests TM matches and placeables in a drop-down list. You can continue typing and ignore the suggestion, or press Enter to "grab" the suggestion. Suppose the source segment contains proper names like "Zbigniew Brzezinski" or "Grossbliederstroff": AS will suggest those names as soon as a capital Z or G is typed. Speed and reliability are the two advantages of AS.

TM and MT propositions can be proposed, as well as "placeables". Placeables comprise known terminology ("known" terminology is highlighted in the source segment if a glossary is active and contains one or more source terms), and elements in the source segment that could be untranslatable.

Segments from the Translation Memory
This option will pop up a list of TM matches, 100% or fuzzy, as soon as a new segment opens. Other shortcuts and methods (Alt+left/right to cycle through the various segment propositions, if applicable, or Ctrl+Alt+M to display more detail from the TM) remain effective.

Segments from Machine Translation
If MT is set up in the Translation Memory / MT pane, machine-translated segments will appear in the AS drop-down list.

Subsegments from Translation Memory
Subsegments from the TM will be proposed whenever possible.

Subsegments from Machine Translation
Subsegments from MT will be proposed whenever possible.

Placeables = Case,CASE,123,{[<word>]},wo@#rd
If WFC senses that some source terms are likely to be untranslatables, they will be suggested as soon as you begin to type them in the target segment. This setting fine-tunes how placeables are defined. The default setting is made of items separated with commas, which can be edited or removed, and have the following meanings:

Terminology from glossaries
If you begin to type two or more letters that match a source or target terminology that WFC has recognized in the source segment, a suggestion for AutoSuggest will pop up.
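
The trigger level can be pictured with a small sketch: nothing is suggested until the number of typed letters reaches the trigger level, after which every recognized term starting with those letters is listed. The function name suggest and the case-insensitive prefix comparison are assumptions made for the example, not a description of WFC's internals.

```python
# Illustrative sketch only: prefix matching rules are assumptions.
def suggest(typed, candidates, trigger_level=2):
    """Return the candidate terms that start with the letters typed so far."""
    if len(typed) < trigger_level:
        return []  # not enough letters typed yet
    prefix = typed.casefold()
    return [c for c in candidates
            if c.casefold().startswith(prefix) and c != typed]


terms = ["translation memory", "translator", "glossary"]
one_letter = suggest("t", terms)     # below the trigger level
two_letters = suggest("tr", terms)   # both terms beginning with "tr"
```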

Synonyms
AS can list synonyms for words in the target segment. Synonyms are provided by Microsoft Word's thesaurus if it is correctly set up. The thesaurus is outside Wordfast's scope, and the hotline cannot help if it is not installed. Wordfast simply looks up the thesaurus and lists synonyms so they are at your fingertips. As you type a target segment, WFC routinely checks for new words, and builds up the AS lists.
Synonyms can be numerous, and AS can become overwhelming. If the feature feels like a counter-productive attention grabber, uncheck the AS synonyms feature, and check it under Terminology/Other. In that case, synonyms are less ubiquitous, but still present, either in the status bar, or in the Terminology, or QA, companions.

Esc triggers AS (not recommended)
This option disables AS in a segment until you press Escape. It is necessary with one specific version of a popular dictation software, now obsolete.

Paste top-rated match:
This option pre-fills the target segment with the highest-rated match, among all available sources: TM, BTM (BT), Remote TM (RT), and Machine Translation (MT). Note that if a remote TM is used, and the remote TM needs a delay to respond, this option will wait for the reply from the remote TM. Also note that a proposed translation will be pasted (prefilling the target segment) only if the target segment is empty.

Order matches in AS:
This option specifies the order of translation propositions in the AS drop-down list (it has no effect on placeables, only on target propositions).

UI (User Interface)

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB

Icons and menu items    Enable custom shortcuts
NextSegment    Alt+Down
CopySource    Alt+Insert
ExpandSegment    Alt+
ShrinkSegment    Alt+
EndTranslation    Alt+End
--> ProvisionalSegment    F10
--> RestartSession    Alt+Home
--> PreviousSegment    Alt+Up

Show notifications in status bar


This section lets you choose which icons are displayed in the toolbar, and which shortcuts are used. Check or uncheck icons.

To change a shortcut, click on the relevant line, then press Enter. The only available key definitions are listed below. They can be associated with CTRL, ALT, SHIFT with a plus sign (+). No space must be used:

Up, Down, Left, Right, PageUp, PageDown, Home, End, Esc, Enter, F1...F12, A...Z, NumericPlus, NumericMinus, BackSpace, Delete, Insert, Tabulator, SpaceBar.
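
The shortcut syntax described above (optional CTRL, ALT, SHIFT modifiers joined to one key with a plus sign, no spaces) can be checked with a sketch like the one below. The function name is_valid_shortcut is hypothetical, and comparing names without regard to case is an assumption; WFC may be stricter.

```python
import string

# Illustrative sketch only: case-insensitive comparison is an assumption.
MODIFIERS = {"ctrl", "alt", "shift"}
KEYS = {name.lower() for name in (
    "Up", "Down", "Left", "Right", "PageUp", "PageDown", "Home", "End",
    "Esc", "Enter", "NumericPlus", "NumericMinus", "BackSpace", "Delete",
    "Insert", "Tabulator", "SpaceBar")}
KEYS |= {f"f{n}" for n in range(1, 13)}   # F1...F12
KEYS |= set(string.ascii_lowercase)       # A...Z


def is_valid_shortcut(shortcut):
    """Check a definition such as Alt+Down or Ctrl+Alt+M."""
    if " " in shortcut:
        return False  # no space may be used
    *modifiers, key = shortcut.lower().split("+")
    return (key in KEYS
            and all(m in MODIFIERS for m in modifiers)
            and len(set(modifiers)) == len(modifiers))
```

An incomplete definition such as "Alt+" is rejected because no key follows the plus sign.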

If you run a default setup without custom shortcuts, Wordfast loads faster. In this case, leave the "Enable custom shortcuts" checkbox unchecked.

Show notifications in status bar
By default Wordfast displays useful information in gray text between source and target segments. You may prefer those notifications to appear in Word's status bar.

Note that notifications can be fine-tuned using Pandora's Box further settings (see below).

PB (Pandora's box)

Wordfast (c:\wordfast\wordfast.ini)
X
Translation Memory Terminology Tools Setup ?
General Segments QA View AS UI PB


Enable PB



Commands are case-sensitive. Commands that contain an underscore are disabled, like Need_ForSpeed.
Auto_Complete=NoHeaders
Allow_EmptyTarget
Break_DownTags
Capitalize_FirstTargetLetter
CleanUp_OnlyBookmarks
ConcordanceCloseAfterCopy
Concordance_Dialog=Always
Concordance_MaxHits=10
Concordance_NoHeaders
ConcordanceSearch=ExactExpression
Concordance_Search=ExactExpression
Concordance_Search=ExactWord
Concordance=Source_Only

WFC tries to cover the essential needs of everyday translation, but there are countless special situations that require specific features. Rather than multiplying endless setups with buttons and checkboxes, a raw but efficient "un-natural" interface is used to activate some rarely used features. Just enter one of the following commands in Pandora's box to obtain a particular behaviour from WFC. Commands are separated with a paragraph mark (use Shift+Enter to enter paragraph marks).

Important: PB commands are case-sensitive: use them the way they are produced when clicking the "Commands" button, or the way they are written in this manual. Adding or removing the _ (underscore) character makes them inactive or active. The underscore character can be located anywhere within the command. Thus,

AllowEmptyTarget is active;
Allow_EmptyTarget is not active;
AllowEmpty_Target is not active;
_AllowEmptyTarget is active because the underscore is not within the command;
Allowemptytarget is ignored because its case is not correct.
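
The five cases above can be summarized in a short sketch. It is only an illustration: the function name command_status is hypothetical, and the set of known commands is a small sample rather than the full list.

```python
# Illustrative sketch only: KNOWN_COMMANDS is a small sample, not the full list.
KNOWN_COMMANDS = {"AllowEmptyTarget", "SegmentAll", "NoPrompts"}


def command_status(line, known=KNOWN_COMMANDS):
    """Classify one Pandora's Box line as active, inactive or ignored."""
    # A leading underscore is not "within" the command, so it is dropped.
    command = line.strip().lstrip("_")
    if "_" in command:
        return "inactive"  # an underscore inside the command disables it
    name = command.split("=", 1)[0]
    # The comparison is case-sensitive, as stated above.
    return "active" if name in known else "ignored"
```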

Right-clicking the list of commands will toggle between the display of all commands, and only active commands.

The syntax of these commands is complex. Thus, rather than writing them into, and deleting them from, Pandora's box, you may consider leaving them in Pandora's box. To turn off a command, just insert an underscore in the command. In other words, SegmentAll is active, but Segment_All is not active. The underscore can be positioned anywhere inside the command.

AncillaryFiles=KeepWithParent
Ancillary_FilesFolder=
Ancillary files are non-critical files that are generated by WFC for every TM or glossary. Those are indexes, or backups. By default, these files are kept in the same folder where wordfast.dot resides (usually, Word's STARTUP folder). You can opt for having those files in the same folder as their parent files.
Ancillary_FilesFolder= will specify a custom folder for ancillary files.
AllowEmptyTarget
Allows WFC to validate a segment with an empty target. Empty targets do not pose any particular problem, but in regular mode (especially for beginners), there's a warning that prevents the user from validating an empty segment.
AllowSpecialCharacters
Use this command if you notice that typing special characters, such as diacritics with the help of the Ctrl or Alt keys, is not possible when WFC is in operation.
CapitalizeFirstTargetLetter
This command is useful when a dictation ("Voice Recognition", VR) program is used, and the VR program fails to capitalize the first letter of a dictated sentence. When the segment is committed, WFC will make sure the target segment's first letter is uppercase.
ConcordanceCloseAfterCopy
Closes the concordance search window when you use the Alt+F12 shortcut (copy-paste into target segment).
ConcordanceMaxHits=X
Where X is a number. Limits the number of concordances found to X. The maximum value is 4096.
ConcordanceNoHeaders
Turns off the display of TU creator, date and attributes when displaying concordances, so that more entries can be visible on one page.
ConcordanceSearch=X
where X can be All, Source or Target
During sessions, if you select a term in a source segment, WFC will execute a concordance search in the TM's source segments only. If you select a term in a target segment, WFC will execute a concordance search in the TM's target segments only.
ConcordanceSearch=All will force WFC to search all segments (source and target), regardless of where you selected a term.
ConcordanceSearch=Source will force WFC to search only source segments of the TM.
ConcordanceSearch=Target will force WFC to search only target segments of the TM.
If this command is not enabled, WFC searches concordances in target segments only.
CopySearchWord
Copies into the clipboard the term that is selected for search when a Concordance or Reference search is done.
CopySourceWhenNoMatch
Is equivalent to using the Copy source icon in WFC when no match is proposed by the translation memory.
Custom_DataEditor=
CustomDataEditor=NotePad
CustomDataEditor=Word
CustomDataEditor=C:\...\app.exe
Defines the application that opens a TM or a glossary when you right-click the "Select TM" or "Select Glossary" button in the WFC user interface. "NotePad" is recommended. Excel is recommended to edit and save glossaries, but not for TMs.
FirstKeyControl
FirstKeyShift
With some versions of Ms-Word 2003, the very first character typed after a segment opens is "mute". Enable any of these commands (only one at a time) to cure the problem. They simulate the press of a "mute" key to work around the problem.
FirstKeyDNS
Solves a problem caused by some versions of the DNS dictation software that makes the cursor jump a few lines up or down after a segment opens.
KeepTemplate=addin.dot
When you expand the WFC template, WFC de-activates any template or add-in found in Tools/Templates & Add-Ins. Many templates have shortcuts or macros that conflict with WFC's. If you want to keep a template which can work together with WFC, enter its name. The example provided here would keep the template named "addin.dot" active together with WFC. To keep all templates, use KeepTemplate=All. This setting, however, may cause problems with templates that rely on shortcuts used by WFC.
KeepLineSpacing
When translating a table, if the table row has a fixed line spacing (or "height"), it may be impossible to display an opened segment. Opened segments use 7 lines, and the cell may be too narrow, vertically speaking, to display the segment. WFC dynamically sets line spacing to "Automatic" in tables, and restores it to its original value when the segment is closed. This command disables this behaviour.
LeaveColours
At clean-up time, if colours were specified in WFC/Setup/General, colours are reset by applying the "Auto" colour to the entire document. This option inhibits this general colour reset.
LinkSetupToDocument
It is possible to link documents (but only documents of Ms-Word's native format, DOC) to a particular setup. If this is done, and a later translation session is opened with a different setup, WFC will issue a warning. This warning gives you the choice of using the new setup (the document's link will then be modified accordingly), or loading the original setup. The same warning will be issued at cleanup time, on some conditions. Use the WFC menu option "Unlink" on a document to unlink it, or "Relink" to re-link it. See the important note below.
LinkTMToDocument
It is possible to link documents (but only documents of Ms-Word's native format, DOC) to a particular TM. If this is done, and a later session is opened with a different TM, WFC will issue a warning. This warning gives you the choice of using the new TM (the document's link will then be modified accordingly), or loading the original TM. Use the WFC menu option "Unlink" on a document to unlink it, or "Relink" to re-link it. The same warning will be issued at cleanup time, on some conditions.
! Note on the "Link" settings. The "Link" feature stamps documents with a marker that links them to a TM or Setup, when opening a translation session. Any WFC, with any setting (even if the two "Link" settings are not checked), will issue a warning if another translation session is started on the linked document with a different TM or setup. Cleanup, however, will issue a warning only if the WFC/Tools/Cleanup button is used, "Update TM" is required, and the corresponding WFC "Link" setting is currently checked. In other words, a linked document will trigger a warning at all times when starting a translation session, regardless of the local and current WFC setup, but the same document will trigger a warning at cleanup time only if the local and current WFC setup's "Link" option is checked. The reason is that many translators translate, but send uncleaned documents to the client or agency, and the cleanup is performed there. This prevents cleanup on a different computer (like the client's or the agency's) from triggering the warning.
MacroEndSession=XXX
Where XXX is an Ms-Word macro name. The EndSession macro is executed when a translation session ends, with Alt+End, or when closing a segment in any other way.
MacroPreSegmentation=XXX
Where XXX is an Ms-Word macro name. The PreSegmentation macro is executed when a segment is opened, right before the segment is turned over to the translator for translation or editing. See Appendix III for more info on macros.
MacroPostSegmentation=XXX
Where XXX is an Ms-Word macro name. The PostSegmentation macro is executed when the translator "closes" a segment, immediately before closure. "Closing" a segment happens if you press Alt+Down on an opened segment, or Alt+Up, or Alt+End, or any other shortcut that closes the currently opened segment. That macro is typically meant to check for errors and warn the user. See Appendix III for more info on macros.
MacroMaiden=XXX
Where XXX is an Ms-Word macro name. The Maiden macro is executed only once, the very first time a WFC translation session is started on a document. If your macro ends with a Visual Basic "End" instruction, this will also halt the WFC translation session opening, and the document will remain "virgin". Running the WFC menu "Misc/Unlink" option renders the active document "virgin" again. See Appendix III for more info on macros.
MacroRetire=XXX
Where XXX is an Ms-Word macro name. The Retire macro is executed right before a clean-up is attempted on a WFC-translated document. See Appendix III for more info on macros.
MacroStartSession=XXX
Where XXX is an Ms-Word macro name. The StartSession macro is executed when a translation session begins.
MacroQualityCheck=XXX
Where XXX is an Ms-Word macro name. The QualityCheck macro is executed right before a MacroPostSegmentation, that is, when the translator closes an opened segment (by using Alt+Down, Alt+Up, Alt+End, or any other shortcut that closes the currently opened segment). See the note on QA macro interactive mode. See Appendix III for more info on macros.
NoPrompts
Inhibits prompts when:
Using "RestoreSegment".
Using "ForceWriteSegment".
Translating in "Page" or "Print" view (prompt to use Draft view).
Validating (committing) a segment where the target text is empty.
NoPromptToSaveIni
Inhibits prompts to save settings when closing the WFC setup window. All changes are saved automatically.
NoSendKeys
WFC sends a dummy Control key after opening a segment on Ms-Word 2003 because of a VBA bug. Use this command to prevent this behaviour, if you have no problem opening segments with Word 2003. The most common symptom of the problem is that the first character you type remains blank, but only with an unpatched Word 2003.
OptionalTags
example:
OptionalTags=< >,<"
Enter a list of tags, separated with commas, after the equal sign.
These tags are ignored when WFC performs QA to verify that tags are identical in source and target. See the section on tagged documents for more information.
ProcessQuotes=147,148
This command will force WFC to always use the required quotes when proposing a possible target segment, regardless of what sort of quotes are in the translation memory. Possible values are:
ProcessQuotes=171+160,160+187 will force French-style quotes (with the required unbreakable spaces).
Mac syntax: ProcessQuotes=199+202,202+200
ProcessQuotes=147,148 will force curly double quotes (up)
Mac syntax: ProcessQuotes=210,211
ProcessQuotes=145,146 will force curly single quotes
Mac syntax: ProcessQuotes=212,213
ProcessQuotes=132,147 will force curly double quotes of another sort (up/down). PC only.
Mac: no equivalent, but note that 227 is for closing curly double quotes.

ProcessQuotes=34,34 will force straight quotes as in "example"
ProcessQuotes=Source will replicate the source segment's quote style

Note: in case isolated segments should not receive the quotes you specified, but re-use the source segment's quotes (this may be the case for technical parameters), use the Ctrl+Alt+U shortcut to copy source quotes to the target segment.
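
To see which characters the numeric values stand for, the following sketch decodes a ProcessQuotes value into its opening and closing quote strings. It assumes that the numbers are character codes in the platform's legacy code page (Windows-1252 on PC, Mac Roman on Mac); the manual does not state this explicitly, and the function name parse_process_quotes is hypothetical. The special value Source is not handled here.

```python
# Illustrative sketch only: the code-page interpretation is an assumption.
def parse_process_quotes(value, encoding="cp1252"):
    """Turn a value such as 171+160,160+187 into (opening, closing) strings."""
    opening, closing = value.split(",")

    def decode(spec):
        # Codes joined with + form one multi-character quote, e.g. « + space.
        return bytes(int(code) for code in spec.split("+")).decode(encoding)

    return decode(opening), decode(closing)


french = parse_process_quotes("171+160,160+187")
# french is the pair of French quotes, each with its unbreakable space.
french_mac = parse_process_quotes("199+202,202+200", encoding="mac_roman")
# Under these assumptions the Mac syntax decodes to the same characters.
```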
ProcessApostrophes=39
Similar to ProcessQuotes. This command will force a certain style of apostrophes, regardless of what the TM has. Possible values are:
ProcessApostrophe=39 will force straight apostrophes as in l'exemple
ProcessApostrophe=146 will force curly apostrophes as in l’exemple
Mac syntax: ProcessApostrophe=213
ProcessApostrophe=Source will replicate the source segment's apostrophe style
Ctrl+Alt+U will replicate the source segment's apostrophe style.
ProcessDashes=45
Similar to ProcessQuotes. This command will force a certain style of dashes, regardless of what the TM has. Possible values are:
ProcessDashes=45 will force simple dashes (minus sign) as in attaché-case
ProcessDashes=150 will force the endash (short) as in attaché–case
Mac syntax: ProcessDashes=208
ProcessDashes=151 will force the emdash (long) as in attaché—case
Mac syntax: ProcessDashes=209
ProcessDashes=Source will replicate the source segment's dash style.
Ctrl+Alt+U will replicate the source segment's dash style.
Propagate1
When using CopySource, all recognized terminology (if terminology recognition is turned on) in the target segment is replaced with its translation. This command is also active with the "Translate" tool, but only for unknown segments, which are replaced with the source segment using the CopySource function. This command uses glossary #1. This command is often associated with CopySourceWhenNoMatch. When a propagate command is used, the Alt+Insert (Alt+S on a Mac) shortcut has a toggling effect between A. CopySource and propagate; B. Just CopySource without propagation. Important note: if propagation must be active during the pretranslation of documents (using WFC's Translate tool), see the command "ToolsTranslateWithTR" further below.
Propagate2
Propagate3
Same as above, but using glossary #2 or #3. The three commands can be used together.
PropagateAndHighlight
When propagation is done, propagated terms in the target segment are highlighted.
PropagateCase=X
Where X can be 0, 1, 2, or 3.
0 is the default setting: the glossary's case is propagated as it is.
1 forces a propagation of the target term in all lower-case.
2 forces a propagation of the target term in all upper-case
3 tries to re-use the source term's case.
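
The four PropagateCase settings can be pictured with the short Python sketch below. The function propagate_case is hypothetical and only mirrors the description above. In particular, the rule used for setting 3 (copy an all-capitals or initial-capital pattern from the source term) is an assumed heuristic, since the manual only says that WFC "tries to re-use" the source term's case.

```python
def propagate_case(target_term, source_term, mode=0):
    """Apply a PropagateCase-like setting to a glossary target term."""
    if mode == 1:
        return target_term.lower()          # all lower-case
    if mode == 2:
        return target_term.upper()          # all upper-case
    if mode == 3:
        # Assumed heuristic: mirror the overall case pattern of the source term.
        if source_term.isupper():
            return target_term.upper()
        if source_term[:1].isupper():
            return target_term[:1].upper() + target_term[1:]
        return target_term.lower()
    return target_term                      # mode 0: glossary case kept as it is


# Source text has "Memory" at the start of a sentence; glossary has "mémoire".
print(propagate_case("mémoire", "Memory", mode=3))
```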
PropagateInReverse
Propagates terms in reverse order (useful for language pairs that have a reverse syntax order), when PropagateOnlyKnown (see below) is activated.
PropagateMethod=[],many,add
Determines the method for the propagation of recognized terms. The first two characters (here, [ and ]) specify the characters that are added around propagated terms. The many switch determines whether all possible glossary entries are propagated, in case the glossary has multiple entries for the same source term (i.e., one given source term is repeated, with different target terms). The add switch determines whether propagated terms are added to the target segment, or whether they replace the target term, which is the regular method.
PropagateOnlyKnown
Normally, propagation will be done on a copy of the source segment. In contrast, this command will insert all known terminology (separated with a space) in the empty target segment. When this command is active, the CopySource (Alt+Ins on a PC, Alt+S on a Mac) shortcut toggle effect will have three states: A. CopySource and propagate in the desired order (see above); B. CopySource and propagate in the opposite order; C. Just CopySource with no propagation.
PropagatePlusSpace
PropagatePlusSpaces
When propagation is done, this command adds a space after the propagated term, if no space is found after the term.
In the plural form, PropagatePlusSpaces also adds a space before a propagated term, if no space is found there.
PropagateWhole
If a recognised single term ends with a wildcard, the whole word is replaced, rather than just its root. Thus, if the glossary has affect* = affecter and the source text has affection, the final result will be affecter rather than affection.
ReportFolder="C:\MyFolder"
This command tells WFC in which folder the various reports (Cleanup, Analyse, Translate, etc.) should be saved. If Cleanup, Analyse, or Translate fails, make sure this setting points to a valid folder.
ReportMany
Normally, all reports (for the Cleanup, Analyse, Translate functions) have the same name, and new reports overwrite previous ones. This command instructs WFC to add a time stamp in the report's name, so that they all have unique names.
QuickAccess_Toolbar=Hide
With Word for Windows 2007 and above: hides the Quick Access (QAT) toolbar.
ReportWithTabs
This command instructs WFC to separate elements of the report with tabs rather than spaces, so that they can be copied into an Excel worksheet.
SegmentAll
Normally, WFC does not segment isolated numbers, or other pieces of text that do not contain any alphabetical letter. This command forces WFC to segment everything.
Segment_Style=Bright
Segment_Style=Transparent
Segment_Style=VGA
This gives the opened segment different styles or shades. If you are not happy with the way segments appear on your monitor, or if your monitor's colours are washed out, you may find a better setting here.
SetReference=Paragraph
When a search for Reference is done, results are limited to the sentence where the searched expression is found. This command displays the entire paragraph.
ShowMessages=
Welcome
Session
InSegment
ToolBar
%PR
%PRW
INI, LANG, TM, BTM, MT, WFA, WFS, TU, GLO, TERM
Welcome: shows a brief message in Ms-Word's status bar when the WF toolbar is expanded, reminding of the current INI and setup.
InSegment: shows session messages between source and target segment (recommended).
Session: shows messages during a translation session.
ToolBar: shows session messages in a toolbar. Only available with Word versions 97 to 2003.
%PR: shows an estimate of the translation progress in the current document. The estimate can only be accurate during a top-down (regular) translation session. Ctrl+F5 refreshes all counters and gives a better translation progress estimate.
%PRW: the progress estimate includes a wordcount.
INI, LANG, TM, BTM, MT, WFA, WFS, TU, GLO, TERM: includes a reminder of those values in the messages.
If this command is not used, the default value is
ShowMessages= Welcome,InSegment,Session,TU,TERM
ShowMemoryAtStart
This command enables TM display (for exact or fuzzy matches) from the start of the session. It's equivalent to clicking the "Memory" icon right after starting a translation session.
ShowMemoryIf<100
Will display the contents of the TM above the currently opened segment if the match rate (the match percentage) falls within the specified range (here it is "<100"). You could use ">80" or "<99" as well: the operator can be < or > and the value can be any number.
ShrinkInternalTags
Shrinks internal tags to a short, numbered tag system to shorten segments with long tags and make them more legible. This is done as a visual aid; the actual tags are preserved.
SkipSegment>99
SkipSegment<80
When manually translating an already segmented, bilingual document (using Alt+Down), all segments that have a match rate higher than 99 (or less than 80 in the example) will be skipped. Other values can be specified, for example, SkipSegment>95.
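
Commands such as ShowMemoryIf<100 and SkipSegment>99 combine a name, an operator (< or >) and a number. The Python sketch below shows one way to read such a command and test a match rate against it. The function names are hypothetical, and the strict comparison is inferred from the wording "higher than 99 (or less than 80)"; this is not WFC's own code.

```python
import re


def parse_threshold(command):
    """Split e.g. 'SkipSegment>99' into (name, operator, value)."""
    match = re.fullmatch(r"(\w+)([<>])(\d+)", command.strip())
    if match is None:
        raise ValueError(f"not a threshold command: {command!r}")
    name, operator, value = match.groups()
    return name, operator, int(value)


def rate_matches(command, match_rate):
    """True if match_rate satisfies the command's condition (strict comparison)."""
    _, operator, value = parse_threshold(command)
    return match_rate > value if operator == ">" else match_rate < value


# With SkipSegment>99, only segments with a 100% match would be skipped.
print(rate_matches("SkipSegment>99", 100), rate_matches("SkipSegment>99", 99))
```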
TMX_TW4WIN
Produces a TMX export that's compatible with Trados TWB version 2.0.
ToolsTranslateSkipUnknown
Skips (does not segment) unknown segments when WFC's Tools/Translate tool is being used.
UpdateWithQuickClean
Before a Quick-clean operation, you will be asked if you just want to update your TM, without cleaning-up. If you say no, you can go on and proceed with Quick-clean anyway, so the regular use of Quick-clean is not affected.
WaitForMT=X
Where X is, for example, 5. Instructs WFC to pause X seconds while a segment is being machine-translated.
WfToolbarPosition=A,B,C
WfToolbarPosition=1,0,0
(only active with versions of Word up to Word 2003).
This command will make WFC position its toolbar as follows, replacing A,B,C with numbers:
A is for the position style where 1 is horizontal top (regular), 0 is vertical left, 2 is vertical right, 3 is horizontal bottom, 4 is floating;
B is the vertical position, in pixels, from the top of the Word window;
C is the horizontal position, in pixels, from the left of the Word window.
The example (WfToolbarPosition=1,0,0) is for a "regular" position, docked top left.
Note that Office X/Mac and later tend to force a vertical position for custom and add-on toolbars.

Word/character count & billing

WFC's way of counting words is slightly different from Ms-Word's statistics (Tools/Wordcount or Tools/Statistics). For example, in the following text:

L'argent de Louis-Philippe

Ms-Word will find 3 words, while WFC will find 5 words (a very similar word count is upheld by most translation tools). On average, WFC will find from 5 to 10% more words than Ms-Word, depending on the language. The difference is more striking with French, more modest with other languages. This way of counting is in keeping with most translation syndicates and unions in most countries using alphabetic languages.
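
The difference can be reproduced with a small Python sketch. It assumes that the extra words come from splitting on apostrophes and hyphens in addition to spaces; the manual does not spell out WFC's exact rule, so cat_style_count is only an approximation, and both function names are hypothetical.

```python
import re


def word_style_count(text):
    """Count space-separated words (roughly what Ms-Word reports)."""
    return len(text.split())


def cat_style_count(text):
    """Count words, also splitting on apostrophes and hyphens (assumed rule)."""
    return len([word for word in re.split(r"[\s'’\-]+", text) if word])


sample = "L'argent de Louis-Philippe"
print(word_style_count(sample), cat_style_count(sample))  # prints "3 5"
```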

Discuss the word count issue with your client before starting working on a project.

On tagged documents, tags are counted as one word (regardless of their number of characters or words) and their number is also reported in the analysis final report. A tag is defined as any contiguous series of characters (spaces included) that have the tw4winInternal style.
Note that, unlike the word count, the character count does not include tags: a tag is counted as one word, but its characters are not counted.
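
As an illustration of this rule, the Python sketch below counts a segment given as a list of (text, is_tag) runs, where is_tag stands for "this run has the tw4winInternal style". Each tag run adds one word and no characters. The function is hypothetical; it uses a plain space-based word split and ignores spaces in the character count, both of which are simplifying assumptions rather than WFC's documented behaviour.

```python
def count_segment(runs):
    """Count words, characters and tags in a list of (text, is_tag) runs."""
    words = characters = tags = 0
    for text, is_tag in runs:
        if is_tag:
            # A contiguous tag run is one word, whatever its length.
            words += 1
            tags += 1
        else:
            words += len(text.split())
            characters += len(text.replace(" ", ""))  # assumption: spaces not counted
    return {"words": words, "characters": characters, "tags": tags}


runs = [("The ", False), ("<B>", True), ("final", False),
        ("</B>", True), (" document.", False)]
print(count_segment(runs))
```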

The WFC word/character count, as with all CAT tools, is based on what the tool considers to be translatable text. This can depend on the way you set up your tool. For example, the use of the "SegmentAll" command will force WFC to consider any text as translatable, including isolated fields, figures, etc. which would otherwise be left out of the translation process.

The WFC word/character count includes all headers and footers, footnotes, but not fields. Pay attention to word count when auditing a project, or producing an estimate. Ask yourself (if applicable) whether the document(s) contains bookmarks, and if it does, what the author/client wants to do with them; whether graphics or textboxes should be translated; whether headers and footers should be translated; whether the word count is based on source or target (translated) text; how to count tags, if any (per piece? per word? per character? at what rate?), etc.

Languages that require Unicode

Latin-1 is a character set used for most West-European languages, including Scandinavian languages. It includes all English letters, plus a large number of accented letters. East-European languages like Polish, Czech, Hungarian, etc. use another character set known as CE or Latin-2 and do not fall in the Latin-1 group.
If you do not use Unicode, and your system is Windows NT4, Windows 95, or Windows 98, then characters in the glossaries and in message boxes may not be displayed properly, which is a minor annoyance.
I recommend using Windows 2000 (or a higher OS) and Ms-Word 2000 (or a higher version), or Word 2011 on a Mac, with Unicode translation memories, although older platforms may behave well.

CJK (Chinese, Japanese, Korean)

The following discussion concerns the WFC-generated data (like translation memories and glossaries). It does not concern documents. Ms-Word documents always support Unicode, and do not lose encoding. If there are issues, those are font (rendering) issues, or material brought into Ms-Word by copy-pasting alien material.

Unicode translation memories and glossaries should be used for translation where one of the two languages (source or target) is CJK. All versions of WFC after the year 2007 use only Unicode TMs and glossaries, so that should not be a worry.

Use path names and file names with Latin, non-accented (English) letters only for TMs, glossaries, INI files, and the Ms-Word Startup path (as displayed in Word/Options/Default file paths/Startup). Try to keep TM and glossary file names under 32 letters, using English non-accented letters, preferably without spaces. WFC may not support folder and file names with Unicode characters. If WFC malfunctions, this could be due to the Ms-Word Startup path containing Unicode characters. If this is the case, create a folder, for example C:\Startup, or MyMac:Startup, and copy Wordfast.dot there. Start Ms-Word, use the Tools/Options (or Office button/Options or Preferences) > Default folders dialog box to change Ms-Word's Startup folder to the one you just created. Close and restart Ms-Word.
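
The naming advice above can be checked mechanically. The Python sketch below is a hypothetical helper (check_wfc_path is not a WFC feature) that flags non-ASCII characters anywhere in a Windows-style path, as well as long file names and spaces in the file name. Measuring the name without its extension is an assumption; the manual only says "under 32 letters".

```python
from pathlib import PureWindowsPath


def check_wfc_path(path, max_name_length=32):
    """Return a list of warnings for a TM, glossary, or INI file path."""
    warnings = []
    if not path.isascii():
        warnings.append("path contains accented or other non-ASCII characters")
    name = PureWindowsPath(path).stem  # file name without folder or extension
    if len(name) >= max_name_length:
        warnings.append(f"file name is not under {max_name_length} letters")
    if " " in name:
        warnings.append("file name contains spaces")
    return warnings


print(check_wfc_path(r"C:\wordfast\NewTM-EN2FR.txt"))   # no warnings expected
print(check_wfc_path(r"C:\Mémoires\Ma TM.txt"))
```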

If given the choice of Unicode flavour when you save a TM or glossary, select the simple "Unicode" (this can be just Unicode, or UTF-16) setting, not a language-specific encoding.

If you use Ms-Word XP (Ms-Word 2002), note that a notorious Ms-Word 2002 glitch prevents it from saving documents as Unicode (unless you specifically added that feature at installation time). In this case, export the TM to Unicode. To do so, start the TM/Glossary editor, click "Tools", and run the "Rewrite as Unicode" special filter. Another workaround is to open an existing Unicode document, delete all its contents, paste your data into it, save it, then rename it directly on disk.

In WFC's main window, next to the translation memory path and name, you should see the (CJK) mention. This mention appears if the source language code begins with either ZH-, JA-, or KO-. This mention is essential for WFC to switch to a mode compatible with Chinese, Japanese, or Korean.
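
The rule for the (CJK) mention is simple enough to express in one line of Python. The sketch below is only an illustration of the prefix test described above; the function name is hypothetical, and treating the code case-insensitively (so that ja-JP and JA-JP behave alike) is an assumption.

```python
CJK_PREFIXES = ("ZH-", "JA-", "KO-")


def is_cjk_source(language_code):
    """True if a source language code such as 'ja-JP' begins with ZH-, JA-, or KO-."""
    return language_code.strip().upper().startswith(CJK_PREFIXES)


print(is_cjk_source("ja-JP"), is_cjk_source("EN-US"))
```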

Notes:
For Japanese, Chinese, and Korean, make sure the full-width (double-width) punctuation marks (like 。！？) are visible in the WFC/Setup/General "End-of-segment punctuation" setting. They should be automatically added there when you create a translation memory with JA, KO, or ZH in the source language (for example, ja-JP, zh-CN, etc.). If you do not see the Japanese or Chinese full stop, question mark, and exclamation mark, select them in a document and copy them (Ctrl+C). Open WFC. In the WFC/Setup/General "End-of-segment punctuation" setting, press Enter to edit the value, then paste your punctuation before the existing punctuation there (I advise not deleting the existing Latin punctuation).
For Japanese and Chinese, check at least the "An ESP without a trailing space ends a segment" rule in WFC/Setup/Seg, so that end-of-sentence punctuations that are not followed by a space may still be recognised as ending a sentence. This too is normally done automatically by WFC when the TM is CJK.
To have all target segments receive a specific font (a font that can display CJK characters), use the WFC/Setup/General "Target font" setting to specify the target font. But this is not necessary if your platform automatically adapts fonts to languages.
To have both Concordance search and glossaries displayed using a specific font, go to WFC/Setup/Pandora's box. Add the parameter TermFont="MyFont" with the required font instead of MyFont.

If you open a glossary or a translation memory with Ms-Word and cannot read the text: select all text then apply a font that can display your language (a specific font, or a generic Unicode font).
If you still cannot see text properly displayed, and all you see are question marks (????) then perhaps, at some stage, the file was saved as (rewritten) using a simple text or text-only or 8-bit ANSI format rather than Unicode. There is no way back. Make sure Unicode files remain Unicode at all times. This concerns the Text format used for translation memories and glossaries, not Ms-Word documents. Unicode is not relevant with the DOC file format.
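
Why there is "no way back" can be seen in a few lines of Python. The sketch assumes that the 8-bit ANSI format behaves like Windows-1252 and that unsupported characters are written as question marks, which matches the symptom described above; how Ms-Word actually performs the conversion is not documented here.

```python
text = "翻訳メモリ"  # Japanese for "translation memory"

# Saving as 8-bit ANSI: every character outside the code page becomes "?".
ansi_bytes = text.encode("cp1252", errors="replace")

# Saving as Unicode (UTF-16): the text survives a round trip.
utf16_bytes = text.encode("utf-16")

print(ansi_bytes.decode("cp1252"))          # only question marks are left
print(utf16_bytes.decode("utf-16") == text)
```

Once the file holds only question marks, the original characters cannot be recovered from it, which is why a Unicode TM or glossary should never be rewritten in a text-only format.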

If an Ms-Word document does not display your language properly, it's a font problem. Target segments must receive the proper font; see above for automatically applying a certain font to target segments.

Special care

This section deals with expert uses of WFC for tasks that require special attention. WFC does not guarantee operation because of its very nature as a mere Ms-Word complement. WFC is an add-on to a complex program (Ms-Word) that handles documents which, in the course of their lives, have been handled very differently by different people using different versions of Ms-Word (on PCs or Macs), through different formats (DOC, DOCX, RTF, HTML, etc.) and sometimes ill-conceived (with textboxes used instead of tables).

The special care section deals with tasks that are possible with WFC, but which need special attention in their execution, as well as a good knowledge of Ms-Word. Beginners should train themselves, or seek professional training, before engaging in projects outlined in the Special Care section. It is out of the question for any translator to accept a "Special care" job without a prior understanding of the risks involved.

Tagged files

WFC is designed to translate the Word native "DOC" and "DOCX" formats. Tagged files are better translated with programs that were designed from the ground up to handle tagged files, like WF PRO.

WFC retains the capability to translate "tagged" files, but it should be noted that an Ms-Word tool cannot efficiently handle or protect tags. Translating tagged files with Ms-Word requires expertise, should things go wrong at any stage.

Some translation agencies, which are equipped with tagging software to prepare documents, may ask free-lance translators to work on tagged files. Agencies and free-lance translators should know that WFC is compatible with the most current tag formats, such as Trados and RWS Rainbow. Here is some advice for translating tagged documents. Please pay attention to the following advice, because tagged files that are not properly handled can cause serious problems.

Agencies that entrust tagged files to a translator for the first time should review the first translated file immediately after the translator has completed it, to make sure tags have been properly handled. If necessary, adjustments should be made before going any further into the project.

What follows is a crash course on handling tagged files with WFC and Ms-Word.

Internal tags
The red tags (usually with the tw4winInternal style) are internal and are mostly found within the text to be translated, in the translation.

Example: The <B>final</B> document.
translates as: Le document <B>final</B>.

In this example, <B> and </B> are tags that command the bold type in HTML. The translator has positioned the red tags at the right position in the translated sentence. The translated text has neither the tw4winInternal nor the tw4winExternal style; it has a "Normal" or "Translatable" style. Only tags have a tag (tw4winInternal) style, red or grey. Styles are important, because tagging/untagging software relies on style, not font colour, to differentiate tags from translated text.

! Be careful when typing. If you type regular text (for example, right after a <tag>), and notice that the text appears red, stop! Only tags must have the tw4winInternal style. Regular text should have another style, usually named "Normal", or "Translatable", whatever that is. In fact, the style of the regular text does not matter, as long as it is neither tw4winInternal nor tw4winExternal.
Note that to revert text from a tw4winInternal style (or any other style) to the default style, select the text and press Ctrl+Spacebar. A slower method involves the Style listbox, the mouse, etc. If you are not familiar with Ctrl+Spacebar, learn it right now: select some styled text, apply it and see how it works.

! Normally, internal tags must not be modified, edited or translated. Some tags can be added or omitted if the translation requires it. Otherwise, the golden rule is that all internal tags (usually enclosed between < and >) present in the source segment must be duplicated in the target segment, and positioned correctly.

To duplicate these internal tags, WFC provides a set of shortcuts. Ctrl+Alt+Left/Right will select the previous/next internal tag (in the source segment); Ctrl+Alt+Down will duplicate ("bring down") the selected tag at the insertion point, in the target segment. You should get used to these shortcuts. For better speed, you can type < or / and AutoSuggest will pop up a list of available tags for quicker placement.

If you copy the source text into the target segment and translate by overwriting it, or if you edit an existing target segment, make sure the translated text does not have a tag (red or grey) style. If the cursor is immediately after a red (or grey) tag, whatever you type will also be red (or grey), and this causes problems later on. To avoid this, remember that if your cursor is immediately after a red tag, pressing Ctrl+Spacebar will restore the normal style at that point, and the text you type will not have a tag style. Ctrl+Spacebar is an Ms-Word shortcut.

In a regular tagged style, the only two important styles are tw4winInternal and tw4winExternal. Text outside tags can have any other style (styles named Normal, Translatable, etc.) or font attributes (bold, blue, whatever).

Here are examples of correct and incorrect translation units:

The <B>final</B> document is here.
  
Le document <B>final</B> est ici.
This TU is correct.
 
The <B>final</B> document is here.
  
Le document <B>final</B> est ici.
!The target word "final" has an internal tag style
 
The <B>final</B> document is here.
  
Le document <B>final</B> est ici.
!The target segment's first tag has lost its internal (red) tag style.
 
The <B>final</B> document is here.
  
Le document <B> final est ici.
!The target segment's second tag is missing (it should be </B>).
 

WFC has a Quality Assurance option called "Identical tags in source/target segments". I recommend turning this QA option on. To avoid false alerts for tags that are actually optional, use the Pandora's box "OptionalTags" command in WFC/Setup/PB.

Most optional tags are tagged items (like the unbreakable space, quotes, ampersand etc) that look like &amp; or <:hs> or &nbsp; etc. You may have them in the source segment but not in the target segment, or the reverse, according to the translation's needs. Thus, the following segment:

The R&amp;D department is <B>ready</B>.
Le Département "Recherche et Développement" est <B>prêt</B>.

is valid, even if there are three internal tags in the source segment and two in the target. The source segment's ampersand has not been re-used. There may be other exceptions where even non-optional tags must be added or omitted.
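
The logic of such a check, with an allowance for optional tags, can be sketched in Python as follows. This is not WFC's implementation: WFC recognises tags by their tw4winInternal style, whereas this sketch finds them with a simple pattern (<...> and &...;) so that it can run on plain text. The function name and the optional_tags parameter are hypothetical.

```python
import re
from collections import Counter

# Assumed tag shapes: <...> and &...; (WFC itself relies on the tag style instead).
TAG_PATTERN = re.compile(r"<[^<>]+>|&[^;\s&]+;")


def tag_mismatches(source, target, optional_tags=()):
    """Return (missing, extra) tags in the target, ignoring optional tags."""
    optional = set(optional_tags)
    src = Counter(t for t in TAG_PATTERN.findall(source) if t not in optional)
    tgt = Counter(t for t in TAG_PATTERN.findall(target) if t not in optional)
    missing = sorted((src - tgt).elements())  # in source, absent from target
    extra = sorted((tgt - src).elements())    # in target, absent from source
    return missing, extra


source = "The R&amp;D department is <B>ready</B>."
target = 'Le Département "Recherche et Développement" est <B>prêt</B>.'
print(tag_mismatches(source, target))                           # &amp; reported
print(tag_mismatches(source, target, optional_tags=["&amp;"]))  # no alert
```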

Long tags
WFC considers any contiguous text with an Internal style as one tag. So for example

<p align="left" font="Times New Roman" size="12"><strong><table align="center">

is considered one tag.

If this contiguous stretch of text actually contains more than one tag, and if these tags have to be handled separately, use the Ctrl+Alt+Up shortcut to make WFC treat this tag as separate placeables. The pairs of characters < and > as well as & and ; will be considered as tag beginning and tag ending.
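
The splitting rule (a tag begins at < or & and ends at > or ;) can be illustrated with the Python sketch below, applied to the long tag shown above. The function name is hypothetical and the pattern is only an approximation of what Ctrl+Alt+Up does inside WFC.

```python
import re

# A placeable starts at "<" and ends at ">", or starts at "&" and ends at ";".
PLACEABLE = re.compile(r"<[^<>]*>|&[^&;]*;")


def split_long_tag(tag_text):
    """Split one contiguous run of tag text into separate placeables."""
    return PLACEABLE.findall(tag_text)


long_tag = ('<p align="left" font="Times New Roman" size="12">'
            '<strong><table align="center">')
for placeable in split_long_tag(long_tag):
    print(placeable)
```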

External tags
External tags (tw4winExternal style) are kept out of the translation. Like internal tags, they must not be edited, deleted, translated etc.

! In case of doubt, stop and ask the client or the agency. Do not proceed if you are not sure you handle tags correctly. If you start working on a project with tags for the first time, submit your first translated file for review and approval before going any further.

PDF

Important note: Wordfast Anywhere (http://anywhere.wordfast.net, or https://www.FreeTM.com) can convert a PDF document into a Word document, for free as of Spring 2011. This even concerns "dead PDFs", i.e., PDF documents that contain screenshots of text, or scanned text. Wordfast, as a brand, was the first translation tool maker to offer really free (no usage restriction, no advertising, no "upgrade to premium") PDF-to-RTF conversions.

PDF (Portable Document Format) was designed at a time when fonts were scarce and expensive. Many systems lacked fonts, or were equipped with very different fonts. A PDF file contained the fonts it used, making it very portable.

Unfortunately for us translators, the success of PDF is found in another feature: the difficulty (or the near impossibility for most people) to alter PDF content. As a consequence, one cannot directly edit, therefore, "translate from within" a PDF file: text has to be extracted from it. To make things even more complicated, many PDF documents (so-called "Dead PDFs") either lock their content, or worse, only contain graphics (screenshots) of text, not actual, selectable text.

The bottom line is that no tool will let you translate a PDF "from within", and deliver a visually correct translated PDF at the push of a button. PDFs must be converted to another format, and even that process is hazardous.

WFC can convert most "live" PDF documents into Word documents, in the Windows environment, provided:

If the PDF document is rather short, has a simple layout (no columns, no graphics), and is made of actual text, you can try to select it all (Ctrl+A), or page by page, then copy-paste it into a blank Word document. One problem is that every line ends with a paragraph mark (carriage return) as if PDF was created by typewriter nostalgics. Press Alt+F8 and run ("execute") a Wordfast macro called WfTextToDoc, which will reconstruct most paragraphs. You may, as an option, make all fonts and layout uniform by selecting the entire document (Ctrl+A on most locales), then Ctrl+Spacebar (reset all font attributes), then Ctrl+Q (reset all paragraph formats).

Footnotes

When a source segment contains a footnote reference (a number that looks like this: 1 and which, if double-clicked, opens the corresponding footnote), start translating the target segment as usual. At the point where the footnote reference should appear in the translated text, use Ctrl+Alt+left/right to select the footnote reference (it should be boxed in red), then transfer it into the target segment using Ctrl+Alt+Down. If these shortcuts are not available, you can use the corresponding icons (Next/Previous/Copy Placeable) in the WFC toolbar.

You can also manually select the footnote reference, cut it (not copy it) and paste it into the target segment. The important point is to actually cut (not copy) then paste the footnote reference (move the footnote reference), otherwise you would duplicate notes.

When the document's translation is over, double-click any footnote reference to open the footnote pane (the current window will split and the bottom half will show footnotes) to translate the actual footnotes. Simply put your cursor in a footnote and start translating as usual with WFC. You can translate footnotes immediately after a segment, by closing the segment then opening the footnote pane and translating the footnote. But I recommend translating all footnotes in a separate translation session when the document's translation is over.

After you transfer a footnote reference, WFC will replace the source segment's original footnote reference with a "dummy" footnote reference number, so the revisor can know where the original footnote reference position was.

Note that when there are multiple footnote references in the same segment, they will appear wrongly numbered after you transfer the first footnote reference. The correct numbering will be restored when you transfer the segment's last footnote reference.

In case of a mistake, use Ms-Word's undo function immediately after pasting a footnote.

Fields and objects

An Ms-Word document can contain fields or objects like hypertext links, buttons, graphics etc. Normally, fields (with the exception of hyperlinks) should not be translated, unless specifically required by your client, like index fields, for example. Fields should be copy-pasted from the source segment into the target segment. Note that the Alt+F9 shortcut can toggle the two views of fields: either the result of the field (a field is a programmatic instruction processed by Ms-Word, usually resulting in some displayed text - the result), or field codes, which look like { DATECREATION \* FUSIONFORMAT }.

When fields are present in the source text and no proposition comes from the TM, you may consider using WFC's Copy source icon to copy the source segment into the target segment, and translate by overwriting it, leaving fields or objects unchanged. Otherwise, individual fields and objects should be carefully copy-pasted into the target segment's translation, at the appropriate location.

Translatable fields

Read the general introduction to fields (above), if this is not yet done.

Fields where the result (not the code) must be translated.
Hyperlinks are a good example. These fields should be manually copied from source to target, then manually translated - toggle the field's view with Alt+F9 as necessary, so you can edit the translatable element (the result). Another approach is to right-click the hyperlink, then select "Edit" and translate the field's displayed text.

With Ms-Word 2000 or higher, right-click the field, click "Hyperlink", then "Edit hyperlink". The translatable item is at the very top of the "Edit hyperlink" dialog box.

Fields where part of the code must be translated.
The code for most fields cannot, and should not, be translated. There are a few exceptions to this rule, like index fields ("EX", "XE"). Such fields have a translatable item, contained between quotes as in the following example:

{XE "Translatable text:Page 4 Figure 5" \b \r }
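
To make the idea of "the translatable item between quotes" concrete, the Python sketch below extracts the quoted text from an XE field code and writes a translation back, leaving the switches untouched. It is a hypothetical illustration on plain text, not a description of how WFC accesses fields inside Ms-Word, and the French replacement text is an invented example.

```python
import re

# Opening part of an XE field, its quoted text, and the closing quote.
XE_FIELD = re.compile(r'(\{\s*XE\s+")([^"]*)(")')


def translatable_part(field_code):
    """Return the quoted text of an XE field code, or None if there is none."""
    match = XE_FIELD.search(field_code)
    return match.group(2) if match else None


def with_translation(field_code, translation):
    """Replace only the quoted text, keeping switches such as \\b and \\r."""
    return XE_FIELD.sub(lambda m: m.group(1) + translation + m.group(3),
                        field_code, count=1)


field = r'{XE "Translatable text:Page 4 Figure 5" \b \r }'
print(translatable_part(field))
print(with_translation(field, "Texte traduisible:Page 4 Figure 5"))
```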

Make sure Ms-Word's View options (Tools/Options/View or the Alt+F9 shortcut) are set to display field codes and hidden text.

When you open a segment with translatable fields (and the TM does not bring any match), you can use the Previous/Next Placeable utility (Ctrl+Alt+Left/Right shortcuts or the corresponding icons) to select the field in the source segment, then copy it down (Ctrl+Alt+Down) at the proper position in the target segment. At that moment, WFC will display a text input dialog box containing the translatable part of the field and will wait for the translation (if a match is found in the TM, it will be proposed).

Another way is to use the CopySource icon or shortcut (Alt+Insert). When WFC copies a source segment with translatable fields, it will take you to each translatable field and prompt you for translation.

It is also possible to directly edit the editable part of the field in the document, if the field codes are made visible (Alt+F9). This is recommended if the above method fails for some reason.

Bookmarks

See the glossary of terms if you are not sure what a bookmark is.

Handling bookmarks, or not handling bookmarks, is a question to be discussed with the client. In many projects, the author or the client may not need bookmarks to be positioned in the translated text. This is simply due to the fact that in many cases, bookmarks are part of a complex, carefully engineered scenario, and the document's owner may rather wish an engineer, or a technician, to re-position bookmarks on the translated document, then test the entire document again. In this situation (the translator not being required to position bookmarks in the translated document), simply click "No" if WFC prompts you to have bookmarks prepared for translation.

Your client should inform you of the presence of bookmarks and give you instructions (transfer them or ignore them), since it is the client, or the author, who has introduced the bookmarks in the first place. However, your client may not be the author of the document(s), and the client may not even know what a bookmark is. In this last case, use tact and wisdom to make sure what should be done. The bottom line is: do not transfer bookmarks - a complex task at times - unless your client asks you to do so; but if you have to handle bookmarks, carefully weigh and estimate the extra workload. A bookmark, at the very least, should be billed as ten words, although it usually takes longer to correctly position the two ends of a bookmark than to translate two words.

Normally, bookmarks found in the source text should be transferred into the target text, over the corresponding span of translated text.

One important point is, since two bookmarks cannot have the same name in the same document, bookmarks must be transferred (moved), not copied, into the target text. In other words, you cannot duplicate or copy bookmarks as you would, for example, duplicate or copy fields.

Before starting a translation session over a document that contains bookmarks, WFC will warn of the presence of bookmarks and propose to mark them using conspicuous red markers positioned at the beginning and end of the bookmark, like this: [ and ]. If a bookmark has a null length, you would see []. Answer "Yes" to have bookmarks thus marked.

WFC will prompt you only once (per document) for marking bookmarks. If you answer "No", then WFC will not prompt you anymore for marking bookmarks on the current document unless you open a translation session with the cursor at the very top of the document. If you answered "No" by mistake, or if you want to mark bookmarks at a later stage, use the WFC menu, select the Miscellaneous submenu and run "Unlink". Once the document has been "unlinked", WFC will prompt you again for marking bookmarks if you start a translation session.

During translation, if a source segment contains red markers, all you need to do is use the Next/Previous/Copy Placeable icons or shortcuts (Ctrl+Alt+Left/Right/Down) to select or box the red bookmark markers (they always come in pairs, opening and closing), then transfer the red marker(s) to the appropriate location in the target segment using Ctrl+Alt+Down.

When cleaning up a document, WFC will remove the source segments as usual then replace the red markers in the target segments with the appropriate bookmarks.

WFC's Quick-clean function will propose an option for processing (restoring) bookmarks without cleaning up the document. This is useful for translators who are required by the client to send back "uncleaned" or "bilingual" documents (for example, because the client wants to clean up the documents with a different tool, not with WFC). In this case, the document is not cleaned up, but all bookmark markers are removed and bookmarks are correctly assigned to the target text.

Pay attention to the bookmark question before beginning a project, because handling bookmarks takes time; if the problem is overlooked, reconstructing bookmarks manually on a translated document can take a long time.

The WFC Translate tool's default behaviour is to mark bookmarks. If you want to prevent this, add "TranslateIgnoreBookmarks" in Pandora's box.

Bookmarks can be found in many different types of documents, and they are put to many different uses. Documents that contain hyperlinks, indexes, or Tables of Contents usually make considerable use of bookmarks.

Dictionary

(PC only) WFC can be linked to virtually any external dictionary application, such as the Collins™ On-line, Harrap's™ Shorter, Merriam Webster's™, Microsoft Encarta™, any web-based dictionary or database, Trados Multiterm™ etc, using the Select dictionary button of the Terminology/Reference tab.

The access keystroke (Keys button) defines the keystrokes used for accessing an external dictionary, where some fields are replaced with values as in the following table:

To set up the "Keys" parameter, start your dictionary application, then note the sequence of keystrokes necessary to perform a word search. Once this is done, click the Keys button and enter the caption of the dictionary application window, followed by a semi-colon, followed by the keys you noted. For example,

Harraps;{pause}{F3}{Escape}%e{SearchWord}{Enter}

will instruct WFC to look for an application whose window name begins with Harraps, activate it, pause for 200 milliseconds, then type an F3 key, followed by an Escape Key, then Alt+E, then the searched-for word, then an Enter key.
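As a rough illustration of how such a Keys string breaks down, the following Python sketch splits the window caption from the key sequence and separates the fields in braces from plain typable keys. The function name parse_keys is hypothetical and is not part of WFC; the sketch only mirrors the description above (for instance, % followed by e stands for Alt+E).

```python
import re

def parse_keys(spec):
    """Split a Keys string into (window caption, list of key tokens)."""
    # The caption is everything before the first semi-colon.
    caption, _, sequence = spec.partition(";")
    # A token is either a {Field} in braces or a single typable character.
    tokens = re.findall(r"\{[^}]*\}|.", sequence)
    return caption, tokens

caption, tokens = parse_keys("Harraps;{pause}{F3}{Escape}%e{SearchWord}{Enter}")
print(caption)
print(tokens)
```

Reading the token list from left to right gives the order in which the keystrokes would be sent to the dictionary window.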
All typable keys are simply entered as they are, in lowercase. Function keys and other special keys are entered as follows:

Once the dictionary has been set up, close WFC. Position the cursor on a word, or select an expression, and click the Dictionary icon (or press Ctrl+Alt+D; for dictionary #2, use the Ctrl+Alt+F shortcut). WFC will launch the dictionary application (or activate the relevant window if the application is already running) and execute the sequence of keystrokes you defined.

Concordance search

The search for concordance will be done first in the background translation memory (if applicable), then in the regular translation memory. The purpose of Concordance search is to find Translation Units (TUs) that contain a given word or a set of words.

The Ctrl+Alt+C shortcut or the Concordance icon launches the search. The search will bring results on words that begin like the searched-for item, case-insensitive. Searching for cat will bring TUs that contain cat, or catering or caterpillar, etc, but not bobcat or supercat.

Searching for *cat will bring TUs that contain words like bobcat, or supercat, etc.

The AND operator can be used. Searching for cat+dog will bring TUs where the two words cat AND dog are found. If words are simply separated with spaces, the OR operator is assumed, so searching for cat dog will bring TUs where either cat OR dog is found. To search for an exact phrase, enclose it in straight quotes, so searching for "The cat chases the dog" will bring results where the phrase "The cat chases the dog" is literally found, regardless of case.
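The search rules above can be sketched in Python as follows. This is only an approximation written for illustration, not WFC's actual search code: the function names are hypothetical, and a leading * is assumed to mean that the term may appear anywhere inside a word.

```python
import re

def word_matches(term, text):
    """True if a word in text begins with term (or contains it, if term starts with *)."""
    words = re.findall(r"\w+", text.lower())
    term = term.lower()
    if term.startswith("*"):
        return any(term[1:] in w for w in words)
    return any(w.startswith(term) for w in words)

def concordance_match(query, text):
    """Apply the OR (space), AND (+) and exact phrase (straight quotes) rules."""
    query = query.strip()
    if len(query) >= 2 and query.startswith('"') and query.endswith('"'):
        # Exact phrase, regardless of case.
        return query[1:-1].lower() in text.lower()
    # Space-separated alternatives are ORed; + joins words that must all be present.
    return any(
        all(word_matches(t, text) for t in alt.split("+") if t)
        for alt in query.split()
        if alt.strip("+")
    )
```

For example, concordance_match("cat+dog", tu_text) is true only when both words are found in tu_text, while concordance_match("cat dog", tu_text) is true when either one is found.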

Note that to open the dialog box that lets you specify such extended search options, you must start concordance search when no selection is made; if a selection is made (for example, one word is selected in the source segment), then WFC assumes that the selected word has to be searched and will directly search for it, without offering the extended search dialog box. This allows fast searches with minimal clicks or shortcuts.

The same rules apply for Reference searches as well.

If you check the "Search concordances in all sibling translation memories" option in WFC/Terminology/Other, the concordance search will be extended to other TMs present in the same folder as the currently active TM.
It is possible to cancel a Concordance search with the Escape key, or with the same shortcut that started the search (i.e., Ctrl+Alt+C).

Conversions

Disclaimer: this feature converts from US Customary Units to Metric, and back. For other measures (such as British Units, or the older Imperial Units), the set of signatures must be modified. The default set of conversion signatures is in an editable resource text file named either wordfast.en.txt, or wordfast.local.txt, located in the same folder as wordfast.dot. The list of conversions is located at the end of the file, for major languages.

Wordfast Classic's AutoSuggest feature is equipped with an automatic conversion utility as of version 6.28, January 2016.

The mission statement for the WFC converter is:

Typical use case

While translating a segment that contains a recognized unit of measurement, WFC should pop up an AutoSuggest box with a set of useful propositions, as follows:

 A 300-pound gorilla escaped the zoo.
 
 Un gorille de 3 |

 
 PL 300-pound -> 135,9 kilogrammes (exact)
 PL 300-pound -> 125 kilogrammes (±5%)
 PL 300-pound -> 150 kilogrammes (±10%)
 PL 300
 

In the above EN to FR example, typing the number 3 pops up a suggestion for a converted measure. In a non-scientific translation, rounded measures like 125 or 150 can be preferred to the surgically exact 135,9 kilogrammes.

WFC's converter should spot and convert most abbreviated/full measurements, singular or plural, between US Customary Units and Metric. The source measurement should follow a number, only separated from the number with either a space, or a non-breaking space, or a minus sign (dash, or hyphen). In other words, with the example above, the following formats are recognized:

300 pound, 300 pounds, 300-pound, 300 lb, 300 lb., 300-lb., 300-lb, 300 lbs...

However, the following forms may not be recognized:

300 US pounds, 300-plus pounds, 300 more pounds...

because there is an unrecognized word between the number and the unit.
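The recognition rule described above (a number, one separator, then a known unit) can be approximated with a regular expression. The sketch below is an illustration only: the UNITS list is a hypothetical subset, the names are invented for this example, and WFC's own matching may differ in detail.

```python
import re

# Hypothetical subset of unit names; the real list lives in the signature file.
UNITS = ["pounds", "pound", "lbs", "lb.", "lb"]

# Longest names first so that "pounds" wins over "pound".
_unit_alt = "|".join(re.escape(u) for u in sorted(UNITS, key=len, reverse=True))

# A number, then exactly one separator (space, non-breaking space, or hyphen),
# then a unit that is not immediately followed by another word character.
PATTERN = re.compile(r"(\d+(?:[.,]\d+)*)[ \u00a0-](" + _unit_alt + r")(?!\w)")

def find_measure(text):
    """Return (number, unit) for the first recognized measurement, or None."""
    m = PATTERN.search(text)
    return (m.group(1), m.group(2)) if m else None
```

Because no wildcard is used, a form such as "300 US pounds" returns None: the word between the number and the unit breaks the pattern.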

Financial formats

WFC attempts to convert, then suggest, financial formats following the most likely conversion rule.

Note that the Setup > General dialog box in WFC lets you enter the values for the target language thousand and decimal separators, as well as abbreviated dates. Those are ,.- (comma, dot, dash) for EN-US as target language, and will often be .,/ for European languages, with possible variations, such as French, which uses non-breaking space, comma, slash, noted as ~,/ (the tilde character ~ means the non-breaking space).

 Today's lottery is worth $400,000.00!
 
 La loterie d'aujourd'hui vaut 4 |

 
PL 400 000,00
PL 400,000.00
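The separator swap shown in this example can be sketched as below. It assumes the separators are given as short strings in the style of the Setup > General dialog (thousand separator first, decimal separator second, with ~ standing for the non-breaking space); the function name and parameters are hypothetical.

```python
def convert_number(text, source_seps=",.", target_seps="~,"):
    """Rewrite a formatted number using the target thousand and decimal separators."""
    src_thousand, src_decimal = source_seps[0], source_seps[1]
    tgt_thousand, tgt_decimal = target_seps[0], target_seps[1]
    # The tilde stands for a non-breaking space, as in the Setup dialog.
    if tgt_thousand == "~":
        tgt_thousand = "\u00a0"
    # Split off the decimals first, then swap the thousand separators.
    integer_part, _, decimals = text.partition(src_decimal)
    result = integer_part.replace(src_thousand, tgt_thousand)
    if decimals:
        result += tgt_decimal + decimals
    return result

print(convert_number("400,000.00"))              # French-style separators
print(convert_number("400,000.00", ",.", ".,"))  # dot and comma swapped
```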
 

Dates

Due to the great variety of date formats, WFC only recognizes major formats, such as a flat, regular date like 12 January 2016 (proposed as, for example, 12 janvier 2016 in French). Abbreviated date formats that are complete (year, month, day) for unequivocal recognition are also recognized:

 The meeting is scheduled for 2016/12/31.
 
 La réunion est prévue pour le 2 |

 
PL 2016/12/31
PL 31-12-2016
PL 31/12/2016
 

Note that WFC proposes three formats here: the original one, and two that were reversed to fit non-English formats, using the two major separators.
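A minimal sketch of that behaviour, assuming a complete year/month/day date as input: the function below returns the original date followed by the two reversed variants. The function name is hypothetical and the accepted input separators are an assumption made for this example.

```python
import re

def date_suggestions(text):
    """Return the original year/month/day date plus two day-month-year variants."""
    m = re.fullmatch(r"(\d{4})[/.-](\d{1,2})[/.-](\d{1,2})", text)
    if not m:
        return []
    year, month, day = m.groups()
    # Original first, then reversed with each of the two major separators.
    return [text] + [sep.join((day, month, year)) for sep in ("-", "/")]

print(date_suggestions("2016/12/31"))
```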
The set of signatures is automatically generated by WFC based on your language setup. This set of signatures is provisionally generated for a few languages such as CS, DA, DE, EN, ES, FI, FR, IT, JA, HU, NL, NO, PL, PT, RU, SV, ZH. The set of signatures assumes that EN-US is either the source or target language. Other languages may be submitted by the community and will be available by download.

Customization of conversions

The set of signatures is in the WFC7.en.ui file and can be edited as a text file. The file is located in the same folder as wordfast.dot, and is named wordfast.en.txt, or wordfast.local.txt. For a language pair like EN to FR (English to French), conversions are located at the end of the file, listed for en-xx and fr-xx as follows:

en-xx-cnv="=*2.54,acre=*0.4046,acre=*4046,acres=*0.4046,acres=*4046,cu. feet=*0.283,cu. foot=*0.283,cu. ft.=*0.283,cu. in.=*16.38,cu. mile=*4.1618,cu. miles=*4.1618,cu. yards=*0.7645,cubic feet=*0.283,cubic foot=*0.283, (...)

fr-xx-cnv=cm,mètre carré,hectare,mètres carrés,hectares,mètres cubes,mètre cube,m3, (...)

Every EN measurement in the en-xx-cnv setting has a name (like "acre", or "cubic yard"), followed by a conversion formula, where * means multiplication, and / means division. Values are separated by commas.

For example,

acre=*0.4046

means that, if a number is followed by acres, like perhaps "120 acres" in the source sentence, then WFC will calculate 120 x 0.4046, and suggest 48.552 hectares. Values rounded to the nearest 5% and 10% are also proposed (like 50 hectares in that example).
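To make the formula syntax concrete, here is a small Python sketch that reads a setting of the form name=*factor or name=/factor and applies it to a number. It is only an illustration of the arithmetic: the function names are hypothetical, and the 5% and 10% rounding is left out because its exact rule is not spelled out here.

```python
from decimal import Decimal

def parse_signatures(setting):
    """Turn 'name=*factor,name=/factor,...' into {name: [(operator, Decimal), ...]}."""
    table = {}
    for item in setting.split(","):
        name, _, formula = item.partition("=")
        # Skip anything that is not a * or / formula.
        if not formula or formula[0] not in "*/":
            continue
        table.setdefault(name.strip(), []).append((formula[0], Decimal(formula[1:])))
    return table

def convert(value, formula):
    """Apply one (operator, factor) pair to a source value."""
    operator, factor = formula
    value = Decimal(value)
    return value * factor if operator == "*" else value / factor

table = parse_signatures("acre=*0.4046,acre=*4046,cu. in.=*16.38")
print(convert("120", table["acre"][0]))  # 120 acres expressed in hectares
```

Decimal is used instead of float so that the product of 120 and 0.4046 is not affected by binary rounding.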

In the signature file, the same measures may appear under different forms, like various abbreviations (pound; lb; lb.), or singular and plural forms (pound; pounds). This redundancy is meant to enhance the chances of spotting the pattern in the source text. Wildcards are not used, which is why both singular and plural forms of the same measures are listed in the signature file. Note the unorthodox English lbs abbreviation in the supplied signature set. Such forms are necessary because of the poor quality of some technical documents. You may even enter a custom and horrific lb's (OMG!) if the English you are translating is of appalling quality. CAT tools belong to the realm of what's practical rather than what's ideal -- possible versus perfect.

Note: the very first measurement in the English (en-xx-cnv) setting is a straight double quote like " - this is because " is sometimes used in place of "inch".

TM & glossary management

Introduction
WFC translation memories and glossaries share the same format: tab-delimited text. This format is perhaps the simplest database format you can find - most other translation tools use proprietary formats that render direct data maintenance difficult (illustrating the concept of "captive market"). WFC remains committed to user-friendly formats, and to the competition's sustained astonishment, performance is not hampered at all by WFC using open formats.

To make a long story short, you can consider that both your TMs and glossaries are regular Ms-Word documents and use Ms-Word to maintain them: edit, proof-read, cut, paste, merge, etc. Countless other popular software can be used to maintain WFC data, and should any ad-hoc tool be developed for specific purposes, the openness of the WFC format is a welcome simplification for engineers. If the TM is too large for Excel™, then Ms-Access™, Ms-Word, FileMakerPro™, dBase™, FoxPro™, Paradox™ etc will open it anyway. Even the diminutive Notepad™, JustWrite™, WordPad™, SideKick™, XyWrite™... can open small to medium TMs.

The TM/Glossary editor

Click the "TM/Glossary editor" icon in WFC's main toolbar, or the last icon in any of the glossary toolbars to start the TM/Glossary editor. Outside a translation session, glossary toolbars can be opened using the Ctrl+Alt+Right shortcut (and closed using the Ctrl+Alt+Left shortcut). During a translation session, the Ctrl+Alt+G shortcut pressed on a word or selection will open the glossary toolbar(s) of the glossary(ies) where the term was found. Glossary toolbars open only on glossaries that were specified in WFC/Terminology.

WFC's TM/Glossary editor is intended to make maintenance easy and intuitive, and offers practically identical methods for TMs and glossaries. Once the editor is opened, you can scroll up/down the data, edit/delete/add entries.

Note: cutting (deleting) a single line (or entry, or TU) is a soft operation, meaning it can be reversed or undone (press Delete twice on an entry to see the toggling effect). When an entry is cut (or soft-deleted), it appears as a blank line, but when it is selected, the source and target data appears in the editor's bottom blue/green display. Ctrl+Delete will permanently erase cut entries by "packing", i.e. rewriting, the entire TM or glossary.

TM Tools

TM Tools contains special filters that are meant to perform operations that would be difficult or impossible to perform with just filtering and sorting. These operations are:

Mark redundant entries
(there are various types of definition for a redundant entry, depending on whether you use a TM or a glossary). This feature marks entries that are considered duplicates. Once the marking is done, you can review them, then delete them all by using the Cut shortcut (Ctrl+X) followed by a hard-delete command (Ctrl+Delete). Of course, with a TM, such entries are grouped if the TM is sorted on the source segment.

Reverse source and target
This will rewrite the current file and reverse source and target fields.

Export to Unicode
Exports the current file to a Unicode format.

Export to TMX (TM only)
Exports the current file to the TMX format. The TM is not overwritten - a new file is created, and it has a .tmx extension.

Export to TBX (Glossaries only)
Exports the current file to the TBX format. The glossary is not overwritten - a new file is created, and it has a .tbx extension. The TBX format is a popular glossary exchange format (TBX stands for Term Base eXchange) among translation tools.

Remove tags
This special filter removes tags from a TM. This is recommended after finishing a project with tagged files. The leverage of TUs with tags is precious within the scope of a particular project. Leveraging tags outside a project is an extreme rarity. This is why it is recommended to remove tags from a TM that will be used on different translation projects. Tags bloat TMs to a ridiculous extent.

Export as segment document
This filter will create a segmented document in Ms-Word that contains all Translation Units (TUs) in the TM.

Repair and compact TM
This maintenance will rewrite the entire TM, removing lines marked for deletion, removing empty lines. It will re-create the index. This filter can be run if the TM does not perform well, or before storing or archiving a TM.

Mark suspicious TUs
This powerful feature can clean up a TM by marking TUs that look suspicious for various reasons. After the filter was executed, TUs that look suspicious are simply marked (checked, or selected). You can review and delete them as needed.

Rewrite Entries with a Mask
This powerful feature is used to replace a particular field, or many fields, with some given value, or erase the content of the fields, in all visible entries. Visible entries are those that are displayed in the editor. If a filter is set, only some entries are visible.

You are first presented with an empty entry (a mask). You can:

All fields that are left blank (or which do not begin with = followed by at least one character) in the mask will remain untouched in the file.

The following mask would replace all User fields with "FOO", and erase Attribute fields 1, 2, 3, 4 in the entire TM:

Practical example: "I have that older, bulky TM that combines TUs from various translators. I want these entries grouped by user (translator) name. I want to delete all entries that have a usage counter of less than 2, and that are older than August 31, 2004. Then I want to review them one by one and perhaps have some entries not marked for deletion if I think they're useful after all. Only then will I erase all marked entries that remain".

Note that all operations except #7 can be undone.

Filter

Filtering means you define a condition with a Field Condition Argument format.
For example:

SourceText & "MyText"
where & means "contains", or
Counter = 0

See more examples in the Help of the Filter or Sort dialog box.

When Argument is made of text, it must be enclosed in straight quotes like this: "MyText".

The effect of a filter is that only the entries that conform to the filter's condition(s) will be made visible in the glossary editor. When a filter has been set, using the Mark methods (mark, unmark, copy, paste, cut) will operate only on visible entries.
In the Data Editor, use the F8 shortcut to cancel a filter.
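The Field Condition Argument idea can be sketched as a tiny parser. This is an illustration, not the editor's real filter engine: only the two conditions shown above (& for "contains" and = for equality) are handled, the comparison is assumed to be case-sensitive, and the function names and the dictionary-based entry are hypothetical.

```python
import re

def parse_filter(expr):
    """Split 'Field Condition Argument' into its three parts."""
    m = re.fullmatch(r"\s*(\w+)\s*(&|=)\s*(.+?)\s*", expr)
    if not m:
        raise ValueError("not a Field Condition Argument expression: " + expr)
    field, condition, argument = m.groups()
    # Text arguments are enclosed in straight quotes; strip them.
    if len(argument) >= 2 and argument[0] == argument[-1] == '"':
        argument = argument[1:-1]
    return field, condition, argument

def entry_is_visible(entry, expr):
    """True if the entry (a dict of field values) satisfies the filter."""
    field, condition, argument = parse_filter(expr)
    value = str(entry.get(field, ""))
    if condition == "&":      # "contains"
        return argument in value
    return value == argument  # "=" means equality

entry = {"SourceText": "This is MyText here", "Counter": 0}
print(entry_is_visible(entry, 'SourceText & "MyText"'))
print(entry_is_visible(entry, "Counter = 0"))
```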

Sort

Sort files only when necessary. Sorting can take some time, because the entire file is actually (physically) sorted, not just the display of the file. WFC adds the convenience of being able to sort source or target text on the number of words or characters in segments. This can be useful for terminology extraction.

Replace

This is a standard Find-replace operation. The operation is done on one specific field at a time. Ctrl+H gives you quick access to this feature when browsing a file.

With this feature, you can replace a word like "Smith" with "Jones" in the TM's target segments. Be careful that find-replace operations do not produce unwanted results.

Example: Changing language codes in your TM
Suppose your source language code is EN and you want to change it into EN-US: Select "SourceLanguageCode" in the list of fields, enter EN as search, enter EN-US as replacement. Click OK.

TMs and glossaries must be created for one language pair only. I also advise keeping separate TMs for different subjects (domains) and clients, and having them in dedicated folders so that keeping track of them, and especially backing them up, remains easy.

TMs keep growing all the time. Most TUs are very unlikely to be re-used, while a minority of them will. Since WFC keeps track of how many times a TU is re-used in the usage counter field, it is advised, when a TM reaches a large size (over 100,000 TUs), or when finishing a large translation project, to perform a compression by eliminating all TUs that have never been re-used. As a result, the TM's size will be considerably reduced, while its overall efficiency will be preserved. To do so:

Creating a startup TM. Create one single, large TM by combining all the TMs you have. Delete all TUs that have a usage counter of less than 3. To compress further, you can visually review the TM and delete TUs that are unlikely to pop up again. To do so, sort the TM on "SourceWords", go to the end of it and review the longest TUs, which are likely "ghost" candidates: longish TUs that are unlikely to show up again. Delete them. This TM can then be used as a primer - if you need to create a new, empty TM, better use a copy of that TM instead, because it contains a "Top 50" or perhaps a "Top 1000" of your previous work. It's like priming a pump with a cup of water.

A WFC TM may contain TUs where the first figure of the date (normally "2", but it can be "1" for TMs created in the previous millennium) is replaced with "x", and which, as a consequence, appear to be "cut" in the editor. This is because, in the course of a translation session, the TU was proposed as 100% match on a green background, but the target segment was edited, so WFC has deleted the original version of the TU in the TM and has re-written the TU's edited version at the end of the TM. This is normal. Do not "resurrect" or un-delete such TUs: their correct version appears further down in the TM. During translation sessions, WFC is blind to TUs that are marked "x". As a rule of thumb, perform a "Reorganisation" of the TM before working on it. This is done with the WFC > Translation memory > TM "Reorganise" button and it erases all TUs that were marked as "Deleted" with an "x" mark in the course of previous translation sessions.

Sharing TMs with other WFC users, or with other CAT tools.
Sharing TMs with other CAT tools: open the TM with the TM/Glossary editor, click Tools, apply the "Export TM as TMX" special filter. The TM will be written out as TMX in a new file with a .tmx extension; the original TM is not overwritten.

WFC segmentation rules

The largest possible unit of segmentation with WFC, as with most translation tools, is the paragraph. Paragraphs end with a paragraph mark (ANSI 13 with or without line feed ANSI 10), page feed (ANSI 12), end of cell (ANSI 7). Note that the manual line feed (ANSI 11) does not end a paragraph. Nevertheless, WFC can be set up to consider the manual line feed as ending a segment: see the section on customizing ESPs, or the note further below.

WFC attempts to recognize individual segments within a paragraph by parsing the paragraph and looking for End of Segment Punctuations (ESPs). The default ESPs used by WFC are . : ! ? as well as the tabulator mark, noted ^t by WFC. Users can edit the list of ESPs to fine-tune segmentation, although that is not recommended, as it breaks the compatibility of their TMs with most other TMs.

If all ESPs are deleted, WFC segments at the whole paragraph level. This is not recommended, as some paragraphs may exceed the acceptable segment limit of 8,000 characters (nearly two large pages!) imposed by WFC, although segments of that size are very rare. If a segment is larger than 8,000 characters, WFC ignores the extra characters, which can be segmented with the "ForceSegment" shortcut.

To remain compatible with most other tools, WFC does not consider the manual line feed (noted ^l ) as ending a segment. Users can add ^l to the user-defined list of ESPs in WFC to break segments when a manual line feed (ANSI code 11, decimal) is encountered, which is generally considered more logical. However, by default, WFC does not end a segment at a manual line feed.

Within a paragraph, WFC will consider that it has reached the end of a segment if:

Rules 2, 3, 4 can be disabled by the user in the WFC > Setup > Segments pane. With CJK languages, rule 2 is always disabled, and the "wide-character" equivalent punctuations are also used.

Rule concerning the beginning of a segment

If a segment begins with a series of numbers (or combination of numbers and full stops) followed by a full stop, WFC assumes that it's a numbering scheme, and skips the apparent numbering scheme. With the following text:

10. This is text

the segment will begin with "This is text", skipping the initial "10.". If the initial number is actually part of the segment, translators can press Alt+Delete (Unsegment), then select the entire sentence and press Shift+Alt+Down (ForceSegment). Translators can also set WFC to always override the number-skipping behaviour with the "SegmentAll" command in Pandora's Box.
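The number-skipping rule can be pictured with a short regular expression. The sketch below is an approximation for illustration only; the pattern, the function name and the segment_all parameter (which merely mimics the effect of the "SegmentAll" command) are assumptions, not WFC internals.

```python
import re

# Digits (optionally with inner full stops) followed by a full stop and spaces.
NUMBERING = re.compile(r"^\d+(?:\.\d+)*\.\s+")

def skip_numbering(paragraph, segment_all=False):
    """Drop an apparent leading numbering scheme unless segment_all is set."""
    if segment_all:
        return paragraph
    return NUMBERING.sub("", paragraph, count=1)

print(skip_numbering("10. This is text"))
print(skip_numbering("10. This is text", segment_all=True))
```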

Parts of text not considered as segments.
Isolated series/combinations of numbers, spaces, punctuation do not constitute a segment. For example,

100
100.89.67.90
100 (9078) // 67-56

will be skipped by WFC as being "numbers". But

100a
100.89.67.90Z
100 (9078) // 67-56P

will all be segmented, because at least one letter is present in each series of numbers/punctuations. The "SegmentAll" command in Pandora's Box will force WFC to segment isolated series of numbers/spaces/punctuation at all times.
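In other words, the presence of at least one letter decides whether a piece of text is offered for translation. A minimal sketch of that test follows; the function name is hypothetical and the segment_all parameter only imitates the effect of the "SegmentAll" command.

```python
def is_translatable(text, segment_all=False):
    """True if the text would be offered as a segment."""
    if segment_all:
        # Assumption: with SegmentAll, any non-blank text is segmented.
        return bool(text.strip())
    # At least one letter turns a series of numbers/punctuation into a segment.
    return any(ch.isalpha() for ch in text)

for sample in ["100", "100.89.67.90", "100 (9078) // 67-56", "100a", "100.89.67.90Z"]:
    print(sample, is_translatable(sample))
```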

Abbreviations

Users can specify a list of abbreviations in WFC > Setup > Segments. WFC will not end a segment if its last series of characters matches any of the abbreviations, case-sensitive. For example, if "Pr." is listed in the user-specified abbreviations, which is the case by default, the following sentence will be considered as making up a whole segment...

Here is Pr. Johnson.

... although "Pr." is followed by a full stop, a space, and a capital letter.
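The effect of the abbreviation list can be sketched as follows. This is a deliberately simplified model, assuming that a segment may end at an ESP followed by a space and a capital letter; WFC's full set of rules is richer. The function name is hypothetical, and apart from "Pr." the abbreviations listed are invented for the example.

```python
import re

ESPS = ".:!?"
# "Pr." comes from the example above; the other entries are hypothetical.
ABBREVIATIONS = ["Pr.", "Mr.", "Dr."]

def split_segments(paragraph, abbreviations=ABBREVIATIONS):
    """Split a paragraph at ESPs, except right after a listed abbreviation."""
    segments, start = [], 0
    # Candidate break: an ESP, one or more spaces, then a capital letter.
    for m in re.finditer(r"[" + re.escape(ESPS) + r"]\s+(?=[A-Z])", paragraph):
        candidate = paragraph[start:m.start() + 1]
        # Case-sensitive test against the abbreviation list.
        if any(candidate.endswith(a) for a in abbreviations):
            continue
        segments.append(candidate)
        start = m.end()
    if start < len(paragraph):
        segments.append(paragraph[start:])
    return segments

print(split_segments("Here is Pr. Johnson. He is late."))
```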

There are many translation-time shortcuts and options that let the translator fine-tune segments to expand them, shrink them, or force a selection of text to be considered a whole segment, regardless of rules. However, translators should remember to prefer default segmentation whenever possible, to remain compatible with other TMs.

The WFC Translation memory format

A WFC translation memory is a tab-delimited text file. It's the simplest of all formats - it can be opened with text editors, like Notepad, or unicode-compliant word processors, as well as with Excel. WFC TMs can be regular ANSI (8-bit) text, or Unicode UTF-16 (both little-endian and big-endian).

A Translation Memory (TM) is a set of lines (paragraphs) of text. In a pure text file where the display does not wrap, lines are paragraphs. The very first line is a header, and all other lines are Translation Units (TUs), sometimes called "entries". Lines/Entries/TUs are sets of fields, a field being any text (even lack of text, which denotes an empty field) followed by a tabulator. In other words, the WFC TM format is Tab-delimited Text, which is arguably one of the oldest, most robust, open, and easy-to-manipulate data formats ever. In the header (the very first line in a TM), each field begins with a % (per cent) mark.

Fields making up a TU:
Here are the first two paragraphs (the TM's header and first Translation Unit) of a TM where the TU is defined as in the table above. Fields are separated with tabulators noted  ↦ here below. Paragraphs are long, so they may wrap in your display - but there are only two paragraphs:

%20041231~160445 ↦ %YAC, Yves A. Champollion ↦ %TU=00000000 ↦ %EN-US ↦ %WFC TM v5.0 ↦ %FR-FR ↦ %87412764 ↦ Domain ↦ Client
20041231~165410 ↦ YAC ↦ EN-US ↦ Red Riding Hood was walking in the woods. ↦ FR-FR ↦ Le Chaperon rouge se promenait dans les bois. ↦ EL PS

The header (first line in the TM) in the example above defines two attributes named Domain and Client. The first TU contains two attribute values: EL and PS. Both attribute names (unique per TM) and attribute values (multiple: one per TU) can be made of up to 64 characters (acronyms are used in the example above: EL for Electronics and PS for a client; however, longer descriptors can be used). Question/exclamation marks ( ! ? ) are forbidden in attribute names and values.
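Because the format is plain tab-delimited text, a TM line can be read with a few lines of Python. The sketch below assumes the field order date, user, usage counter, source language code, source segment, target language code, target segment, followed by attribute values; the field names, the function names and the sample line (including its counter value of 0) are made up for illustration.

```python
FIELDS = ["date", "user", "counter", "source_lang", "source",
          "target_lang", "target"]

def parse_tu(line):
    """Split one TM line into named fields plus a list of attribute values."""
    parts = line.rstrip("\r\n").split("\t")
    tu = dict(zip(FIELDS, parts))
    # Anything after the target segment is treated as attribute values.
    tu["attributes"] = parts[len(FIELDS):]
    return tu

def parse_header(line):
    """Split the header line; strip the leading % where it is present."""
    return [f[1:] if f.startswith("%") else f
            for f in line.rstrip("\r\n").split("\t")]

sample = ("20041231~165410\tYAC\t0\tEN-US\t"
          "Red Riding Hood was walking in the woods.\tFR-FR\t"
          "Le Chaperon rouge se promenait dans les bois.\tEL\tPS")
tu = parse_tu(sample)
print(tu["source"])
print(tu["attributes"])
```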

When reading a TU, WFC defaults on the side of optimism in case the TU does not look correct or canonical. When in a TU:

Files
A translation memory (e.g. WfMemory) generates the following files:

If you need to archive a TM, or send it to a colleague, the only necessary file is the .TXT file. It is recommended to reorganise a TM before sending it to someone (using the WFC/Translation memory/TM/Reorganise button).

If a translation memory is lost, remember that (if you keep copies of your translated, segmented files) cleaning up the segmented files that produced the TM will recreate the corresponding TM with its translation units.

Fault detection (ignoring malformed TUs)
WFC decides that a TU is faulty by counting the tabulators in a line of text: a line with fewer than 6 tabulators cannot form a valid TU. Another fault-detection rule used by WFC is that language codes should be no longer than 5 characters. When language codes of more than 5 characters are encountered during a TM reorganisation, it is an indicator that something is amiss with that particular TU, and it is assumed to be faulty. WFC does not halt on faulty TUs, it ignores them.

Remarks:
The date does not necessarily have a tilde (~) separating date and time. Any printable character can be used there, except a number. WFC uses the tilde (~), the equal (=) sign, and the star sign(*). The equal sign means the TU was "marked" (flagged) by WFC's data editor. This has no consequence on the TU's status: it remains fully valid. Although WFC always records the date and time when writing a TU, the date and time are optional and could be empty (or even made of an invalid date) in which case WFC would simply assume the current computer's date and time, or previous TU incremented by one second, if in a sequential loop. Dates and times are "local", taken from the local computer's clock.
If any optional field is left empty, its trailing tabulator should be present. For a TU to be valid, there must be at least six tabulators, with the fifth field (the source segment, located between the fourth and the fifth tabulator) made of at least one printable character.
The date's first character (a number from 0 to 9, usually a number 2 if the TU was created in the current millennium) can be "x". It means that this TU is not valid anymore - WFC marked it for future deletion. The first full reorganisation of the TM by WFC will erase this TU. Do not remove the "x", or replace it with a number, unless you know what you are doing.
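The fault-detection rules and the "x" mark described above can be combined into one small check. The sketch below is an illustration, not WFC's code: the function name and return labels are hypothetical, and it assumes the language codes are the fourth and sixth fields and the source segment the fifth.

```python
def classify_tu(line):
    """Return 'faulty', 'deleted', or 'valid' for one TU line (not the header)."""
    # Fewer than 6 tabulators cannot form a valid TU.
    if line.count("\t") < 6:
        return "faulty"
    fields = line.rstrip("\r\n").split("\t")
    source_lang, source, target_lang = fields[3], fields[4], fields[5]
    # Language codes longer than 5 characters indicate a damaged TU.
    if len(source_lang) > 5 or len(target_lang) > 5:
        return "faulty"
    # The source segment needs at least one printable character.
    if not source.strip():
        return "faulty"
    # A leading "x" in the date marks the TU for future deletion.
    if fields[0].startswith("x"):
        return "deleted"
    return "valid"

good = "20041231~165410\tYAC\t0\tEN-US\tHello.\tFR-FR\tBonjour.\t"
print(classify_tu(good))
print(classify_tu("x" + good[1:]))
```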

Placeholders

Placeholders are used to encapsulate a few special characters, or tags. A WFC placeholder always has the following format: &tX; where X can take various values: &t=; &tA; &t1; &t#; , etc.

Note to engineers
The ampersand, quotes, as well as the < and > characters are not escape characters, and the ampersand itself is not escaped. The WFC TM format is not a member of the SGML/HTML/XML family. The WFC TM format is simple text, tab-delimited.

Limitation: A WFC TM would create a slightly fuzzy match with text containing the &tX; placeholder, as in this very paragraph. That is a known and accepted minor limitation that has not happened yet in decades.

Tags in a WFC TM

When dealing with so-called tagged documents, a WFC TM records placeholders for tags. Those placeholders have a &tX; format, where X is the order of appearance of tags in the source segment. The X order is noted A (ANSI decimal 65), B, C, etc., up to ANSI decimal code 165. Thus, there can be no more than 100 tags in a WFC segment.

For example, the following "tagged" source segment:

<FONT FACE="Helvetica">This is some text.</FONT>

would appear, in a WFC TM as:

&tA;This is some text.&tB;
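As an illustration of this substitution, the Python sketch below replaces each tag with a lettered placeholder in order of first appearance. It assumes that tags are delimited by < and > and that identical tags share the same letter; the function name is hypothetical and this is not WFC's own code.

```python
import re

TAG = re.compile(r"<[^<>]+>")

def tags_to_placeholders(segment):
    """Replace each tag with &tA;, &tB;, ... in order of first appearance."""
    letters = {}

    def replace(match):
        tag = match.group(0)
        if tag not in letters:
            letters[tag] = chr(65 + len(letters))  # 65 is "A"
        return "&t" + letters[tag] + ";"

    return TAG.sub(replace, segment), letters

text, mapping = tags_to_placeholders('<FONT FACE="Helvetica">This is some text.</FONT>')
print(text)
print(mapping)
```

The returned mapping shows which real tag each letter stands for in that particular segment.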

At translation time, when WFC pulls a TU from the TM and is about to propose the TU's target segment as a translation candidate, WFC uses a substitution algorithm to dress the proposed target segment with the full "real" tags, taken from the document's (not the TM's) source segment, using a triangulation method:

Document's source segment <-> TM's source segment <-> TM's target segment
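A much simplified sketch of the last leg of that triangulation is given below. It assumes the document's source segment carries the same sequence of tags as the TM's source segment, so that letters can be mapped straight back to the document's real tags; the function name is hypothetical and placeholders with no matching tag are left untouched.

```python
import re

def restore_tags(document_source, tm_target):
    """Dress a TM target segment with the real tags of the document's source segment."""
    tags = []
    for tag in re.findall(r"<[^<>]+>", document_source):
        if tag not in tags:  # identical tags share one letter
            tags.append(tag)

    def replace(match):
        index = ord(match.group(1)) - 65  # "A" -> 0, "B" -> 1, ...
        return tags[index] if index < len(tags) else match.group(0)

    # Letters run from code 65 upwards; other placeholders are ignored.
    return re.sub(r"&t([\x41-\xa5]);", replace, tm_target)

print(restore_tags("<FT>This is some text<AR> here<FT>.",
                   "&tB;Voici du texte&tA; ici."))
```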

The triangulation can be successful only if all target tags have a "parent" tag in the source segment. In the rare cases when the TM's target segment has tags that do not appear in the TM's source segment (orphaned tags), WFC records the full syntax of these orphaned tags at TU creation time, so that they can be restored properly at translation time, when the target segment must be proposed with the correct format. If we have, at TU creation time:

then the target segment would be recorded in the TM as:

&tA;Voici du texte&t=&nbsp\;;:

where &t= opens the original tag syntax (&nbsp; in our example) and ; (semicolon) closes the sequence.

Other examples of segments:

In source segment: <FT>This is some text<AR> here<FT>.
In target segment: <AR>Voici du texte<FT> ici.
In TM TU source: &tA;This is some text&tB; here&tA;.
In TM TU target: &tB;Voici du texte&tA; ici.

In source segment: <FT>This is some text<AR> here.
In target segment: <AR>Voici du<AR> texte ici<FT>.
In TM TU source: &tA;This is some text&tB; here.
In TM TU target: &tB;Voici du&tB; texte&t=; ici&tA;.

In most translation memory systems, TMs are overloaded with tags that do not belong there. A TM becomes significant when its content is put to (re-)use, that is, when its past translations are leveraged for a new translation project. Re-using TM content is only done in the presence of a new document to be translated. In other words, at use time, the software operates a triangulation between a new document's new source segment, which contains the new formatting, and an existing TM source/target pair, which contains formatting placeholders.

Troubleshooting

Note: the WFC wiki, accessible from http://www.wordfast.net, offers more searchable content.

I installed WFC but I don't see the toolbar

Press Ctrl+Alt+W.
If no document is open, open a document.
If the View/Toolbars menu has a "WFC" item, click it. Otherwise, use the Tools/Templates & AddIns... menu to Add the Wordfast.dot (dot, not doc) template to your list of templates.
See WFC refuses to start.

Some, or all shortcuts, do not respond any more

The major causes are:

If you tried all methods described above and shortcuts are not functional, exit Ms-Word, search for, then rename all Normal.dot files to Normal.old and restart Ms-Word.

With Word 2002/2003, the first key I type when a segment opens does not respond
See the "FirstKeyControl" command in the Pandora's Box section.

My keyboard keeps changing (shifts from one language to another)
See the previous point on shortcuts, list item 4.
If the first key you type when a segment opens does not respond, see the previous item.

I cannot type special, or accented, characters any more
Same as above.

My Antivirus says WFC is, or contains, a virus
See the entry on macro viruses in this manual's glossary.

Ms-Word 97
One known bug, documented by Microsoft at http://support.microsoft.com/default.aspx?scid=kb;EN-US;q162349, is that if the document has graphics that were pasted into it (as is often the case with screenshots), Ms-Word97 may not have the resources needed to display them, and the graphics could be changed into empty boxes containing a red cross. All subsequent efforts to restore the graphics will fail.
This is likely to happen if the document was created, or manipulated, with a non-SR1 Ms-Word97 (SR-1 being a bug fix, or patch, distributed by Microsoft - see Microsoft, not WFC, support). Furthermore, having the "Allow fast save" option checked in Tools/Options/Save aggravates the situation.
With Ms-Word97, I recommend turning off "Allow fast save" (and saving manually and frequently with Ctrl+S), because this feature is known to drain resources and create problems.

Microsoft Outlook
Some versions of Outlook can be a problem if they are set to use Ms-Word as email editor. If this is the case, uncheck the "Protect delimiters..." checkbox in WFC/Setup/Segments. Then, in Ms-Word, use View/Toolbars/Customize/Keyboard/Reset all to reset shortcuts. Close Ms-Word and Outlook and try again. If this does not solve the problem, you may have to either not use WFC and Outlook at the same time, or use Outlook's HTML-mode editing (not Ms-Word as an email editor), which offers most of the facilities provided by Ms-Word.

The main WFC setup window does not display any text
Make sure the Tahoma font is available in your system. Normally, when Ms-Word is installed, the Tahoma font is automatically added to your system by Microsoft.

I see lots of blue, or red, text with a line through the middle
Turn off the revision ("Track changes") mode. Remove the protection, if there is one (Tools/Remove protection) before translation. Normally, documents in revision or "Track changes" mode should not be translated with a CAT tool until this mode is turned off.

WFC (not Ms-Word) says "Sorry, this file is read-only"
This means your translation memory (not your document) has a read-only attribute. This usually happens if the TM was intentionally write-protected, or if the TM comes from a CD-ROM. With your disc explorer, right-click the file, click Properties (with a Mac, use Option+i) and uncheck the read-only checkbox.
Also, on some recent systems, certain folders are write-protected. Make sure your TM is not located in an Ms-Office folder, or in any system folder. Create a folder for translation memories.
Do not confuse this message with Ms-Word warning you that a document is in read-only mode.

Erratic behaviour during translation sessions
Some Pandora's box commands need to be turned off after use, like "BetterMatch=Write", "Skip" commands etc, because they may produce unwanted results on documents other than the ones for which they were set. The same consideration applies to macros: turn them off when they're no longer needed. If you never used Pandora's box commands however, there is no need to check this point.
Remember that some options set in Tools/Options and Tools/Autocorrect may also cause erratic behaviour (such as replacing quotes, changing text automatically etc).

Make sure some of WFC's shortcuts are not hijacked by another template.

The first key typed when a segment opens does not respond.
See the "FirstKeyControl" command in the Pandora's Box section.

See the point on invalid or corrupted normal.dot.
See the point on multiple keyboards.

My keyboard keeps changing
See the point on multiple keyboards.
Make sure you are not on "Automatic language recognition". This option can create problems and make Ms-Word "panic" during translation. Use Ms-Word's Tools/Language menu to make sure.

Ms-Word does not look the same
Or, "After a translation session, Ms-Word displays paragraph marks or field codes or a strange font, or pictures are not displayed etc."
During translation, WFC has to modify some display options in order to function properly. When a translation session ends, your previous display setup should normally be restored. If this is not the case, don't panic - just click Ms-Word's Tools menu, then Options (Edit/Preferences in some Mac versions), click the View tab and check/uncheck the necessary options to restore your usual display setup. Get acquainted with the "Options" dialog box and the various View settings.
You may also have to use Ms-Word's View menu and change from/into the "Page" view.

Ill-behaved documents
Some customers send documents that were originally attached to a template (this can be checked by opening the document, then using the Tools/Templates & Add-Ins menu and looking at the top textbox). If a reference is made to a template that is not present on your hard disk, expect trouble. Contact your customer. Deleting the reference to a non-existent template will usually solve the problem, but should be done with the customer's consent and knowledge, so that the template attachment can later be restored.
If the Ms-Word document has many fields (Tools/Options/View/Field codes or Alt+F9 can be used to display field codes) that refer to non-existent graphics, indexes, links etc, you can have erratic document behaviour. If your customer cannot provide you with the referenced objects, make sure that Tools/Options/General does not require Ms-Word to update links when opening the document.
Large RTF files with complex layout and/or fields, which were created with different software, or even with just another version of Ms-Word, can behave strangely and cause Ms-Word to crash. In desperate cases, try importing a problem document into a new, empty document (using copy-paste or Insert/File), with the client's consent.

Always inform the client if you have to fiddle with documents.
The bottom line is that the customer should provide the translator with a clean, stable document. A good third of service calls to the WFC hotline are actually caused by ill-behaved documents, and another third by systems, or Ms-Word installations, that are not stable.

Ill-behaved templates
You are welcome to run WFC together with other templates or Ms-Word add-ins, but please understand that I cannot guarantee the reliability of such a practice. A lot of shortcut conflicts or mysterious behaviours with Ms-Word and WFC are simply due to the presence of other templates or Ms-Word add-ins that monopolise shortcuts.

Ms-Word templates and add-ins are programs that usually contain VBA code. There are many ways of writing VBA, some of which are not really professional, resulting in poorly engineered applications.
Microsoft introduced a much more reliable and modern environment with the 32-bit VBA architecture of Word97 and higher versions. Unfortunately, many programmers still use antiquated techniques dating back to Ms-Dos (8-bit architecture) or Windows 3 (16-bit architecture): for example, absolute I/O file numbers (instead of the FreeFile function offered by Microsoft), or WORDBASIC functions.

A document was closed with an open segment:
Normally, press Alt+Down, and WFC will attempt to recover the segment, and usually can do it.
Try starting a session by opening the segment that was left open. If this does not solve the problem, close the document without saving it, go to the segment that was left open and do the following:

Open the Bookmark dialog box from the Insert menu. Delete all bookmarks that begin with Wf (such as WfTU, WfSource, WfTarget etc).
Delete all paragraph marks within the problem segment. As a result, the coloured backgrounds disappear. If they don't, select the paragraph then use Ctrl-Q or Format/Borders and shadings to remove backgrounds.

Make sure the delimiters (the little purple symbols) are correctly set.
Save your document and resume the translation session.

I want to service my TM, but I keep getting the message "This file is used by another process".
Most likely, the TM is currently opened in Ms-Excel. Or it is being shared through a network, or in two simultaneous Ms-Word sessions, or the previous translation session was not terminated properly. Do not service a TM currently used across a network. If you're not networking, close Ms-Word. With your disc Explorer, find the folder where the translation memory is, and delete the translation memory file that has the ".net" extension - and only this file. If this does not work, reboot your system.

Terminology recognition does not work:
Run the following checklist:

Slow performance or frequent "Out of memory":
Many systems are overloaded with fonts. Many applications add unwanted fonts to your system without telling you. In Windows, see the \Windows\Fonts (or \Winnt\Fonts) folder. If you have more than 50 fonts, consider the following. Create a \Windows\Font2 folder and drag-drop into this new folder all the fonts that are found in \Windows\Fonts and that are not vital. If these fonts are later required, you can drag them back into the \Windows\Fonts (or \Winnt\Fonts) folder. Note that any font that is located in the \Windows\Fonts folder burdens your system, gobbling RAM and resources. There are many other ways to make sure your system is streamlined for optimal professional use, but this is beyond the scope of this manual. In any case, if your system is used for games, or intensive multimedia activities, or other purposes, especially by other people, expect trouble. You cannot use a workstation for gaming, or heavy graphical/multimedia applications, and expect it to be utterly stable with the full Microsoft Office environment.

On slower computers (less than 200 MHz and/or less than 32 MB of RAM, and/or very slow video cards), I recommend using some or all of the following methods:

Turn off spell/grammar check during the translation session (make spell-check an after-translation task).
Decrease the colour depth of your display to 16 or 256 colours, at least during translation sessions.
Using Tools/Options, uncheck the "Paginate" option to prevent Ms-Word from constantly re-paginating your document. Work in Normal view mode, not Page or Print view.
Turn off the "Autosave" function in Tools/Options or Preferences. During translation, press Ctrl+S once in a while to save your document.
In extreme cases, use the Draft font option in the View tab of the Tools/Options menu in Ms-Word; in the View menu, select Normal rather than Page.
With large TMs (over 50,000 TUs), reorganise the TM at least once a week with the Reorganise button in WFC/Translation memories/TM. Do some TM maintenance.
Uncheck "Allow fast save" in Tools/Options/Save (or "Preferences" in a Mac). "Allow fast save" is known to drain resources and may cause Word to crash.
Desperate cases: pre-translate (WFC/Tools/Tools/Translate) the document before working on it.

Bugs and Crashes
Windows 9x, Millennium (and 2000 to some extent), as well as Mac OS 7, 8, 9, are not "mission-critical" or bullet-proof OSs like Unix or Linux, for example. They're variations of earlier OSs that were running with the limited resources of the late 20th century, over a rather primitive architecture.
All applications, but especially Ms-Word, keep robbing more RAM resources to display fonts, graphics, temporary text editing, undo information, etc as you open documents, scroll, type, edit etc. Furthermore, Ms-Word does many tasks in the background, while those OSs are not really multi-tasking.

If Ms-Word crashes while WFC is active: re-start your system and try the same task again with a "fresh" system. Temporarily turn off fancy system add-ons that are supposed to miraculously guard your system from crashes, boost power, enhance the desktop, defragment in the background etc. Just keep the antivirus, but temporarily turn it off for testing purposes. In Ms-Word, turn off any template or add-in other than WFC (go to Tools/Templates & Add-Ins, uncheck templates and add-ins). If you can duplicate the same crash (or freezing) on a bare system, WFC may be the cause of the crash. In case of freezing, try pressing Ctrl+Pause (or Ctrl+Break on some keyboards), then click End if a dialog box appears. If you have the chance, try executing the same job or task with WFC on another computer before concluding that WFC is responsible for the crash.

In such a case, make sure you have the latest version of WFC (compare your version with the one in www.wordfast.net). If the crash persists with the latest version of WFC, use the hotline on www.wordfast.net to let us know. We will process the report as quickly as possible.
The Windows 2000+Ms-Word 2000 (or higher versions) combination is known to be significantly more stable.

WFC refuses to start
This could be due to one of the following reasons:

Macintosh
Use simple folder and file names for TMs and glossaries, without accented letters, spaces, punctuation, or symbols, less than 32 characters in length. This point does not concern documents, only TMs and glossaries.

This manual's glossary

(Terms that are already part of the Ms-Word environment are treated briefly - Refer to Ms-Word's Help or Manual for a more complete definition)

CAT: CAT stands for Computer-Assisted Translation. CAT broadly refers to software used by professional translators to boost productivity, and/or enjoy a more comfortable environment. Other acronyms have been added, which only create confusion. In the end, no acronym or naming is ever perfect, and CAT remains the most widely used term.

TMX: TMX stands for Translation Memory eXchange. TMX is a gateway format that allows translation tools to exchange TM content. Most Computer-Assisted Translation (CAT) tools do not use TMX as their native format, but nearly all of them can import from, or export to, the TMX format. As in all conversions, a minor quantity of information may be lost, such as attributes (meta tags).

XLIFF, TTX, TXML: Those are formats made to hold document content during the CAT process. Thus, filters have to be used ahead of translation to transform a document (DOC, XLS, PDF, HTML, whatever) into one of those formats; when the translation and proofreading are complete, the same filter must be used to reconstruct the translated document into its original format. Those formats are usually written in XML.

Translation Unit (TU): A TU is a set of source and target segments. A TU also records creation date, plus optional attributes (see below).

Translation Memory (TM): A TM is a set of TUs - a database of TUs. Practically every translation tool has its own format. WFC has its own format but, unlike most other tools, it's an open format, which can be edited with a wide variety of editors. The TMX translation memory format is a gateway between different TM formats. WFC supports TMX.

Attributes: (aka metatags in translation units) A TU is a pair of source and target sentences. TUs have built-in attributes that record information (creation date, language codes, etc.). Five user-definable attributes can be customized. A typical attribute is the identity of the translator who generated the TU. Other attributes can be subject, client, job number, etc. Each of the five attributes can have many values, stored in a drop-down list, visible in the WFC/Translation memory/TM Attributes tab. For example, the "Subject" attribute could have three possible values, such as "Scientific", "Literary" and "Business". The value that is visible in the drop-down list is said to be the "active" value.
Attributes can help organize TMs. See the Attributes section for more information.

Match: One purpose of a translation tool is to find "matches" in the TM for the source segment you are currently translating. When a fuzzy match is found, the segment will display a percentage rating the match's resemblance to the TM's reference source segment. Bear in mind that this is a purely statistical and blind computational process; it has little to no semantic relevance. Some CAT tools evoke "semantic" neural networks or "linguistic" sense - those claims are as reliable as washing powder hype.

Penalty: When a match value is being calculated, penalties can be applied to lower the match value. Usually, these penalties are based on an attribute variance. See the relevant section on penalties.

Microsoft Word (Ms-Word): The application (or software) with which you are currently reading this manual. Ms-Word is generally used at a fraction of its capacities. A professional translator will gain a lot by learning a few advanced functions, such as smart Find-Replaces (see the Appendix IV below), customizing toolbars and shortcuts, and essential macro knowledge. Consider seeking expert help or training: this investment in time or money will be recouped very quickly. I have created a step-by-step manual called "Ms-Word for translators" that you can use for free.

Microsoft Office: a collection of applications, usually sold and installed together, of which Ms-Word is a member. Ms-Office includes Ms-Word, Excel, PowerPoint, Access, FrontPage, Publisher, Outlook, OneNote, etc, although the restricted version of Ms-Office usually offers only Ms-Word and Excel.

VBA (Visual Basic for Applications) is a programming language shared by all Ms-Office applications. WFC is written in pure, original VBA, without add-ons, OCX, etc. - this is why it runs on both Windows and Mac, and is ready to be ported to other platforms.

Macro. An intensive use of Ms-Word can sometimes lead to highly repetitive tasks (imagine you have to change the first paragraph font on a hundred documents). The macro recorder can record a series of actions done in Ms-Word into a macro named by you; from then on, you can execute this macro as many times as necessary by simply calling the macro dialog box (Alt+F8) and executing the macro, or better, by assigning the macro a shortcut. Macros are written in VBA. Press Alt+F11 or use Tools/Macro/Visual Basic Editor to open the VBA editor window, where your recorded macros will appear, in the code module(s) of your "Normal" template.

Macrovirus (or Ms-Word virus). Any piece of executable code, in practically any language, is a potential virus. The only difference between an application and a virus is the fact that a virus was created to hurt, harm or destroy. Both Ms-Word documents and Ms-Word templates can contain VBA code, as well as many other formats, like graphics etc.
Most operating systems released after 2010 have a built-in virus protection, and do very well without an antivirus. Many antivirus programs place an immense burden on the system and end up being worse than the ill they are supposed to cure.

Every release of WFC is scanned before being made available for download. As of September 2019, over 35,000 registered users are using WFC, and over 6,000 of them contribute daily to a public discussion group.

If your antivirus reports that WFC is a virus: this happens with roughly one antivirus in 20. WFC holds lots of VBA code, and antivirus applications with a shallow or unreliable virus-detection algorithm can falsely report WFC as a virus. You should do any, or all, of the following:

Immediately test WFC with another antivirus - perhaps by asking a colleague equipped with a different brand of antivirus. If an antivirus of another brand also reports WFC as a virus, then the matter is serious: WFC has perhaps been infected by an infected document or template. Kindly report this to sales@wordfast.net.
Contact the maker of your antivirus, report the alarm, ask them to download WFC as you have done, so they can also test it. Then they should (if they are serious and honest) modify their antivirus software, or prove that WFC is a virus - one of the two.
Contact the WFC hotline at info@wordfast.net. No need to post panicking mails in the mailing list: all such mails until now were proved to be false alarms, and an embarrassment for their authors.

Documents and Templates: A document holds contents, i.e., text.
A Template is a model of document that proposes a preset layout, so that the user can concentrate on contents rather than on appearance. Templates can also be used as Add-Ins, extending Ms-Word's capacities. WFC is an Add-In.
Normally, a template is not opened as a document: it is either used to create new documents with a certain preset appearance, or it is added to Ms-Word's list of templates, using the Tools/Templates & Add-Ins menu. WFC belongs to this last category.

Toolbars: (this is for Ms-Word versions prior to Ms-Word 2007. From Ms-Word 2007 onward, toolbars are contained in a so-called Ribbon, which can be either minimized (Ctrl+F1), or occupy substantial screen real-estate.)
Ms-Word's "View" menu has a "Toolbars" option (right-click in the toolbar area to get there quickly) that lets you turn toolbars off and on. Turn off toolbars you are not using: they take up space and load the visual field, creating confusion. Use the same menu's "Customise" option to customise toolbars.
In the "Customise" dialog box, go to the "Commands" tab (the second one). Experiment by clicking in the list of commands, holding the button down, and dragging a command. Drop its icon in a toolbar of your choice. You have just added an icon to your toolbar. If you make intensive use of an Ms-Word function and keep using menus, it is recommended to drop the corresponding command in a toolbar for quick access.
To remove an icon from a toolbar (the "Customise" dialog box being visible), drag its icon and drop it outside the toolbar: it will be removed.
I encourage WFC users to add the following two icons in either the "Standard" or the "Formatting" toolbar: Format/PasteFormat (play with it to learn how powerful it is. The icon looks like a brush, or a short broomstick); View/FieldCodes. Do not customise WFC's toolbars.

Selection: Dragging the mouse over text in a document while holding the left button (Windows) or the single button (Mac) will select a portion of the document, which then appears in reverse video, usually white on black. The insertion point (the blinking cursor) disappears when a selection is made. A selection can also be made by holding either Shift key down and moving the cursor by means of the arrow keys.
When a selection is cancelled, the insertion point, or cursor, appears again.

Bookmarks: A bookmark, as in a paper book, is inserted at some position in the document so that we can get back there quickly at a later time. Use the Insert menu to insert a bookmark over the current selection, or at the insertion point. The bookmark has to be given a name. The bookmark will "remember" the selection's position and extent in the document.
Bookmarks are saved together with a document.
Ms-Word's Tools/Options/View dialog box can be used to have the position and span of bookmarks made visible with grey [brackets].

Bookmarks are part of the document, and play a crucial role in documents that have links, automatic indexes, table of contents etc. The translation process may require bookmarks to be transferred into the translated text, at the appropriate position, extending over a corresponding length of text, retaining the same bookmark name. Since two bookmarks cannot have the same name in the same document, WFC proposes ways to handle them during the translation process. Refer to the Bookmarks section.

Refer to Ms-Word's Help or Manual for more information on bookmarks.

Fields: Fields can be inserted into a document using the Insert/Field... menu. A field usually contains a code that has to be calculated, computed or in some way, processed by Ms-Word. Thus, there are two ways of looking at fields: the code, or the result. Use Tools/Options/View to toggle field display modes, or use the Alt+F9 shortcut.
Note that fields are calculated at the moment when they are created. Placing the cursor over a field and pressing F9 will force the update (the recalculation) of the field.
A field that has not been updated may perhaps not show a correct value. For example, a Table of Contents, which is produced by a TOC field, may not necessarily be up-to-date.
If the update produces an error, the field will display an error message.

Refer to Ms-Word's Help or Manual for more information on fields.

Tags: See the section on Tags for a thorough presentation of tags. This term refers only to special untranslatable elements (usually grey or red) found in a particular category of files known as "tagged files", pre-processed for translation with adequate software (Rainbow Horizon, PlusTools, Trados Stagger, etc).

Delimiters (segment delimiters): Those should not be confused with tags. Delimiters are the purple symbols that delimit the beginning and end of both source and target segments, such as .
A bad segment is a segment where delimiters have suffered from deletion, addition, or editing. Bad segments create problems at cleanup time. They can be manually fixed by reproducing the set of delimiters found in a healthy segment. The protection of delimiters may have to be turned off (the shortcut is Ctrl+Alt+F12) before you fix delimiters.

Segment: A segment is an elementary unit of translation. Segments are usually sentences. In some cases, it may be necessary to translate entire paragraphs rather than sentences, but this is rarely the case.

During a translation session, the current segment (with the coloured background for source and target segments) is said to be open. Segments should be left open only for the duration of the translation process. If you need to take a break, complete your current segment, then press Alt+End to close both the segment and the translation session.

Commit a segment: A segment is committed when the translator presses Alt+Down or Alt+End on an opened segment during a translation session, thereby "closing" the currently opened segment. At that moment, if the source/target pair does not exist in the TM, the pair of sentences will be added to the TM (subject to TM rules). Shift+Alt+End (Close segment) will close a segment without committing it to the TM.

Source, target: Translation is done from a source language (in)to a target language. A translation project may have one source language and many target languages. Most translators, however, deal with one source language and one target language, in which case, we speak of a language pair.

Appendix I - Segmentation & TM

Segmentation

WFC considers a document as a set of segments, a segment usually being a sentence that ends with an end-of-segment punctuation (ESP) mark such as a full stop, question mark, etc. (the ESPs are customizable in WFC's Setup/General tab). Paragraph marks, page breaks, end-of-cell marks, tabs, etc. always end a segment. The following example contains 10 segments:

The mark-ups for retail are as follows:  for class A stores, 10%.  For class B stores, 15%.  Please observe the following chart:

Class	Class A	Class B
Mark-up.  No exceptions	10%	15%

These mark-ups must be applied at all times.

Note that the isolated 10% and 15% are not considered segments. A segment must have at least one translatable item (at least one letter). See the segment example. It is possible, however, to force WFC to segment such untranslatable text: see Pandora's box "SegmentAll" command.
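The segmentation rules described above can be approximated with a short Python sketch. It is a simplification, not WFC's segmenter: the ESP set (full stop, colon, question mark, exclamation mark) is an assumption inferred from the example, table cells are represented by tabs, and the names segment and SAMPLE are hypothetical. Real-world cases such as abbreviations are ignored.

```python
import re

ESP = ".:?!"  # assumed end-of-segment punctuation; customizable in WFC

# The example above, with table cells separated by tabs.
SAMPLE = (
    "The mark-ups for retail are as follows:  for class A stores, 10%.  "
    "For class B stores, 15%.  Please observe the following chart:\n"
    "Class\tClass A\tClass B\n"
    "Mark-up.  No exceptions\t10%\t15%\n"
    "These mark-ups must be applied at all times."
)


def segment(text):
    """Split text into segments that contain at least one letter."""
    segments = []
    # Paragraph marks and tabs (cell boundaries here) always end a segment.
    for chunk in re.split(r"[\n\t]+", text):
        # Inside a chunk, split after an ESP character followed by whitespace.
        for piece in re.split(r"(?<=[" + re.escape(ESP) + r"])\s+", chunk):
            piece = piece.strip()
            # A segment must have at least one translatable item (a letter).
            if any(ch.isalpha() for ch in piece):
                segments.append(piece)
    return segments


for number, text in enumerate(segment(SAMPLE), start=1):
    print(number, text)
```

Splitting only when the ESP is followed by whitespace keeps figures such as 11.5% in one piece.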

Even in the absence of translation memory, a segmenter saves time and boosts productivity. The problems, when translating from a printed document, are:

Eye strain.
You will constantly move back and forth between the paper document and the computer screen. Your eyes will have to re-focus many times every minute. A lot of translators end up, after a number of years, with severe sight problems.

Brain strain.
After having translated a sentence, you will have to look again at your paper sheet and locate the exact position of the last sentence and read the next one. This exercise requires attention and drains intellectual power.

Professional errors.
Because of problem 2, it regularly happens that we skip a sentence, not to mention an entire paragraph, which is a serious professional error. Perhaps the document is made of a series of 100 nearly identical sentences, with slightly different numerical parameters, like

Please apply the following mark-up for Class A: 10%
Please apply the following mark-up for Class B: 12% but exclude zone TT-001
Please apply the following mark-up for Class C: 11.5%
Please apply the following mark-up for Class F: 13%
Please apply the following mark-up for Class P: 9%

...etc for 3 pages!

If one line is forgotten, the translator becomes responsible for a serious professional error.

Working with a segmenter on an electronic original, you will not have to worry a second. The segmenter will faithfully segment the document and ask you to translate every segment, without forgetting a drop. Furthermore, in the above example, once you have translated the first line, WFC will actually recognise the next lines and pre-translate them for you.

More professional errors.
Look at the second line, with the TT-001 parameter. This parameter should not be translated, but faithfully copied. Now, make sure you type Zero-Zero-One and not O-O-I. Seems easy? Technical documents are full of such Byzantine parameters. To us, they're annoying. To the customer, they're vital. Mis-type just one, and the customer ends up with a faulty manual.
WFC has a Quality Assurance algorithm that will warn you if the untranslatable parameters are not faithfully copied from source to target. It also has QA functions to help respect the customer's specs on typography.
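To give an idea of what such a check involves, here is a minimal Python sketch. It is not WFC's QA algorithm: it assumes that an "untranslatable" is any whitespace-delimited token containing at least one digit, and the names untranslatables and missing_in_target, as well as the French sample sentences, are made up for illustration.

```python
from collections import Counter


def untranslatables(text):
    """Count the tokens that contain a digit, ignoring surrounding punctuation."""
    found = Counter()
    for token in text.split():
        token = token.strip(".,;:!?()\"'")
        if any(ch.isdigit() for ch in token):
            found[token] += 1
    return found


def missing_in_target(source, target):
    """Untranslatables of the source that the target lacks (with counts)."""
    return untranslatables(source) - untranslatables(target)


source = "Please apply the following mark-up for Class B: 12% but exclude zone TT-001"
good = "Veuillez appliquer la marge suivante pour la classe B : 12% sauf dans la zone TT-001"
bad = "Veuillez appliquer la marge suivante pour la classe B : 12% sauf dans la zone TT-OO1"

print(dict(missing_in_target(source, good)))
print(dict(missing_in_target(source, bad)))
```

A check this naive would raise false alarms whenever the target language formats numbers differently (for example 11,5 % for 11.5%), which a real QA function has to take into account.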

Document layout.
Look again at the above example on segmentation. If you translate from paper, you will have to re-create that fancy layout, fiddling with formats, tables, borders, colours, fonts, etc. With WFC, every target segment is formatted like the source segment (this is true at segment level, with the first source character defining the format of the target segment. WFC makes every effort to duplicate the styles of, for example, untranslatable elements; in some cases, you may have to manually apply bold, italic, etc. within the segment).

Terminology consistency.
Over a large project (say you receive 50 pages every month, so you work for this particular customer 5 days a month, for 12 months), every time you work, you will have to remember the customer's glossary. With WFC, you create and save a particular setup for each customer, which remembers TM and glossaries. WFC will warn you every time the translation's terminology is in conflict with the customer's glossary.

Translation Memory

The natural complement of a segmenter is translation memory. Every time a segment is translated, it is stored in the TM. Thus, a TM is a database of Translation Units (TU). A TU records source & target segments, date of creation, languages used, and the ID of the TU's creator. It also has a usage counter that records how many times a TU was re-used. The more often a TU is re-used, the more valuable it is.
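
As an illustration only, the fields listed above can be pictured as a small record. The sketch below is written in Python; the class and field names are hypothetical and do not describe the actual layout of a WFC TM file.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class TranslationUnit:
    # Hypothetical field names; they mirror the list in the text, not WFC's file format.
    source: str
    target: str
    source_lang: str
    target_lang: str
    creator_id: str
    created: datetime
    usage_count: int = 0

    def reuse(self):
        """Return the stored translation and bump the usage counter."""
        self.usage_count += 1
        return self.target

tu = TranslationUnit("This is my first translation.", "Voici ma première traduction.",
                     "EN-US", "FR-FR", "YC", datetime(2016, 1, 15, 11, 10, 26))
tu.reuse()
print(tu.usage_count)   # 1
```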

Translation memory, mostly on technical documents, can save a lot of time, because WFC will recognise segments that were already translated and propose them - you only have to check, validate and move on.

When WFC has delimited a segment, it will scan the TM, searching for an exact or approximate match to the source segment. If a match is found, the TU's target segment (the recorded translation) is proposed. WFC will display a number, ranging from 0 to 100, that rates the degree of similarity between the document's source segment and the TU's source segment. A 100% match is considered exact. A match under 100% but equal to or above the (user-definable) fuzzy threshold is considered fuzzy; beneath that value, it is considered a no-match and will not be proposed.
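
The lookup just described can be sketched as follows. This is a Python illustration under stated assumptions: the similarity measure is Python's standard difflib, used as a stand-in because WFC's own scoring method is not described here, and the default threshold of 75 is only an example value.

```python
from difflib import SequenceMatcher

def match_score(segment, tu_source):
    """Similarity of two segments as a whole number from 0 to 100 (difflib stand-in)."""
    return int(SequenceMatcher(None, segment, tu_source).ratio() * 100)

def best_match(segment, tm, threshold=75):
    """Scan `tm` (a dict mapping source segments to translations).

    Return (score, translation) for the best TU at or above the fuzzy
    threshold, or None when every TU scores below it (a no-match).
    """
    best = None
    for tu_source, tu_target in tm.items():
        score = match_score(segment, tu_source)
        if score >= threshold and (best is None or score > best[0]):
            best = (score, tu_target)
    return best

tm = {"Please apply the following mark-up for Class A: 10%":
      "Veuillez appliquer la majoration suivante pour la classe A : 10 %"}
exact = best_match("Please apply the following mark-up for Class A: 10%", tm)
fuzzy = best_match("Please apply the following mark-up for Class B: 12%", tm)
print(exact[0], fuzzy[0] < 100)   # 100 True
```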

If a translation is proposed, pressing Ctrl+Alt+M (Memory) will display the TU that was found during the TM's scan. In the case of a fuzzy match, differences between the document's source segment and the TU's source segment are highlighted. The TM management section contains valuable supplementary information.
If WFC has found many matches, pressing Alt+Right/Left will display matches with lower/greater match value.

Appendix II - Language & spell check

A document can contain text written in different languages. In Ms-Word, the language is a text attribute, just like font, colour, etc. The Tools/Language menu is used to apply a certain language to a selection. This language setting is important, for example, when spell-checking.
Usually, the client will send you a document where all the text has the source language (e.g. "English") as attribute. When translating, it is important that the target text receives the target language (e.g. "French") as attribute. This allows you to spell-check the target segments using the proper dictionary. This should be set up in WFC's Setup/Segments tab.

WFC will apply the specified target language (or default language, as specified in WFC/Setup/Segments) to the target segment. If, however, you have chosen the "leave unchanged" setting, WFC will not redefine the target language.

Appendix III - Macro samples

About all macros entered with Pandora's Box "Macro..." commands

(PC only): if instead of a macro name, you enter "Keys=" followed by a string of text similar to the one described in the Dictionary keys section, then WFC will execute the keystrokes you have defined. For example, using
Keys=L&H;^a{Delete}{SourceSegment}{Home}%tt{Ms-Word}
in conjunction with the English version of L&H's Power Translator 7 text-to-speech function, will read aloud your source segment at the time it is presented for translation.

Creating macros
Macros should normally be entered in Normal.dot. To do so: in Ms-Word, use Tools/Macro/Visual basic editor (or press Alt+F11) to open the VBA window. In the left side of the window, double-click "Normal". If there is no module, use the Insert menu to add a module. Usually, a new module called "Module1" is added. Double-click it. A window should open to the right: this is where you should copy-paste the macros given below, and edit them as needed. These macros will be saved with Normal.dot when you exit Ms-Word.

The WFC hotline does not offer support on VBA and macros. Refer to your Ms-Word manual, or to literature on the subject.
In WFC/PB/Macro..., you should enter "Normal.Module1.CheckLength" if, for example, you want to try the first macro described below (either as a QA macro, or as a post-segmentation macro).

To associate a macro with a shortcut: use the View/Toolbars/Customise menu. Click "Keyboard". In the leftmost list, choose "Macros" as category. In the rightmost list, click the macro name. Enter the shortcut in the textbox, then click "Assign". Close the dialog box.

You should refrain from using the following statements or instructions in macros you intend to use with WFC

If you need to open and close I/O files on disk, remember to use the FreeFile() function to ask VBA for an available I/O file number. Otherwise, your macro may conflict with a file already in use by WFC.

If you want your QA or post-segmentation macro to refuse to validate the segment and prompt the user to correct the translation, your macro should add a "WfStop" bookmark anywhere in the document (simply insert a Selection.Bookmarks.Add "WfStop" instruction before ending the macro). If WFC finds such a bookmark, it will cancel segment validation, remove the bookmark and take the user back to the target segment.

Checking segment character count

Here is a typical QA macro using the interactive mode just described above. It checks the target segment to make sure it's not longer than 80 characters (spaces included). If it is, it warns the user and sends him/her back to the segment:

Sub CheckLength()
    If Not ActiveDocument.Bookmarks.Exists("WfTarget") Then Exit Sub
    If Len(ActiveDocument.Bookmarks("WfTarget").Range.Text) > 80 Then
        If MsgBox("Target > 80 signs! Stop and edit?", vbYesNo, "WFC") = vbYes Then
            Selection.Bookmarks.Add "WfStop"
        End If
    End If
End Sub

Checking segment visible length

The following macro does the same as the previous macro, but this time, the visible length of text is compared rather than just the number of characters. Note that a segment's visible length depends on its font.

Sub CheckRealLengthOfText()
    'This macro warns the user if the target segment is over 130% of the source's length.
    'The *real* visible length of text is compared, not just character count
    '(Of course we assume both source and target have the same font and size)
    Dim I As Integer, Segment As Range
    'Dim rather than Static, so that lengths do not accumulate from one segment to the next
    Dim L(1) As Long
    For I = 0 To 1
        If I = 0 Then
            Set Segment = ActiveDocument.Bookmarks("WfSource").Range
        Else
            Set Segment = ActiveDocument.Bookmarks("WfTarget").Range
        End If
        Selection.Start = Segment.Start: Selection.End = Selection.Start
        Do While Selection.Start < Segment.End - 2
            Selection.MoveStart wdLine: Selection.MoveEnd , -1
            L(I) = L(I) + Selection.Information(wdHorizontalPositionRelativeToTextBoundary)
            Selection.MoveStart , 1
        Loop
    Next
    'Here, "1.3" means 130%. Change this figure as needed.
    If (L(1) > L(0) * 1.3) Then
        If MsgBox("Target text length is over 130% that of the source." + vbCr + vbCr + _
                  "Get back to the segment and correct it?", vbYesNo, "WFC") = vbYes Then
            Selection.Bookmarks.Add "WfStop"
        End If
    End If
End Sub

Checking quotes consistency

The following macro compares source/target segment to make sure quotes are consistent (same types and numbers of quotes used). Add this macro to WFC/Setup/General, as a QA macro, or as a Post-segmentation macro.
When a quote discrepancy is found, WFC will warn the user, with a choice of getting back to the segment and correcting the problem, or just moving on to the next segment.

Sub CheckQuotes()
    If Not ActiveDocument.Bookmarks.Exists("WfSource") Then Exit Sub
    Dim I As Integer, Src As String, Trg As String, Quotes As String, Uq As String
    Quotes = Chr(34) + Chr(171) + Chr(187) + Chr(147) + Chr(148)
    Src = ActiveDocument.Bookmarks("WfSource").Range.Text
    Trg = ActiveDocument.Bookmarks("WfTarget").Range.Text
    For I = 1 To Len(Quotes)
        Uq = Mid(Quotes, I, 1)
        If (InStr(Src, Uq) > 0 And InStr(Trg, Uq) = 0) Or (InStr(Src, Uq) = 0 And InStr(Trg, Uq) > 0) Then
            If MsgBox("Possible problem with quotes (" + Uq + ".) Fix it?", vbYesNo, "WFC") = vbYes Then
                Selection.Bookmarks.Add "WfStop"
            End If
            Exit Sub
        Else
            If InStr(Src, Uq) > 0 Or InStr(Trg, Uq) > 0 Then
                If InStr(Src, Uq) > 0 Then Mid(Src, InStr(Src, Uq), 1) = "*"
                If InStr(Trg, Uq) > 0 Then Mid(Trg, InStr(Trg, Uq), 1) = "*"
                I = I - 1
            End If
        End If
    Next
End Sub

Highlighting text with Shading

Q: I would like to highlight selected text, not using highlight, but Borders and Shading/Shade/Yellow instead. However, this is really slow because I have to use the menus each time.

A: Associate the following macro to Alt+H. See the part on associating macros with a shortcut.

Sub HighLight()
    Selection.Font.Shading.BackgroundPatternColorIndex = wdYellow
End Sub

Extracting the contents of textboxes into a new document

Q: I want to run a word count of all the text contained in textboxes in my document.
A: Run the following macro. It will create a new document containing all text found in textboxes.

Sub ExtractFromTextBoxes()
    Dim I As Integer, J As Integer, Boite As Variant, ThisDoc As Document
    Dim DocName As String
    ActiveWindow.View.Type = wdPrintView
    Set ThisDoc = ActiveDocument
    DocName = ThisDoc.FullName
    Documents.Add
    On Local Error Resume Next
    ' Convert InlineShapes (anchored shapes) to regular shapes
    For Each Boite In ThisDoc.InlineShapes
        Boite.ConvertToShape
    Next
    ' I > 0 indicates there are still ungrouped textboxes to process
    ' J is just a security to avoid looping endlessly.
    I = 1: J = 0
    While I > 0 And J < 10000
        ' Ungroup grouped shapes
        For Each Boite In ThisDoc.Shapes
            Boite.Ungroup
        Next
        ' make sure all textboxes were ungrouped
        ' (embedded groupings may need more than one pass to be ungrouped)
        For Each Boite In ThisDoc.Shapes
            I = 0: I = Boite.GroupItems.Count
            If I > 0 Then Exit For
        Next
        J = J + 1
    Wend
    For Each Boite In ThisDoc.Shapes
        With Boite.TextFrame
            ' If a textbox has text, copy it into the empty document
            If .HasText Then
                Selection.InsertAfter .TextRange
                Selection.InsertParagraphAfter
                Selection.Start = Selection.End
            End If
        End With
    Next
    ' Ungrouping usually creates a mess:
    ' close the original document without saving it
    ThisDoc.Close 0
End Sub

From Text to Doc: a smarter approach

The following macro attempts to rebuild a DOC-like document from a TXT document where all lines unconditionally end with a paragraph mark. Text copied from the Internet, or from PDF files, often suffers from this problem. Note that there is no sure-fire way of "guessing" how paragraphs should be rebuilt. The following macro uses a few methods that usually give good results, rebuilding most paragraphs correctly. But the final result must be visually checked before professional use.

Sub TextToDoc()
    Dim S As Selection, D1 As Range, D2 As Range, IsPara As Boolean, T As String
    If Windows.Count = 0 Then MsgBox "Sorry, no document open": Exit Sub
    Set S = ActiveWindow.Selection: Set D1 = S.Range: Set D2 = S.Range
    S.End = 0
    Do While S.Start < S.StoryLength - 1
        ' Turn off screen refresh for better speed
        Application.ScreenUpdating = False
        IsPara = False
        ' We store the last letter of the line into the string T
        S.MoveEndUntil vbCr: T = Trim(S.Text): T = Right(T, 1)
        ' A first attempt to determine if we do have an end of paragraph:
        ' the line ends with an end-of-sentence
        If InStr(".!?", T) > 0 Then IsPara = True
        If S.End < S.StoryLength - 3 Then
            D1.SetRange S.End + 1, S.End + 2
            If IsPara Then D2.SetRange S.End - 1, S.End Else D2.SetRange S.End - 2, S.End - 1
            ' If the last character of the line is lowercase and the first character of the next line is uppercase,
            ' we'll assume we've got a real paragraph.
            ' Disable this for languages that capitalize a lot, like German etc.
            If D2.Characters(1).Case = wdLowerCase And D1.Characters(1).Case = wdUpperCase Then IsPara = True
            ' if the font name or size varies from the current line to the next, we'll also assume
            ' there's a new paragraph. Very often the case with text copied from PDF; not
            ' relevant with Txt files.
            If S.Font.Name <> D1.Font.Name Then IsPara = True
            If S.Font.Size <> D1.Font.Size Then IsPara = True
        End If
        ' If we do not have a paragraph, then join the two lines into one and move on
        If Not IsPara Then
            S.Start = S.End: S.Delete: S.InsertAfter " "
        Else
            S.InsertParagraphAfter: S.MoveStart wdParagraph, 1: S.MoveStart wdParagraph, 1
        End If
    Loop
    S.End = 0
    MsgBox "Text to Doc conversion finished. Please check the document."
End Sub
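
For readers who want to try the two text-based rules (end-of-sentence punctuation, lowercase followed by uppercase) on a plain text file outside Ms-Word, here is a rough Python sketch. It is only an approximation of the macro: the font test is left out because plain text carries no font information, and the function names are hypothetical.

```python
def is_paragraph_end(line, next_line):
    """Guess whether `line` closes a paragraph, using two of the macro's rules."""
    last = line.rstrip()[-1:]
    first = next_line.lstrip()[:1]
    if last == "" or last in ".!?":          # empty line, or end-of-sentence mark
        return True
    # lowercase at the end, uppercase at the start of the next line
    # (disable for languages that capitalize a lot, like German)
    return last.islower() and first.isupper()

def rebuild_paragraphs(text):
    """Join hard-wrapped lines into paragraphs, one paragraph per output line."""
    lines = text.split("\n")
    paragraphs, current = [], []
    for i, line in enumerate(lines):
        current.append(line.strip())
        next_line = lines[i + 1] if i + 1 < len(lines) else ""
        if is_paragraph_end(line, next_line):
            paragraphs.append(" ".join(part for part in current if part))
            current = []
    if current:                               # flush a last, unterminated line
        paragraphs.append(" ".join(part for part in current if part))
    return "\n".join(paragraphs)

wrapped = "Title of the document\nThis sentence was cut\nin the middle. It goes on\nand ends here."
print(rebuild_paragraphs(wrapped))
```

As with the macro, the result must be checked visually.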

Appendix IV - Advanced Find/Replace

Note: the WFC Knowledge Base, accessible from http://www.WFC.net, has more content on the following topic.

Ms-Word's Find/Replace feature (FR) accepts wildcards and advanced features. A good understanding of FR can save the day on numerous occasions. I had to oversee translation projects where, to my astonishment, translators were spending hours executing visual/manual Find-Replace actions that could have been safely executed automatically.

Sure, FR actions can be destructive if they're not executed properly, since they can modify unwanted parts of the document. On a short document, a visual/manual FR can be preferred, since setting up and testing a smart and safe FR can take a little while.

Note that PlusTools offers a FR feature that can be run over many files, both in manual and automatic mode, with the possibility to edit the document and restart the FR where it was interrupted.

Back to source

Q: Whoops! My documents have been pretranslated, and I don't have access to the originals. But now I would like to have the originals back, unsegmented. Apparently, it takes a lot of successive Find-Replace passes to un-segment documents...

A: Quite the contrary. It takes only one FR pass to do that.

Find what: (\{\0\>)(*)(\<\})(*)(\{\>)(*)(\<\0\})
Replace with: \2
Use Wildcards
Set the replacement font format to "not hidden" (check, then uncheck, the "Hidden" checkbox).

The only limitation is, make sure source segments do not contain hidden text. But they rarely do.
Note: the same result can be achieved with the Alt+Delete shortcut, pressed when no segment is opened.
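
If the segmented text has been exported to a plain text file, the same idea can be tried outside Ms-Word. The following Python sketch is an approximation, not a WFC feature: it assumes the delimiters appear literally as {0> <}NN{> and <0}, and it ignores the hidden-text formatting that Ms-Word uses.

```python
import re

# {0>source<}NN{>target<0} : keep only the source part of each segment.
SEGMENT = re.compile(r"\{0>(.*?)<\}\d+\{>(.*?)<0\}", re.DOTALL)

def back_to_source(text):
    """Replace every segmented unit with its source text."""
    return SEGMENT.sub(r"\1", text)

segmented = ("{0>This is my first translation.<}0{>Voici ma première traduction.<0} "
             "{0>Second sentence.<}100{>Deuxième phrase.<0}")
print(back_to_source(segmented))
# This is my first translation. Second sentence.
```

Here the source is group 1 because the opening delimiter is not captured; in the Ms-Word FR above it is \2 because the opening delimiter is itself expression 1.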

Turning US financial number formatting into French

This means changing US thousand separators (commas) into non-breaking spaces, and US decimal separators (full stops) into commas. Here is a two-pass method:

Find what: .([0-9][0-9])>
Replace with: ,\1
Use Wildcards

then,

Find what: ([0-9]),([0-9][0-9][0-9])
Replace with: \1^s\2
Use Wildcards

This method is offered as sample in WFC's Pandora's box commands. Note that WFC's "FR" command executes FR actions only in the current target segment, at segment validation time.
Use this FR in automatic mode ("Replace all") if the figures and numbers in your document are essentially financial. If, however, your document mixes scientific figures with financial figures, I recommend using this FR method with a visual confirmation for each replacement (in Ms-Word's "Find" dialog box, click "Find Next" and "Replace" rather than "Replace all").
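
For comparison, here is the same two-pass conversion as a Python sketch operating on plain text. It is an illustration, not what WFC runs internally. The second pass uses look-arounds instead of captured digits so that consecutive thousand separators (as in 1,234,567) are all converted in one go; that is a deliberate deviation from the Ms-Word pattern above.

```python
import re

NBSP = "\u00a0"   # non-breaking space, the counterpart of ^s in Ms-Word

def us_to_french(text):
    """Convert US number formatting (1,234.56) to French (1 234,56)."""
    # Pass 1: a full stop followed by exactly two digits becomes a comma.
    text = re.sub(r"\.(\d\d)\b", r",\1", text)
    # Pass 2: a comma between a digit and a group of exactly three digits
    # becomes a non-breaking space.
    text = re.sub(r"(?<=\d),(?=\d{3}(?!\d))", NBSP, text)
    return text

print(us_to_french("Total: 1,234,567.89 USD") == "Total: 1\u00a0234\u00a0567,89 USD")   # True
```

The same caution applies as in Ms-Word: a figure such as a version number 7.12 would also be converted, so review the result when the document mixes financial and other figures.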

From Text to Doc

Q: In my document, all lines end with a carriage return, even if they don't end a paragraph. What can I do to reconstruct a normal text flow?

A: There is no absolute answer, but a global FR can do most of the job; a last manual verification will restore paragraphs that are unduly cut. See the smarter, macro-based alternative in Appendix III, "From Text to Doc: a smarter approach".

Find what: ^p^p
Replace with: <!?a$

The above FR will preserve double paragraph marks, replacing them with a very unlikely sequence of characters, which we here call a code.

Find what: ^p
Replace with: (a single space)

The above FR will turn all single paragraph marks into a space. A space has to be entered in the "Replace with" argument.

Find what: <!?a$
Replace with: ^p^p

The above FR will restore double hard carriage returns.

This is a typical three-pass FR example. Note that when using wildcards, Ms-Word no longer accepts some characters such as ^p (hard carriage return), so two- or three-pass FR actions are often necessary to bypass this limitation.
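
The three passes can be reproduced on a plain text file with three string replacements. The Python sketch below is an illustration under the assumption that paragraph marks are stored as newline characters; the function name is hypothetical.

```python
CODE = "<!?a$"   # the same unlikely sequence of characters used in the FR above

def unwrap_three_pass(text):
    """Join hard-wrapped lines while keeping double paragraph marks."""
    if CODE in text:
        raise ValueError("the code already occurs in the text; pick another one")
    text = text.replace("\n\n", CODE)   # pass 1: protect double paragraph marks
    text = text.replace("\n", " ")      # pass 2: single paragraph marks become spaces
    return text.replace(CODE, "\n\n")   # pass 3: restore double paragraph marks

wrapped = "line one\nline two\n\nnext para\ncontinues"
print(unwrap_three_pass(wrapped) == "line one line two\n\nnext para continues")   # True
```

Checking first that the code does not already occur in the text is the programmatic version of choosing "a very unlikely sequence".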

But hey, wait a minute...

Actually, a one-pass FR can achieve just the same result, but don't tell anyone, because it's a secret:

Find what: ([!^0013])([^0013])([!^0013])
Replace with: \1 \3
Use Wildcards

(Note the space after \1) Amazing, right? Be cautious though. On some Ms-Word versions, ^0013 introduces a new line but not necessarily a new paragraph, as surprising as this may seem. Use this geeky method if you're a geek yourself and know what you're doing.
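
The one-pass idea (replace a paragraph mark only when its neighbours are not paragraph marks) translates directly into a regular expression with look-arounds. This Python sketch works on plain text where paragraph marks are newline characters, which is an assumption; it is not a description of how Ms-Word processes the FR.

```python
import re

def unwrap_one_pass(text):
    """Replace each isolated newline with a space; leave double newlines alone."""
    # (?<!\n) : not preceded by a newline, (?!\n) : not followed by a newline
    return re.sub(r"(?<!\n)\n(?!\n)", " ", text)

wrapped = "line one\nline two\n\nnext para\ncontinues"
print(unwrap_one_pass(wrapped) == "line one line two\n\nnext para continues")   # True
```

Because look-arounds do not consume the neighbouring characters, nothing needs to be copied back with \1 and \3.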

Replacing numbers

A segmentation problem had produced segments where match values were often over 100. So the documents had such match values as <}833{> or <}944{> etc. It appeared that the last figure of the match value had been duplicated (these two segments should have been <}83{> and <}94{>). How could this be fixed in many documents, in one pass, making sure other figures are not modified by the procedure?

The answer is:

Find what: (\<\})([1-9])(?)(?)(\{\>)
Replace with: \1\2\3\5
Use Wildcards

Explanation: When the "Use Wildcards" checkbox is checked, "expressions" are anything contained within parentheses. The numbers in the "Replace with" argument actually refer to expressions located in the "Find what" argument, counted from left to right.

The ([1-9]) expression in the "Find what" argument, for example, refers to any digit in the range 1 - 9. In the "Replace with" argument, it is referred to as \2, meaning "expression 2".

So the Find-Replace action can be read as:

If such a chunk of text is found, replace the entire chunk with expressions 1, 2, 3, 5.

As a result, the redundant number (expression 4) is deleted from match values, with no risk of upsetting the rest of the document. An added safety measure could be to set the style for the Search parameter to "tw4winMark".
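
The numbered-expression mechanism works the same way in most regular expression engines. Here is the same repair as a Python sketch on plain text. It is an illustration only; the ? wildcards are narrowed to digits here, and, like the FR above, it assumes that every three-digit match value in the file is a damaged one.

```python
import re

# Five groups, numbered left to right exactly like the Ms-Word expressions:
# 1 = "<}"   2 = first digit (1-9)   3 = second digit   4 = duplicated digit   5 = "{>"
BROKEN_MATCH = re.compile(r"(<\})([1-9])(\d)(\d)(\{>)")

def fix_match_values(text):
    """Drop group 4 (the duplicated last digit) from three-digit match values."""
    return BROKEN_MATCH.sub(r"\1\2\3\5", text)

damaged = "{0>a<}833{>b<0} {0>c<}944{>d<0} and 12345 stays"
print(fix_match_values(damaged))
# {0>a<}83{>b<0} {0>c<}94{>d<0} and 12345 stays
```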

Delete target segments that are just a copy of the source segment

Q: I have a segmented document, where the source segment was copied over the target segment when there were no matches (0%). Now I would like the target segments to be empty instead, but of course, leaving fuzzy and exact matches in place, untouched.

A: A find-replace can, in one pass, transform zero matches where the source has been copied to target into no-matches with an empty target.

Find what: (\<\}\0\{\>)*(\<\0\})
Replace with: \1\2
Use Wildcards
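
On a plain text export, the same clean-up can be sketched in Python. This is an illustration under the assumption that delimiters appear literally as <}0{> and <0}. Like the FR above, it empties the target of every 0% segment without checking that the target really is a copy of the source.

```python
import re

# Keep the "<}0{>" marker and the closing "<0}", drop whatever sits between them.
ZERO_MATCH = re.compile(r"(<\}0\{>).*?(<0\})", re.DOTALL)

def empty_zero_match_targets(text):
    """Empty the target of every segment whose match value is 0."""
    return ZERO_MATCH.sub(r"\1\2", text)

doc = "{0>Hello<}0{>Hello<0} {0>World<}100{>Monde<0}"
print(empty_zero_match_targets(doc))
# {0>Hello<}0{><0} {0>World<}100{>Monde<0}
```

Segments with fuzzy or exact matches are left untouched because their marker is not <}0{>.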

Associating macros with a shortcut

Use the View/Toolbars menu, click the Customise submenu. Click "Keyboard". In the "Categories" list, click "Macros". Select the macro. Enter the shortcut in the Shortcut text box, then click "Assign", then "Close".

Credits

All trademarks noted™ are the property of their respective owners.
Ms-Word, Excel, Access, PowerPoint are trademarks of Microsoft Corp.
Translator's Workbench is a trademark of Trados Corporation

Table of Contents