Poll: If you use a CAT-tool, how do you prefer the text to be segmented?
Thread poster: ProZ.com Staff
ProZ.com Staff
ProZ.com Staff
SITE STAFF
Jun 26, 2021

This forum topic is for the discussion of the poll question "If you use a CAT-tool, how do you prefer the text to be segmented?".

View the poll results »



 
Mario Freitas
Mario Freitas  Identity Verified
Brazil
Local time: 16:18
Member (2014)
English to Portuguese
+ ...
The shortest, the better Jun 26, 2021

One thing I learned fast with CAT tools is that the shorter the segment, the more useful it will be in the future. Smaller segments are the ones that give you 101% matches and autofill. The longer the segment, the less likely it is for this to happen. Configuring your CAT to segment by paragraph is wasting a big part of its functionalities.

Anton Konashenok
Ventnai
Ana Vozone
neilmac
Thayenga
Elena Kharlamova
MollyRose
 
Vasaporn Chaiyakul
Vasaporn Chaiyakul  Identity Verified
Thailand
Local time: 02:18
Member (2021)
English to Thai
+ ...
Sentence preferred, but.. Jun 27, 2021

In many Asian languages, there's no easy way (or perhaps no way in my pair, for example) for the tool to separate these continuous characters into sentences.

[Edited at 2021-06-27 08:41 GMT]


 
Christine Andersen
Christine Andersen  Identity Verified
Denmark
Local time: 21:18
Member (2003)
Danish to English
+ ...
Sentences, as long as you can override the automatic segmentation manually Jun 27, 2021

Quite often, I find sentences in the source language do not correspond exactly to sentences in the target.
Sometimes I simply split one into two, but it is useful to be able to merge two or three sentences or divide them differently.

What really is impossible to work with is when the text has not been properly formatted, and the CAT segments it into lines instead of sentences, or peppers the source with irrelevant tags...


 
Maria Teresa Borges de Almeida
Maria Teresa Borges de Almeida  Identity Verified
Portugal
Local time: 20:18
Member (2007)
English to Portuguese
+ ...
I don't use a CAT tool Jun 27, 2021

Dinosaurs don't like cats...

dinosaur


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 21:18
Member (2006)
English to Afrikaans
+ ...
Long for intermediary tools, short for matching tools Jun 27, 2021

I prefer long segments for the intermediary formats, and short segments for matching tool. In other words, if the end-client starts off with an IDML file, and the first-level agency converts it to SDLXLIFF, and the second-level agency then distributes it via a MemoQ server, from where I would then export it for translation in e.g. OmegaT, then I would prefer OmegaT to segment by sentence, but MemoQ and Trados to segment by paragraph.

 
Gibril Koroma
Gibril Koroma  Identity Verified
Canada
Local time: 15:18
French to English
+ ...
By sentence Jun 27, 2021

It's easier for me.

MollyRose
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Jared Tabor[Call to this topic]

You can also contact site staff by submitting a support request »

Poll: If you use a CAT-tool, how do you prefer the text to be segmented?






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »