Broken translation memories showing extreme high % matches for unrelated text?
Thread poster: Adieu
Adieu
Adieu  Identity Verified
Ukrainian to English
+ ...
Dec 7, 2021

I've got a client who recognizes the issue and even pays out without match discounts, but perseveres in trying to populate a messed-up translation memory. This thing shows 90+% matches between grammatically incorrect vernacular where a mental patient is talking about his issues and pieces of bureaucratic cover letter templates for applications for permits.

There's literally not a word in common. Some aren't even remotely close in length.

Any idea how this is happening?


 
Daryo
Daryo
United Kingdom
Local time: 19:30
Serbian to English
+ ...
Maybe ... Dec 8, 2021

Adieu wrote:


I've got a client who recognizes the issue and even pays out without match discounts, but perseveres in trying to populate a messed-up translation memory. This thing shows 90+% matches between grammatically incorrect vernacular where a mental patient is talking about his issues and pieces of bureaucratic cover letter templates for applications for permits.

There's literally not a word in common. Some aren't even remotely close in length.

Any idea how this is happening?


Maybe that proves the bureaucratic jargon makes about as much sense as the verbal effusion of mental patients?

Or the software was tweaked a bit too much in favour of agencies, in an effort to beat the competition?

One of these explanations ought to make sense...


 
Grigori Gazarian
Grigori Gazarian  Identity Verified
Mexico
Local time: 12:30
Member (2021)
Spanish to Russian
+ ...
Tags, maybe? Dec 8, 2021

Adieu wrote:

There's literally not a word in common. Some aren't even remotely close in length.

Any idea how this is happening?


Are there many tags? Not sure, but I think memoQ's algorithms also consider similar tags. Just a wild guess.


 
Adieu
Adieu  Identity Verified
Ukrainian to English
+ ...
TOPIC STARTER
Nope Dec 8, 2021

No tags.

Tags are a separate type of cancer.

Grigori Gazarian wrote:

Adieu wrote:

There's literally not a word in common. Some aren't even remotely close in length.

Any idea how this is happening?


Are there many tags? Not sure, but I think memoQ's algorithms also consider similar tags. Just a wild guess.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Broken translation memories showing extreme high % matches for unrelated text?






Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »