Showing posts with label CAT. Show all posts
Showing posts with label CAT. Show all posts

Thursday, December 23, 2010

HOLIDAY MUSINGS FROM LANGUAGE LAND



THE CHICKENS ARE COMING HOME TO ROOST
(A Tale for the Holiday Season)
By Isidro Ludwig Burt Rand

Once upon a time (actually it was only three years ago), a very nice, kind and gentle woman who happened also to be translator, a writer and a poetess, wrote a piece about the impending demise of a certain species of human translators, a demise wrought by the advent and growth of a machine species formally called robotic translation, but more commonly known by its nickname PussyCAT.

This very nice, kind and gentle woman (whom we shall call Ms. Cognac), sent her article to an organization of translators of which she had been a most loyal, most non-controversial and most un-revolutionary member for many, many years.  Her article was received by one of the organization’s minor clerks, a young fellow named Mr. Mannerless, who informed the very nice, kind and gentle Ms. Cognac that her article was not worthy of publication in the organization’s monthly organ – a most professional journal – but that it could be published as a “letter-to-the-editor” if it could be reduced to about 200 or 300 words at the most.

Ms. Cognac did not readily understand the reasons for this rejection or reduction, but being the very nice, kind and gentle woman she was (and still is), she saw little reason to make waves, even small waves.  However, she did relate the incident to a certain Mr. Beast (sometimes known as Mr. Unprofessional and sometimes known as the Duke of Darkness or other times known as Mr. Vulgaritie). 

Mr. Beast inquired of the organization precisely why Ms. Cognac’s article had been given such treatment, particularly by the minor clerk Mannerless.  Suddenly, the wind changed direction and Ms. Cognac was informed that her article would be sent to a so-called Review Committee for guess what?  Review, of course.

The Review Committee was composed of one person, a fellow named Mr. Geek, who was the organization’s Supreme Grand Guru of All Matters Technological.  This struck both Mr. Beast and Ms. Cognac as quite strange since Ms. Cognac’s article had absolutely nothing to do with matters technological, except of course the future effect of matters technological on the species known as human translator.

Anyway, Mr. Geek gave the article to his wife for due perusal and equally due review.  Mrs. Geek liked the article very much (at least that is what Mr. Geek told Mr. Beast), and since Mrs. Geek liked the article, Mr. Geek decided to read it himself and he sort of liked it too.  Being the fair man that he was (and perhaps still is), he affixed his imprimatur of approval upon Ms. Cognac’s article.  And in relating all of these events to Mr. Beast, he (Mr. Geek) said to him (Mr. Beast) something most revealing, to wit:

“Ms. Cognac is a translator of another time”

There was no mistaking in Mr. Beast’s mind as to the meaning of that phrase:  To Mr. Geek (and all of his disciples), the time of translators like Ms. Cognac was over…finished…done…terminated…kaput!  This was the age of the robot and the PussyCAT (and Mr. Beast and Ms. Cognac and all other dinosaurs had better get with the program).

And so, with Mr. Geek’s imprimatur of approval, the article was returned to the organization for publication.  However, the minor clerk Mr. Mannerless refused to publish it (without further reasons given) and so informed Ms. Cognac, who in turn duly advised Mr. Beast.  While Ms. Cognac expressed the desire of having as little to do as possible with the organization, Mr. Beast’s flames of curiosity burned higher and hotter.

He thought about making some inquiries via electronic postal service of the organization’s president, a Mr. Cheery, but knew very well that Mr. Cheery never responded not just to mail, but to anything.  He therefore directed his curiosity and inquiries to the organization’s heir apparent, Czar Nicolai I, who it was known did on occasion respond to inquiries made of Him.

Czar Nicolai I did respond to Mr. Beast’s inquiry (noblesse oblige).  His Almost-Supreme Royal Highness informed the Duke of Darkness (Mr. Beast’s own aristocratic title) that the minor clerk Mr. Mannerless had absolute power and authority (and the blessings of the organization’s supreme rulers) to reject any article for publication that He (the minor clerk Mr. Mannerless) deemed worthy of rejection.  Thus Czar Nicolai I shut the door to any further inquiries, as well as any snooping and meddling.

But the curiosity of the Duke of Darkness knew no bounds and he called upon some of his spies and various and other traitor-like types within the realm, one of whom was a member of the royal court known as Count Oozy (as in the lyrics, “Oozing charm from every pore, he oiled his way around the floor”) or sometimes Sultan of Slime.

From this network of spies, informants and double-agent traitors, Mr. Beast learned that the organization and its royal rulers were petrified by what was contained in Ms. Cognac’s article.  In the neurotic and paranoid world of the organization’s royal court, the words scribed by this very nice, kind and gentle woman would truly upset the organization’s prime patron (called in the vulgate form of English “advertiser”) and/or any other potential patrons (or in the vulgate form of English “advertisers”) of the world of translation technology.  The fact of the matter was that the minor clerk Mr. Mannerless had been ordered by his superiors and rulers to affix the royal stamps and seals marked “Banned” and “Censored” to Ms. Cognac’s words.

“As time went on,
needless to say, along came another wind
and blew the PussyCAT away”.

The name given to this wind was “Google”. 

Wooooooooooooooooooooooooooooooooooooo went the “Google” wind like so:
“Google Translate Now Offers Alternate Versions for Each Word
“Google Translate is introducing a subtle but important new feature, the possibility to alter the translation on the fly and pick the best version from several options for each of the words translated. You can enter your own version if the ones listed aren't accurate. The feature also gives a glimpse at how the technology behind Google Translate works.

"’Sometimes translation can be pretty tough. Language is full of ambiguities and our system has to do its best to make the right choices. So why choose?,’ Josh Estelle, Senior Software Engineer at Google, writes.

“’We’ve launched a new feature to provide you with alternate translations for each phrase in the translated text. Just click the translated phrase and you’ll see a pop-up menu of possible alternates for that phrase, as well as the original phrase highlighted in your original text,’ he explains.

“Hover over any translated word and you will not only see the original word, or words, to which it relates, but also alternative translations. If you think something just doesn't sound right, you can click on the word for a drop-down list of other versions. You can also enter your own translation.

“This way, there is a great chance that one of the alternates makes a lot more sense in the context and you'll get a much better translation in the process. What's more, you'll also be helping Google do a better job next time.

"’Not only can these alternative translations give you a better understanding of a confusing translation, but they also allow you to help Google choose the best alternative when we make a mistake,’ Estelle explains.

“Google Translate uses a statistical machine translation system. Google's computers scours through vast
data sources and look for translations of words, phrases and so on.

“When it's translating something, the system looks through its vast data set and finds the version that is the most likely, based on sheer number. It's not a perfect system, as anyone who has used Translate will know, but the beauty of it is that it gets better in time.

“So any time you make a correction, your input is added and weight against the data already available. This way, little by little, Google Translate will become more accurate”.

So you see, my children, the key word here is “alternatives”, for that is precisely what occurs in the brain of the species known as human translator, and precisely what doesn’t occur in the brain of PussyCAT.  And if the Lords of Google find their holy grail, then the fate of the subjects and disciples of PussyCAT, including Lords Treydoss, Slow-of-Words and Been-There-Done-That will be in the hands of the gods (or St. Jerome, le cas échéant).  Yes, those subjects and disciples, also known as Mr. Geek’s "translators-of-now" could very well by the time of the holiday season of 2012  become the "translators-of-another-time".  The chickens have come home to roost.

PussyCAT.  c. 1998 – c. 2012.  R.I.P.”

Ho, Ho, Ho.  Merry Christmas!


Monday, November 29, 2010

COMMODIFICATION: ARE TRANSLATORS TO BLAME?





WORDS FOR SALE
Part 3

By Bernie Bierman


CAT tools themselves are not to be regarded as our enemies.  They didn’t do anything.  But the supposed translation specialists did it to themselves and the “real” translators let them.”

The above comment made in response to Part 2 of this series certainly has merit, and I would be hard-put to dispute it.  Indeed, I would say that in respect of the economic impact of computer-assisted translation, the CAT operators or workers (né[e] translators) are not the sole source of the problem, although their complicit behavior has certainly been a contributing factor.  From where I sit or stand and from my keen observations, it has been the language service providers (more commonly known as translation agencies) who have used CAT in an exploitative manner and at the same time have converted translation from a communications service into a commodities business, i.e., the selling of words.

The ability of the various CAT programs currently on the market to undertake “precise” computations consonant with their own (computational) parameters and formulas as to how many “new” words, “repeated” words, matches of phrases or passages or segments, percentages of matches, and so on and so forth down a whole line of esoteric calculations, has resulted in among other things the formulation of a rigid pricing structure that one would commonly find in the sale of goods or commodities.  But given the particular structure of the translation industry, the biggest impact of this pricing rigidity has fallen upon the so-called “low man or woman on the totem pole”, i.e., the independent CAT worker (who has manifested over and over again his or her willingness to accept such rigidity and consequent commodification of the service).

About two years ago, a CAT operator née translator named Ronnie McKee decided to take a much closer look at the computational methods used for determining what a translation agency would pay her for her labors.  She wrote, I was asked (by a translation agency in Spain) to translate a large document, namely, a 20,000- word corporate governance report, with a TM (translation memory) provided.  I had heard about rate scales having to do with number of repetitions, etc. etc., but I never really understood what it meant.  Well, now I do.  And for those who are interested, I'm going to share:”

And Ms. McKee did share, as others have subsequently done, and her findings evidence beyond a shadow of doubt that this intellectual service known as translation has become nothing more than the sale of commodity-like units, which see:
* * * * *
The following table represents the analysis undertaken by the Trados software of the document which Ms. McKee was asked to translate.  The data shown in this table has a direct relationship to the data shown in the immediately-following table, for that table shows what the translation agency will pay the CAT operator when it provides the latter with translation memory (TM) .
Match Types
Segments
Words
Percent
Placeables

Context TM
0
0
0
0

Repetitions
149
415
2
142

 100%             
909
10,616
54
0

95% - 99%         
67
1,638
8
1

85% - 94%         
25
610
3
0

 75% - 84%         
12
372
2
0

 50% - 74%          
6
112
1
0

 No Match          
284
6,055
30
28

 Total          
1,452
19,818
100
171

In the case of the translation agency involved in this transaction, a segment is defined as a sentence; it is what the system joins together into one translation unit.  In line with that, the subject document for which a translation was ordered from Ms. McKee contained (according to the computation undertaken by the agency’s Trados software) 149 repeated segments, and those segments included altogether 415 words.  The 100% line means that there were 909 segments in the source-language (SL) document  that have an identical match in the translation memory   The next group indicates that 67 segments have 95% to 99% matches. Accordingly, this agency determined that it would pay the CAT worker a different rate or amount based upon the degree of the match.
The subject SL document contained 284 segments of “brand new translation” (i.e., “No Match”), and those segments constituted 6055 words.  Therefore, the agency paid the CAT worker  the highest rate for those and paid the lowest rate for the 100% matches.
The table below was used by the translation agency for determining how much it would pay Ms. McKee for her work based upon the data embodied in the above-indicated table:
Match Types
Euro/word
Repetitions      
0,015
100%
0,015
75-99%   
0,040
0-74%           
0,070


* * * * *
A cursory glance at these figures indicates that for the entire project of 19,818 source language words, Ms. McKee was paid a total amount of € 701.96, which we’ll round off to €702.00, or approximately US$990.00.  That breaks down to €0.036 per SL word or US$0.05 per SL word. 

It would be tempting at this point to undertake a full, in-depth analysis of all of these figures to determine such things as how much Ms. McKee made on an hourly basis, how much efficiency did she achieve by using robotic translation tools, along with numerous other compensation-related aspects.  However, that is not the purpose behind the writing of this article, although it would certainly make good fodder for a future article.

Rather, the purpose behind the writing of this article is to point up not just the commodification of translation, but its total and complete commodification.  But not only does this total and complete commodification elicit comments (especially in the compensation-related area), but also raises numerous questions, questions that both translators and CAT workers have raised and continue to raise, such as how the particular CAT program (Trados, WordFast, DejaVu, et al) determines what is a 72% match or a 48% or a 12% match or a 91% match, and whether there isn’t some kind of interpretive element at work in such determination.  Many CAT operators have expressed and continue to express puzzlement over such determinations (made not by them, but by the translation agency), and even more puzzlement over the methods employed for arriving at such determinations. 

And another equally striking question is how a computer or computer program can determine that two segments in a source language document are identical without having a clue as to whether the target language will demand a somewhat different or very different translation because of that very naughty word context.  Admittedly, however, this is drifting slightly from the main thrust of this article, which to repeat is the commodification of translation.

The pinpoint breakdowns and classifications indicated in the tables shown above demonstrate beyond any discussion and beyond any doubt that the communications service once known as translation has become nothing more than a robotic selling of words…with certain words being the most expensive, other words being a little less expensive, and some words being bargain-basement cheap or in some cases given away for nothing (or if one prefers, given away in the most charitable manner).

And to support and otherwise propagandize this commodification, CAT workers and CAT operators are told that they will attain heights of efficiency and productivity never dreamt of just ten short years ago, and that this efficiency and productivity will double, triple or even quadruple their earnings.  How this will come about in the context of ever-falling translation prices and rates, of more and more competition and of more and more “memorization” of vocabulary, terminology, phraseology, etc., is never explained. 

Not surprisingly, the world community of CAT workers and operators has cheerfully accepted this commodification of their work (perhaps because many never knew better and/or never knew what preceded this commodification) and, in many cases, they have joined their translation agency clients to become the champions, the salespersons and propagandists of Words for Sale.

Yes, indeed, today one can easily tell when CAT workers and operators are gathered in one of their very numerous periodic meetings or conventions or conferences: there is a long line of chauffer-driven limousines, a small but obvious congregation of pilots and flight attendants for the numerous private jets waiting at the nearby airports, and of course a parade of the latest fashions in dress that warms the hearts and pocketbooks of the most elite designers of Paris and Milan

* * * * *

Postscript:  Some time in late January or early February of every year, I prepare financial data for my personal accountant, Smiley.  The data is neatly laid out so that Smiley does not unnecessarily have to call me for any explanations.  Now, I know and Smiley knows that I know that he does not sit down at his desk and spend literally hours upon hours with an adding machine and calculator filling out all sorts and manner of tax forms and declarations (as my late brother-in-law, also an accountant, had to do for many years). 

Nay, I know and Smiley knows that I know that he takes all of the data I provide him with, inputs it into a computer and in a matter of minutes, yes minutes, out come all of the completed tax returns and declarations.  However, when Smiley sends his bill, it does not reflect 12 minutes or 22.4 minutes, or even 1 hour and 7 minutes of work.  So you, the CAT worker out there who might be reading this article, may want to ask “Then what precisely does Smiley’s bill reflect?”

OK, I’ll tell you what it reflects: It reflects his knowledge, his skills, his expertise, his unique and singular talents, and I am more than happy to pay him for that knowledge, skill, expertise and talent, It provides me with a sound sleep at night.   I trust, or at least hope that the readers of this piece (whether they are CAT workers or translators or even translation agency executives) will get my drift, if not the message.



SPECIAL COMMENT

I don’t use CAT tools, but nevertheless I feel that it isn’t fair for a translator to charge for passages that are repeated in a document.  If I can cut and paste a passage, then I’ll charge only once for the words in that passage”
   - Statement by Eve Hecht, a translator and instructor in German legal translation at New York University, made at
       the 2010 convention of the American Translators Association.


If there is any one statement among literally hundreds of thousands of statements by translators (and their 21st century successors, CAT workers or CAT operators) that so perfectly reflects their economic attitudes, it is the one by Eve Hecht quoted above.  Unfortunately and indeed most regrettably (at least unfortunately and regrettably as I see the world of translation), Ms. Hecht’s beliefs and attitudes are not just widely accepted, they have become unquestioned conventional wisdom.  In this past decade more translators and their CAT worker successors have come to believe and accept the attitude expressed by Ms. Hecht.

I have said “unfortunately” and “most regrettably” only out of politeness and what I with some hesitation would call “good taste”.  But I hope that the reader will allow me the indulgence of setting aside the politeness and “good taste”, and allow me to express myself in the terms that would be more deserving of Ms. Hecht’s economic or commercial viewpoints.

Ms. Hecht’s statement makes me want to vomit!  It makes me want to scream. It makes me want to yell out, “Is that all that you think of yourself, of your unique skills (as a translator), of your talents?  Is that what you think of yourself and the work that you do…nothing more that some clerk-like endeavor, nothing more than an extension of the clerical furniture?  Is that all you think of the value of your time?  Is that all the self-worth you have?  Is this what you teach your students, namely that translation is nothing more than typing words, “NEW” words? 

You say, Ms. Hecht, that “it isn’t fair for a translator to charge for passages that are repeated in a document”.  It is not fair to whom?  Surprise me, Ms. Hecht, and tell me precisely why it is not “fair” for a translator to charge for a “repeated passage”.  Surprise me, Madame, and defend your position.  You claim that you are a translator of legal texts.  OK.  Suppose you have a passage like this:

Pursuant to the provisions of §12, section b.2 of Article 719 and §23, section d.1 of Article 793 of the Civil Procedure Code, as amended, the plaintiff can claim…” and two paragraphs later, the text reads, “But in the case before us, the provisions of §12, section b.2 of Article 719 and §23, section d.1 of Article 793 of the Civil Procedure Code, as amended, do not apply to a plaintiff who is unable…

As I understand it, Madame, the second time that the passage the provisions of §12, section b.2 of Article 719 and §23, section d.1 of Article 793 of the Civil Procedure Code, as amendedappears, you do not charge for the translating of that passage.  According to what you say, even if this passage appears ten times within the text, it is a “freebe” to the client, because as I understand your thinking, all you did was “cut and paste” or make a macro, and that in and of itself has zero value…to you (and to all other translation practitioners whom you believe should follow your “benevolent” and “self-sacrificing” attitude).

Tell me, Ms. Hecht, if the words “pursuant to” cropped up 75 times in a text, would you also eliminate those words 74 times from your word-count and attendant invoice? After all, according to you, the only thing you did was to make a little macro and push a button 74 times, and that certainly is not worth anything. Not a single Euro cent.  No a single penny.  Not a single centavo.  Not one single Turkish Lira.   Right?  After all, Ms. Hecht,  according to what you state and believe, it would not be fair for a translator to charge for pushing a macro key 74 times, or 174 times or 374 times.  Indeed,  Madame, why charge for words like “and”, “but”, “in”, “out”, “from”, “to”, “you”, “they”, “he”, “she”, etc., etc., etc., that repeat over and over and over again within any text?

If indeed, Madame, this is in fact what you believe and what you implement in your work, then I have absolutely no hesitation in expressing my opinion that you are precisely one of the many who have wittingly or unwittingly urinated upon the profession of translation, loudly proclaiming that it has little value and what little value it has is about on a par with the most menial clerical endeavor. 

You, Madame, in my most considered opinion are precisely why the translation-buying and translation-using public views and continues to view translators as nothing more than little language clerks and extensions of the clerical furniture.  Yes, Ms. Hecht, you truly want to make me vomit.  Now, really and truly surprise me and defend your position.



* * * * *

Monday, September 27, 2010

COMMODIFICATION AND THE CALCULATION OF RATES

Several people sent me emails about my last post rather than making comments here. Two in particular had to do with Bernie Bierman's email to his clients, and I passed them along to him. So that the discussion can continue, I am publishing his reply below:


Dear Jodi and dear Aliza,

Rosene Zaros, the publisher of this blog and journal informed me that both of you disagreed with me – in slightly different ways - on the matter of charging on the basis of target language words. I would be truly shocked if there were more than ten persons in the entire world of translation who agreed with me, not just on this issue, but on any other translation-related issue.

In retrospect, I shouldn’t have even written my “letter to my clients”, for it was based solely and totally upon erroneous thinking, and worse upon truly dinosauric concepts. In a phrase, I should have listened to the voices bellowing in unison, “Hey, Bernie, get with the program”. Stupidly, or unwisely, I chose to ignore those voices and proceeded along on my hind legs. If someone called me a slang term denoting the egress of the rectal canal, he, she or they would have been absolutely correct..

You see, my very dear ladies, when I wrote my letter to the client, I was still cemented into the thinking and attendant belief that there are still more than a sufficient number of translators in this vast world who perceive of themselves as writers, as specialists in communications, as persons having knowledge (profound or peripheral) of certain or many subjects, or for the very least the terminology, the phraseology and the writing styles of those subjects. Indeed, as I was writing my words, I kept laboring under this very antiquated notion that in the ultimate the fee of a practitioner in this business – whether an individual translator or a translation service company – was really based upon some very unique skills combined with knowledge and other gray matter components, and all wrapped up in a package labeled “communications service”. And I further kept laboring under the equally antiquated notion that the word count was merely the best of all of the other inadequate methods (e.g., per line, per stroke, per hour, per coffee break) to be used as a mere reference upon which this fee for a communications service could be based. I really don’t know what possessed me to keep thinking in a period of time that now seems like at least one thousand years ago.

Incidentally, the reason that gave rise to the writing of my letter to my (ever dwindling number of) clients was US$4.80. Yes, that’s right, US$4.80. No, not US$48.00 or US$480 or US$4800, but US$4.80. If you yourselves don’t see the picture, I do believe you see my drift. (At least I hope so).

There are at least two (2) facts of the matter that I should have paid more attention to and carefully considered before writing my letter to my clients. While I have every reason to believe that both of you are fully acquainted with these particular facts of the matter, permit me to address them merely for the sake of the few fellow dinosaurs who may read this tract, and whose understanding of the ways of modern translation need remediation. The first is that there are fewer and fewer translators involved in the translation industry. Translators, as we knew and defined them once upon a fairly short time ago, have been replaced by a genus called CAT workers or CAT operators (a genus somewhat akin to the now-defunct keypunch operator). The second fact of the matter is that with the ever-growing dominance of CAT, it is the computer that undertakes a so-called “scientific” analysis of the text to be translated, comparing it to texts that have been previously translated and stored in its memory, an analysis that includes not just the exact number of words to be translated from “scratch”, but also the number of words that may be similar to those already in the memory; this process is called “matching” and the computer undertakes all sorts and manner of calculations to determine percentages of this “matching”. All of these analyses and calculations of matching percentages are done on the basis of the text to be translated, i.e., the source language text, and the CAT worker or operator is told right from the get-go precisely how many (SL) words he or she will have to “translate” and how much he or she will be paid for those “translated” words. And there is no arguing by the CAT operator with either the computer or the translation agency. (Of course, this latter aspect raises some interesting questions of independent contractorship, but that’s another story for another day.)

Anyway, arguments/discussions/debates/discourses about TL-SL counts are now meaningless; the computer has become and is unquestionably the undisputed master and sole arbiter of this aspect of the translation process, and both the translation agencies and the hordes of CAT operators who labor for those agencies now worship fervently at the altar of this omniscient and omnipotent deity.

If my letter to my clients said anything, it said that I am of the age of the quill pen and parchment paper. And now I shall take leave of you to get into my horse-drawn carriage and repair to the blacksmith.

Always your Most Hbl., Dvt. & Obt. Svt.,

(Hell, if I am to be classified as a dinosaur, I just as well write like one, eh?)

Bernie Bierman