forked from acl-org/acl-anthology
-
Notifications
You must be signed in to change notification settings - Fork 0
/
2000.amta.xml
401 lines (401 loc) · 42.2 KB
/
2000.amta.xml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
<?xml version='1.0' encoding='UTF-8'?>
<collection id="2000.amta">
<volume id="tutorials" ingest-date="2021-05-14">
<meta>
<booktitle>Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Tutorial Descriptions</booktitle>
<publisher>Springer</publisher>
<address>Cuernavaca, Mexico</address>
<month>October 10-14</month>
<year>2000</year>
<editor><first>John S.</first><last>White</last></editor>
</meta>
<paper id="1">
<title>Ontological semantics</title>
<author><first>Sergei</first><last>Nirenburg</last></author>
<bibkey>nirenburg-2000-ontological</bibkey>
</paper>
<paper id="2">
<title>A gentle introduction to <fixed-case>MT</fixed-case>: theory and current practice</title>
<author><first>Eduard</first><last>Hovy</last></author>
<bibkey>hovy-2000-gentle</bibkey>
</paper>
<paper id="3">
<title>Controlled languages</title>
<author><first>Teruko</first><last>Mitamura</last></author>
<author><first>Eric</first><last>Nyberg</last></author>
<bibkey>mitamura-nyberg-2000-controlled</bibkey>
</paper>
<paper id="4">
<title>Statistical machine translation</title>
<author><first>Kevin</first><last>Knight</last></author>
<bibkey>knight-2000-statistical</bibkey>
</paper>
<paper id="5">
<title>The diversity and distribution of languages</title>
<author><first>Laurie</first><last>Gerber</last></author>
<bibkey>gerber-2000-diversity</bibkey>
</paper>
<paper id="6">
<title><fixed-case>MT</fixed-case>ranslatability</title>
<author><first>Arendse</first><last>Bernth</last></author>
<author><first>Claudia</first><last>Gdaniec</last></author>
<bibkey>bernth-gdaniec-2000-mtranslatability</bibkey>
</paper>
</volume>
<volume id="papers" ingest-date="2021-05-14">
<meta>
<booktitle>Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Technical Papers</booktitle>
<publisher>Springer</publisher>
<address>Cuernavaca, Mexico</address>
<month>October 10-14</month>
<year>2000</year>
<editor><first>John S.</first><last>White</last></editor>
</meta>
<paper id="1">
<title>Building a <fixed-case>C</fixed-case>hinese-<fixed-case>E</fixed-case>nglish mapping between verb concepts for multilingual applications</title>
<author><first>Bonnie J.</first><last>Dorr</last></author>
<author><first>Gina-Anne</first><last>Levow</last></author>
<author><first>Dekang</first><last>Lin</last></author>
<pages>1-12</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_1</url>
<abstract>This paper addresses the problem of building conceptual resources for multilingual applications. We describe new techniques for large-scale construction of a Chinese-English lexicon for verbs, using thematic-role information to create links between Chinese and English conceptual information. We then present an approach to compensating for gaps in the existing resources. The resulting lexicon is used for multilingual applications such as machine translation and cross-language information retrieval.</abstract>
<bibkey>dorr-etal-2000-building</bibkey>
</paper>
<paper id="2">
<title>Applying machine translation to two-stage cross-language information retrieval</title>
<author><first>Atsushi</first><last>Fujii</last></author>
<author><first>Tetsuya</first><last>Ishikawa</last></author>
<pages>13-24</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_2</url>
<abstract>Cross-language information retrieval (CLIR), where queries and documents are in different languages, needs a translation of queries and/or documents, so as to standardize both of them into a common representation. For this purpose, the use of machine translation is an effective approach. However, computational cost is prohibitive in translating large-scale document collections. To resolve this problem, we propose a two-stage CLIR method. First, we translate a given query into the document language, and retrieve a limited number of foreign documents. Second, we machine translate only those documents into the user language, and re-rank them based on the translation result. We also show the effectiveness of our method by way of experiments using Japanese queries and English technical documents.</abstract>
<bibkey>fujii-ishikawa-2000-applying</bibkey>
</paper>
<paper id="3">
<title>Mixed-initiative translation of Web pages</title>
<author><first>Michael</first><last>Fleming</last></author>
<author><first>Robin</first><last>Cohen</last></author>
<pages>25-29</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_3</url>
<abstract>A mixed-initiative system is one which allows more interactivity between the system and user, as the system is reasoning. We present some observations on the task of translating Web pages for users and suggest that a more interactive approach to this problem may be desirable. The aim is to interact with the user who is requesting the translation and the challenge is to determine the circumstances under which the user should be able to take the initiative to direct the processing or the system should be able to take the initiative to solicit further input from the user. In fact, we envision a need to support interactive translation of Web pages as the World Wide Web becomes more accessible to people with varying needs and abilities throughout the world.</abstract>
<bibkey>fleming-cohen-2000-mixed</bibkey>
</paper>
<paper id="4">
<title>A self-learning method of parallel texts alignment</title>
<author><first>António</first><last>Ribeiro</last></author>
<author><first>Gabriel</first><last>Lopes</last></author>
<author><first>João</first><last>Mexia</last></author>
<pages>30-39</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_4</url>
<abstract>This paper describes a language independent method for alignment of parallel texts that re-uses acquired knowledge. The system extracts word translation equivalents and re-uses them as correspondence points in order to enhance the alignment of parallel texts. Points that may cause misalignment are filtered using confidence bands of linear regression analysis instead of heuristics, which are not theoretically reliable. Homographs bootstrap the alignment process so as to build the primary word translation lexicon. At each step, the previously acquired lexicon is re-used so as to repeatedly make finer-grained alignments and produce more reliable translation lexicons.</abstract>
<bibkey>ribeiro-etal-2000-self</bibkey>
</paper>
<paper id="5">
<title>Handling structural divergences and recovering dropped arguments in a <fixed-case>K</fixed-case>orean/<fixed-case>E</fixed-case>nglish machine translation system</title>
<author><first>Chung-hye</first><last>Han</last></author>
<author><first>Benoit</first><last>Lavoie</last></author>
<author><first>Martha</first><last>Palmer</last></author>
<author><first>Owen</first><last>Rambow</last></author>
<author><first>Richard</first><last>Kittredge</last></author>
<author><first>Tanya</first><last>Korelsky</last></author>
<author><first>Nari</first><last>Kim</last></author>
<author><first>Myunghee</first><last>Kim</last></author>
<pages>40-53</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_5</url>
<abstract>This paper describes an approach for handling structural divergences and recovering dropped arguments in an implemented Korean to English machine translation system. The approach relies on canonical predicate-argument structures (or dependency structures), which provide a suitable pivot representation for the handling of structural divergences and the recovery of dropped arguments. It can also be converted to and from the interface representations of many off-the-shelf parsers and generators.</abstract>
<bibkey>han-etal-2000-handling</bibkey>
</paper>
<paper id="6">
<title>A machine translation system from <fixed-case>E</fixed-case>nglish to <fixed-case>A</fixed-case>merican <fixed-case>S</fixed-case>ign <fixed-case>L</fixed-case>anguage</title>
<author><first>Liwei</first><last>Zhao</last></author>
<author><first>Karin</first><last>Kipper</last></author>
<author><first>William</first><last>Schuler</last></author>
<author><first>Christian</first><last>Vogler</last></author>
<author><first>Norman</first><last>Badler</last></author>
<author><first>Martha</first><last>Palmer</last></author>
<pages>54-67</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_6</url>
<abstract>Research in computational linguistics, computer graphics and autonomous agents has led to the development of increasingly sophisticated communicative agents over the past few years, bringing new perspective to machine translation research. The engineering of language- based smooth, expressive, natural-looking human gestures can give us useful insights into the design principles that have evolved in natural communication between people. In this paper we prototype a machine translation system from English to American Sign Language (ASL), taking into account not only linguistic but also visual and spatial information associated with ASL signs.</abstract>
<bibkey>zhao-etal-2000-machine</bibkey>
</paper>
<paper id="7">
<title>Oxygen: a language independent linearization engine</title>
<author><first>Nizar</first><last>Habash</last></author>
<pages>68-79</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_7</url>
<abstract>This paper describes a language independent linearization engine, oxyGen. This system compiles target language grammars into programs that take feature graphs as inputs and generate word lattices that can be passed along to the statistical extraction module of the generation system Nitrogen. The grammars are written using a flexible and powerful language, oxyL, that has the power of a programming language but focuses on natural language realization. This engine has been used successfully in creating an English linearization program that is currently employed as part of a Chinese-English machine translation system.</abstract>
<bibkey>habash-2000-oxygen</bibkey>
</paper>
<paper id="8">
<title>Information structure transfer: bridging the information gap in structurally different languages</title>
<author><first>Margo</first><last>Budzikowa</last></author>
<pages>80-88</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_8</url>
<abstract>This paper presents the implementation part of my doctoral research at the University of Cambridge. It provides a description of the Information Structure Transfer (IST), a machine translation prototype designed within the framework of the Spoken Language Translator (SLT by SRI, Cambridge/Palo Alto) and based on the Core Language Engine ([1]). The IST includes two discourse-processing modules: the pre-transfer Information Structure Activator (ISA) and the post-transfer Information Structure Generator (ISG). The IST prototype calculates and processes vital features of information structure explored in context of structural differences between positional and nonpositional languages. It offers algorithmic solutions and an implementation framework for local discourse processing in machine translation. Under scrutiny is a web of interrelated factors such as pronominalization, anaphora resolution, zero anaphors, definiteness and constituent order.</abstract>
<bibkey>budzikowa-2000-information</bibkey>
</paper>
<paper id="9">
<title>The effect of source analysis on translation confidence</title>
<author><first>Arendse</first><last>Bernth</last></author>
<author><first>Michael C.</first><last>McCord</last></author>
<pages>89-99</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_9</url>
<abstract>Translations produced by an MT system can automatically be assigned a number that reflects the MT system’s confidence in their quality. We describe the design of such a confidence index, with focus on the contribution of source analysis, which plays a crucial role in many MT systems, including ours. Various problematic areas of source analysis are identified, and their impact on the overall confidence index is given. We will describe two methods of training the confidence index, one by hand-tuning of the heuristics, the other by linear regression analysis.</abstract>
<bibkey>bernth-mccord-2000-effect</bibkey>
</paper>
<paper id="10">
<title>Contemplating automatic <fixed-case>MT</fixed-case> evaluation</title>
<author><first>John S.</first><last>White</last></author>
<pages>100-108</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_10</url>
<abstract>Researchers, developers, translators and information consumers all share the problem that there is no accepted standard for machine translation. The problem is much further confounded by the fact that MT evaluations properly done require a considerable commitment of time and resources, an anachronism in this day of cross-lingual information processing when new MT systems may developed in weeks instead of years. This paper surveys the needs addressed by several of the classic “types” of MT, and speculates on ways that each of these types might be automated to create relevant, near-instantaneous evaluation of approaches and systems.</abstract>
<bibkey>white-2000-contemplating</bibkey>
</paper>
<paper id="11">
<title>How are you doing? A look at <fixed-case>MT</fixed-case> evaluation</title>
<author><first>Michelle</first><last>Vanni</last></author>
<author><first>Florence</first><last>Reeder</last></author>
<pages>109-116</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_11</url>
<abstract>Machine Translation evaluation has been more magic and opinion than science. The history of MT evaluation is long and checkered - the search for objective, measurable, resource-reduced methods of evaluation continues. A recent trend towards task-based evaluation inspires the question - can we use methods of evaluation of language competence in language learners and apply them reasonably to MT evaluation? This paper is the first in a series of steps to look at this question. In this paper, we will present the theoretical framework for our ideas, the notions we ultimately aim towards and some very preliminary results of a small experiment along these lines.</abstract>
<bibkey>vanni-reeder-2000-look</bibkey>
</paper>
<paper id="12">
<title>Recycling annotated parallel corpora for bilingual document composition</title>
<author><first>Arantza</first><last>Casillas</last></author>
<author><first>Joseba</first><last>Abaitua</last></author>
<author><first>Raquel</first><last>Martínez</last></author>
<pages>117-126</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_12</url>
<abstract>Parallel corpora enriched with descriptive annotations facilitate multilingual authoring development. Departing from an annotated bitext we show how SGML markup can be recycled to produce complementary language resources. On the one hand, several translation memory databases together with glossaries of proper nouns have been produced. On the other, DTDs for source and target documents have been derived and put into correspondence. This paper discusses how these resources have been automatically generated and applied to an interactive bilingual authoring system. This tool is capable of handling a substantial proportion of text both in the composition and translation of structured documents.</abstract>
<bibkey>casillas-etal-2000-recycling</bibkey>
</paper>
<paper id="13">
<title>Combining invertible example-based machine translation with translation memory technology</title>
<author><first>Michael</first><last>Carl</last></author>
<pages>127-136</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_13</url>
<abstract>This paper presents an approach to extract invertible trans- lation examples from pre-aligned reference translations. The set of in- vertible translation examples is used in the Example-Based Machine Translation (EBMT) system EDGAR for translation. Invertible bilin- gual grammars eliminate translation ambiguities such that each source language parse tree maps into only one target language string. The trans- lation results of EDGAR are compared and combined with those of a translation memory (TM). It is shown that i) best translation results are achieved for the EBMT system when using a bilingual lexicon to sup- port the alignment process ii) TMs and EBMT-systems can be linked in a dynamical sequential manner and iii) the combined translation of TMs and EBMT is in any case better than each of the single system.</abstract>
<bibkey>carl-2000-combining</bibkey>
</paper>
<paper id="14">
<title>What’s been forgotten in translation memory</title>
<author><first>Elliott</first><last>Macklovitch</last></author>
<author><first>Graham</first><last>Russell</last></author>
<pages>137-146</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_14</url>
<abstract>Although undeniably useful for the translation of certain types of repetitive document, current translation memory technology is limited by the rudimentary techniques employed for approximate matching. Such systems, moreover, incorporate no real notion of a document, since the databases that underlie them are essentially composed of isolated sentence strings. As a result, current TM products can only exploit a small portion of the knowledge residing in translators’ past production. This paper examines some of the changes that will have to be implemented if the technology is to be made more widely applicable.</abstract>
<bibkey>macklovitch-russell-2000-whats</bibkey>
</paper>
<paper id="15">
<title>Understanding politics by studying weather: a cognitive approach to representation of <fixed-case>P</fixed-case>olish verbs of motion, appearance, and existence</title>
<author><first>Barbara</first><last>Gawronska</last></author>
<author><first>Hannah</first><last>Duczak</last></author>
<pages>147-157</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_15</url>
<abstract>The paper deals with the question whether representations of verb semantics formulated on the basis of a lexically and syntactically restricted domain (weather forecasts) can apply to other, less restricted textual domains. An analysis of a group of Polish polysemous verbs of motion, existence and appearance inspired by cognitive semantics, especially the metaphor theory, is presented, and the usefulness of the conceptual representations of the Polish motion/appearance/existence verbs for automatic translation of texts belonging to less restricted domains is evaluated and discussed.</abstract>
<bibkey>gawronska-duczak-2000-understanding</bibkey>
</paper>
<paper id="16">
<title>Small but efficient: the misconception of high-frequency words in Scandinavian translation</title>
<author><first>Pernilla</first><last>Danielsson</last></author>
<author><first>Katarina</first><last>Mühlenbock</last></author>
<pages>158-168</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_16</url>
<abstract>Machine translation has proved itself to be easier between languages that are closely related, such as German and English, while far apart languages, such as Chinese and English, encounter much more problems. The present study focuses upon Swedish and Norwegian; two languages so closely related that they would be referred to as dialects if it were not for the fact that they had a Royal house and an army connected to each of them. Despite their similarity though, some differences make the translation phase much less straight-forward than what could be expected. Taking the outset in sentence aligned parallel texts, this study aims at highlighting some of the differences, and to formalise the results. In order to do so, the texts have been aligned on smaller units, by a simple cognate alignment method. Not at all surprising, the longer words were easier to align, while shorter and often high-frequent words became a problem. Also when trying to align to a specific word sense in a dictionary, content words rendered better results. Therefore, we abandoned the use of single-word units, and searched for multi-word units whenever possible. This study reinforces the view that Machine Translation should rest upon methods based on multiword unit searches.</abstract>
<bibkey>danielsson-muhlenbock-2000-small</bibkey>
</paper>
<paper id="17">
<title>Challenges in adapting an interlingua for bidirectional <fixed-case>E</fixed-case>nglish-<fixed-case>I</fixed-case>talian translation</title>
<author><first>Violetta</first><last>Cavalli-Sforza</last></author>
<author><first>Krzysztof</first><last>Czuba</last></author>
<author><first>Teruko</first><last>Mitamura</last></author>
<author><first>Eric</first><last>Nyberg</last></author>
<pages>169-178</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_17</url>
<abstract>We describe our experience in adapting an existing high- quality, interlingual, unidirectional machine translation system to a new domain and bidirectional translation for a new language pair (English and Italian). We focus on the interlingua design changes which were necessary to achieve high quality output in view of the language mismatches between English and Italian. The representation we propose contains features that are interpreted differently, depending on the translation direction. This decision simplified the process of creating the interlingua for individual sentences, and allows the system to defer mapping of language-specific features (such as tense and aspect), which are realized when the target syntactic feature structure is created. We also describe a set of problems we encountered in translating modal verbs, and discuss the representation of modality in our interlingua.</abstract>
<bibkey>cavalli-sforza-etal-2000-challenges</bibkey>
</paper>
<paper id="18">
<title>Text meaning representation as a basis for representation of text interpretation</title>
<author><first>Stephen</first><last>Helmreich</last></author>
<author><first>David</first><last>Farwell</last></author>
<pages>179-188</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_18</url>
<abstract>In this paper we propose a representation for what we have called an interpretation of a text. We base this representation on TMR (Text Meaning Representation), an interlingual representation developed for Machine Translation purposes. A TMR consists of a complex feature-value structure, with the feature names and filler values drawn from an ontology, in this case, ONTOS, developed concurrently with TMR. We suggest on the basis of previous work, that a representation of an interpretation of a text must build on a TMR structure for the text in several ways: (1) by the inclusion of additional required features and feature values (which may themselves be complex feature structures); (2) by pragmatically filling in empty slots in the TMR structure itself; and (3) by supporting the connections between feature values by including, as part of the TMR itself, the chains of inferencing that link various parts of the structure.</abstract>
<bibkey>helmreich-farwell-2000-text</bibkey>
</paper>
</volume>
<volume id="systems" ingest-date="2021-05-14">
<meta>
<booktitle>Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: System Descriptions</booktitle>
<publisher>Springer</publisher>
<address>Cuernavaca, Mexico</address>
<month>October 10-14</month>
<year>2000</year>
<editor><first>John S.</first><last>White</last></editor>
</meta>
<paper id="1">
<title><fixed-case>MT</fixed-case>-based transparent Arabization of the <fixed-case>I</fixed-case>nternet <fixed-case>TARJIM</fixed-case>.<fixed-case>COM</fixed-case></title>
<author><first>Achraf</first><last>Chalabi</last></author>
<pages>189-191</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_19</url>
<abstract>As is known, the majority of the actual textual content on the Internet is in English language. This represents an obstacle to those non-English speaking users willing to access the Internet. The idea behind this MT-based application is to allow any Arabic user to search and navigate through the Internet using Arabic language without the need to have prior knowledge of English language. The infrastructure of TARJIM.COM relies on 3 basic core components : 1- The Bi-directional English-Arabic Machine translation Engine, 2- The intelligent Web page layout preserving component and 3-The Search Engine query interceptor.</abstract>
<bibkey>chalabi-2000-mt</bibkey>
</paper>
<paper id="2">
<title>The <fixed-case>KANTOO</fixed-case> machine translation environment</title>
<author><first>Eric</first><last>Nyberg</last></author>
<author><first>Teruko</first><last>Mitamura</last></author>
<pages>192-195</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_20</url>
<abstract>In this paper we describe the KANTOO machine translation environment, a set of software services and tools for multilingual document production. KANTOO includes modules for source language analysis, target language generation, source terminology management, target terminology management, and knowledge source development. The KANTOOsystem represents a complete re-design and re-implementation of the KANT machine translation system.</abstract>
<bibkey>nyberg-mitamura-2000-kantoo</bibkey>
</paper>
<paper id="3">
<title>Pacific Rim portable translator</title>
<author><first>John</first><last>Weisgerber</last></author>
<author><first>Jin</first><last>Yang</last></author>
<author><first>Pete</first><last>Fisher</last></author>
<pages>196-201</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_21</url>
<abstract>ARL’s FALCon system has proven its integrated OCR and MT technology to be a valuable asset to soldiers in the field in both Bosnia and Haiti. Now it is being extended to include six more SYSTRAN language pairs in response to the military’s need for automatic translation capabilities as they pursue US national objectives in East Asia. The Pacific Rim Portable Translator will provide robust automatic translation bidirectionally for English, Chinese, Japanese, and Korean, which will allow not only rapid assimilation of foreign information, but two-way communication as well for both the public and private sectors.</abstract>
<bibkey>weisgerber-etal-2000-pacific</bibkey>
</paper>
<paper id="4">
<title><fixed-case>L</fixed-case>abel<fixed-case>T</fixed-case>ool: a localization application for devices with restricted display areas</title>
<author><first>Jimmy C.M.</first><last>Lu</last></author>
<author><first>Lars</first><last>Åkerman</last></author>
<author><first>Karin</first><last>Spalink</last></author>
<pages>202-208</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_22</url>
<abstract>The LabelTool/TrTool system is designed to administer text strings that are shown in devices with a very limited display area and translated into a very large number of foreign languages. Automation of character set handling and file naming and storage together with real–time simulation of text string input are the main features of this application.</abstract>
<bibkey>lu-etal-2000-labeltool</bibkey>
</paper>
<paper id="5">
<title>The <fixed-case>L</fixed-case>ogo<fixed-case>V</fixed-case>ista <fixed-case>ES</fixed-case> translation system</title>
<author><first>Nan</first><last>Decker</last></author>
<pages>209-212</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_23</url>
<abstract>The LogoVista ES translation system translates English text to Spanish. It is a member of LEC’s family of translation tools and uses the same engine as LogoVista EJ. This engine, which has been under development for ten years, is heavily linguistic and rule-based. It includes a very large, highly annotated English dictionary that contains detailed syntactic, semantic and domain information; a binary parser that produces multiple parses for each sentence; a 12,000+-rule, context-free English grammar; and a synthesis file of rules that convert each parsed English structure into a Spanish structure. The main tasks involved in developing a new language pair include the addition of target-language translations to the dictionary and the addition of rules to the synthesis file. The system’s modular design allows the work to be carried out by linguists, independent of engineers.</abstract>
<bibkey>decker-2000-logovista</bibkey>
</paper>
<paper id="6">
<title><fixed-case>L</fixed-case>&<fixed-case>H</fixed-case> lexicography toolkit for machine translation</title>
<author><first>Timothy</first><last>Meekhof</last></author>
<author><first>David</first><last>Clements</last></author>
<pages>213-218</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_24</url>
<abstract>One of the most important components of any machine translation system is the translation lexicon. The size and quality of the lexicon, as well as the coverage of the lexicon for a particular use, greatly influence the applicability of machine translation for a user. The high cost of lexicon development limits the extent to which even mature machine translation vendors can expand and specialize their lexicons, and frequently prevents users from building extensive lexicons at all. To address the high cost of lexicography for machine translation, L&H is building a Lexicography Toolkit that includes tools that can significantly improve the process of creating custom lexicons. The toolkit is based on the concept of using automatic methods of data acquisition, using text corpora, to generate lexicon entries. Of course, lexicon entries must be accurate, so the work of the toolkit must be checked by human experts at several stages. However, this checking mostly consists of removing erroneous results, rather than adding data and entire entries. This article will explore how the Lexicography Toolkit would be used to create a lexicon that is specific to the user’s domain.</abstract>
<bibkey>meekhof-clements-2000-l</bibkey>
</paper>
<paper id="7">
<title>A new look for the <fixed-case>PAHO</fixed-case> <fixed-case>MT</fixed-case> system</title>
<author><first>Marjorie</first><last>León</last></author>
<pages>219-222</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_25</url>
<abstract>This paper describes some of the features of the new 32-bit Windows version of PAHO’s English-Spanish (ENGSPAN®) and Spanish-English (SPANAM®) machine translation software. The new dictionary update interface is designed to help users add their own terminology to the lexicon and encourage them to write context-sensitive rules to improve the quality of the output. Expanded search capabilities provide instant access to related source and target entries, expressions, and rules. A live system demonstration will accompany this presentation.</abstract>
<bibkey>leon-2000-new</bibkey>
</paper>
</volume>
<volume id="studies" ingest-date="2021-05-14">
<meta>
<booktitle>Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: User Studies</booktitle>
<publisher>Springer</publisher>
<address>Cuernavaca, Mexico</address>
<month>October 10-14</month>
<year>2000</year>
<editor><first>John S.</first><last>White</last></editor>
</meta>
<paper id="1">
<title>Is <fixed-case>MT</fixed-case> software documentation appropriate for <fixed-case>MT</fixed-case> users?</title>
<author><first>David</first><last>Mowatt</last></author>
<author><first>Harold</first><last>Somers</last></author>
<pages>223-238</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_26</url>
<abstract>This paper discusses an informal methodology for evaluating Machine Translation software documentation with reference to a case study, in which a number of currently available MT packages are evaluated. Different types of documentation style are discussed, as well as different user profiles. It is found that documentation is often inadequate in identifying the level of linguistic background and knowledge necessary to use translation software, and in explaining technical (linguistic) terms needed to use the software effectively. In particular, the level of knowledge and training needed to use the software is often incompatible with the user profile implied by the documentation. Also, guidance on how to perform more complex tasks, which may be especially idiosyncratic, is often inadequate or missing altogether.</abstract>
<bibkey>mowatt-somers-2000-mt</bibkey>
</paper>
<paper id="2">
<title>Evaluating embedded machine translation in military field exercises</title>
<author><first>M.</first><last>Holland</last></author>
<author><first>C.</first><last>Schlesinger</last></author>
<author><first>C.</first><last>Tate</last></author>
<pages>239-247</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_27</url>
<abstract>“Embedded” machine translation (MT) refers to an end-to-end computational process of which MT is one of the components. Integrating these components and evaluating the whole has proved to be problematic. As an example of embedded MT, we describe a prototype system called Falcon, which permits paper documents to be scanned and translated into English. MT is thus embedded in the preprocessing of hardcopy pages and subject to its noise. Because Falcon is intended for use by people in the military who are trying to screen foreign documents, and not to understand them in detail, its application makes low demands on translation quality. We report on a series of user trials that speak to the utility of embedded MT in army tasks.</abstract>
<bibkey>holland-etal-2000-evaluating</bibkey>
</paper>
<paper id="3">
<title>Machine translation systems: <fixed-case>E</fixed-case>-K, K-<fixed-case>E</fixed-case>, <fixed-case>J</fixed-case>-K, K-<fixed-case>J</fixed-case></title>
<author><first>Yu Seop</first><last>Kim</last></author>
<author><first>Sung Dong</first><last>Kim</last></author>
<author><first>Seong Bae</first><last>Park</last></author>
<author><first>Jong Woo</first><last>Lee</last></author>
<author><first>Jeong Ho</first><last>Chang</last></author>
<author><first>Kyu Baek</first><last>Hwang</last></author>
<author><first>Min O</first><last>Jang</last></author>
<author><first>Yung Taek</first><last>Kim</last></author>
<pages>248-251</pages>
<url>https://link.springer.com/chapter/10.1007/3-540-39965-8_28</url>
<abstract>We present four kinds of machine translation system in this description: E-K (English to Korean), K-E (Korean to English), J-K (Japanese to Korean), K-J (Korean to Japanese). Among these, E-K and K-J translation systems are published commercially, and the other systems have finished their development. This paper describes the structure and function of each system with figures and translation results.</abstract>
<bibkey>kim-etal-2000-machine</bibkey>
</paper>
</volume>
<volume id="workshop" ingest-date="2021-05-14">
<meta>
<booktitle>Proceedings of the Workshop on Machine translation in practice: from old guard to new guard</booktitle>
<publisher>Springer</publisher>
<address>Cuernavaca, Mexico</address>
<month>October 10-14</month>
<year>2000</year>
<editor><first>John S.</first><last>White</last></editor>
</meta>
<paper id="1">
<title>The foreign language challenge in the <fixed-case>USG</fixed-case> and machine translation.</title>
<author><first>Kathleen</first><last>Egan</last></author>
<url hash="a24b99f0">2000.amta-workshop.1</url>
<abstract>The internet is no longer English only. The data is voluminous and the number of proficient linguists cannot match the day to day needs of several government agencies. Handling foreign languages is not limited to translating documents but goes beyond the journalistic written formats. Military, diplomatic and official interactions in the US and abroad require more than one or two foreign language skills. The CHALLENGE is both managing the user’s expectations and stimulating new areas for MT research and development.</abstract>
<bibkey>egan-2000-foreign</bibkey>
</paper>
<paper id="2">
<title>Machine translation, the road ahead</title>
<author><first>Walter</first><last>Hartmann</last></author>
<url hash="bedc64ef">2000.amta-workshop.2</url>
<abstract>The application of MT on the Internet has certainly attracted much attention in recent years, and many observers see its future mostly in this arena of real-time raw translation. However, the need for high-volume, fast turn-around translation of publication quality has not abated. This paper will take stock of that particular use of MT and venture predictions as to its future.</abstract>
<bibkey>hartmann-2000-machine</bibkey>
</paper>
<paper id="3">
<title>Educational implications of a machine translation system</title>
<author><first>Tatyana</first><last>Gurina</last></author>
<url hash="5b0ede77">2000.amta-workshop.3</url>
<abstract>his paper is concerned with the technology of using the PARS English-Russian bi- directional machine translation systems in teaching English as a foreign language. This technology has no connection with the old form of computer-assisted language learning which uses «drill-and-practice» computer exercises and provides a sort of surrogate «electronic teacher». The main objective of the educational implication of PARS is to help the learner become familiar with the words in their normal contexts. The introduction of a machine translation system into teaching foreign languages is intended to get the most fruitful pedagogical results from the use of personal computers and expose the learners to the up-to-date information technologies.</abstract>
<bibkey>gurina-2000-educational</bibkey>
</paper>
<paper id="4">
<title>The Use of <fixed-case>ENGSPAN</fixed-case> at the Pan <fixed-case>A</fixed-case>merican Health Organization: A Reviser’s Perspective</title>
<author><first>Gustavo</first><last>Silva</last></author>
<url hash="ff5c41da">2000.amta-workshop.4</url>
<abstract>ENGSPAN, a machine translation program (English-Spanish), has been used by the Translation Services unit of the Pan American Health Organization since 1985. In 1999, a total of 2,106,178 words were translated in that language combination, 86% of which were done with the help of ENGSPAN; the cost per word was 8.75 cents, that is, 31% below the normal rate. These positive results are explained by a combination of factors: the use of an MT program especially designed to meet the needs of the institution; the close collaboration of translators and computational linguists in the improvement of the program; the application of a pragmatic, flexible, and selective approach with regard to the quality of the end product; and in particular the support of competent translators who do the postediting work.</abstract>
<bibkey>silva-2000-use</bibkey>
</paper>
<paper id="5">
<title>Wired for peace and multi-language communication</title>
<author><first>Vladimir</first><last>Oboronko</last></author>
<url hash="32cce533">2000.amta-workshop.5</url>
<abstract>Our project Wired for Peace: Virtual Diplomacy in Northeast Asia (Http://www- neacd.ucsd.edu/) has as its main aim to provide policymakers and researchers of the U.S., China, Russia, Japan, and Korea with Internet based tools to allow for continuous communication on issues of the regional security and cooperation. Since the very beginning of the project, we have understood that Web-based translation between English and Asian languages would be one of the most necessary tools for successful development of the project. With this understanding, we have partnered with Systran (www.systransoft.com), one of the leaders in MT field, in order to develop Internet-based tools for both synchronous and asynchronous translation of texts and discussions. This submission is a report on a work in progress.</abstract>
<bibkey>oboronko-2000-wired</bibkey>
</paper>
<paper id="6">
<title>Automated translation for the deployment of dynamic and mission critical content</title>
<author><first>Bob</first><last>Sheng</last></author>
<url hash="a24b99f0">2000.amta-workshop.6</url>
<abstract>The Internet is a wonderful medium that frees its users from the confines of geographic boundaries. While the acceptance of the Internet is pervasive, the language barrier is somewhat tougher to overcome. Several options exists on the market to deliver multilingual content, few solutions can stand up to the dynamic demand of a modern website. Language context, translation turnaround times, and various business models are all barriers to creating a total solution for globalization and localization of websites. We will examine the difficulties in localizing a dynamic website and discuss the challenges we have overcome to create a dynamic translation platform.</abstract>
<bibkey>sheng-2000-automated</bibkey>
</paper>
<paper id="7">
<title>What do we need from <fixed-case>MT</fixed-case>?</title>
<abstract/>
<bibkey>nn-2000-need</bibkey>
</paper>
</volume>
</collection>