Machine translation and its evaluation: a study

  • Published: 19 February 2023
  • Volume 56 , pages 10137–10226, ( 2023 )

Cite this article

thesis on machine translation

  • Subrota Kumar Mondal   ORCID: orcid.org/0000-0002-0008-7797 1 ,
  • Haoxi Zhang 1 ,
  • H. M. Dipu Kabir 2 ,
  • Kan Ni 1 &
  • Hong-Ning Dai 3  

1628 Accesses

9 Citations

Explore all metrics

Machine translation (namely MT) has been one of the most popular fields in computational linguistics and Artificial Intelligence (AI). As one of the most promising approaches, MT can potentially break the language barrier of people from all over the world. Despite a number of studies in MT, there are few studies in summarizing and comparing MT methods. To this end, in this paper, we principally focus on presenting the two mainstream MT schemes: statistical machine translation (SMT) and neural machine translation (NMT), including their basic rationales and developments. Meanwhile, the detailed translation models are also presented, such as the word-based model, syntax-based model, and phrase-based model in statistical machine translation. Similarly, approaches in NMT, such as the recurrent neural network-based, attention mechanism-based, and transformer-based models are presented. Last but not least, the evaluation approaches also play an important role in helping developers to improve their methods better in MT. The prevailing machine translation evaluation methodologies are also presented in this article.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save.

  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

thesis on machine translation

Similar content being viewed by others

thesis on machine translation

Neural machine translation: past, present, and future

thesis on machine translation

A Comprehensive Survey on Various Fully Automatic Machine Translation Evaluation Metrics

thesis on machine translation

Text-Text Neural Machine Translation: A Survey

SYSTRAN https://www.systran.net/en/translate/ .

Apertium https://www.apertium.org/index.eng.html .

GramTrans https://gramtrans.com/ .

Google AI Blog https://ai.googleblog.com/2016/09/a-neural-network-for-machine.html .

Microsoft Translator Blog https://www.microsoft.com/en-us/translator/blog/2016/11/15/microsoft-translator-launching-neural-network-based-translations-for-all-its-speech-languages/ .

Moses https://www.statmt.org/moses/ .

www.promt.com.

www.systransoft.com.

Machine Translation Research at Google https://research.google/pubs/?area=machine-translation .

Machine Translation Research at Microsoft https://www.microsoft.com/en-us/research/group/machine-translation-group/ .

Machine Translation Research at Meta AI https://ai.facebook.com/research/NLP .

OpenNMT https://opennmt.net/ .

Google Translate https://translate.google.com/ .

Europarl Datasets https://www.statmt.org/europarl/ .

WMT’14 Translation Task https://www.statmt.org/wmt14/translation-task.html .

WMT’15 Translation Task https://www.statmt.org/wmt15/translation-task.html .

Hugging Face Transformers Index https://huggingface.co/docs/transformers/index .

C4 Dataset https://www.tensorflow.org/datasets/catalog/c4 .

WMT Workshop https://www.statmt.org/ .

Statistical and Neural Machine Translation \(\rightarrow\) Events https://www.statmt.org/ .

WMT22 https://www.statmt.org/wmt22/ , WMT21, WMT20, and so on.

Google Research: Machine Translation https://research.google/research-areas/machine-translation/ .

Abbreviations

Machine translation

  • Natural Language Processing

Rule-based Machine Translation

Corpus-based Machine Translation

  • Statistical machine translation

Example-based Machine Translation

Hybrid Machine Translation

  • Neural machine translation

Expectation-Maximization

Word-based Machine Translation

Syntax-based Machine Translation

Phrase-based Machine Translation

Context-Free Grammar

Synchronous Context-Free Grammar

Inversion Transduction Grammar

Translation Edit Rat

Human-targeted Translation Edit Rat

Multi-reference TER

Google’s Neural Machine Translation

Backpropagation

Back-Translation

Neural Network

Convolutional Neural Network

Recurrent Neural Network

Transformer Neural Network

Gate Recurrent Unit

Long-Short Term Memory

Bidirectional Encoder Representations from Transformers

Low-Resource Language

High-Resource Language

Automatic Language Processing Advisory Committee

Defense Advanced Research Projects Agency

Bilingual Evaluation Understudy

National Institute of Standards and Technology

Metric for Evaluation of Translation with Explicit ORdering

Recall-Oriented Understudy for Gisting Evaluation

Open Machine Translation Evaluation

Text-To-Text Transfer Transformer

ACL (2022) ACL 2014 NINTH WORKSHOP ON STATISTICAL MACHINE TRANSLATION. Available at https://www.statmt.org/wmt14/translation-task.html . Accessed 05 Apr 2022

ACL 2015 NINTH WORKSHOP ON STATISTICAL MACHINE TRANSLATION (2022) https://www.statmt.org/wmt15/translation-task.html . Accessed 05 Apr 2022

Abid A, Farooqi M, Zou J (2021) Persistent anti-muslim bias in large language models. In: Proceedings of the 2021 AAAI/ACM conference on AI, Ethics, and Society, pp 298–306

Agency DARP (2003) Program Translingual Information Detection, Extraction and Summarization. http://www.darpa.mil/ipto/programs/tides/ . Accessed 08 Apr 2022

Ahmadnia B, Dorr BJ (2019) Augmenting neural machine translation through round-trip training approach. Open Comput Sci 9(1):268–278

Google Scholar  

Ahmadnia B, Serrano J, Haffari G (2017) Persian-Spanish low-resource statistical machine translation through English as pivot language. Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017:24–30

Ahmadnia B, Haffari G, Serrano J (2019) Round-trip training approach for bilingually low-resource statistical machine translation systems. Int J Artif Intell 17(1):167–185

Ahmadnia B, Dorr BJ, Kordjamshidi P (2020) Knowledge graphs effectiveness in neural machine translation improvement. Comput Sci 21:299–318

Ahmed A, Hanneman G (2005) Syntax-based statistical machine translation: a review. Comput Linguist 1:1

Álvaro Rocha, Adeli H, Reis LP, Costanzo S (2018) Trends and advances in information systems and technologies, vol 2. Springer, Berlin

Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: a nucleus for a web of open data. In: The semantic web, Springer, pp 722–735

Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473

Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, pp 65–72

Bapna A, Firat O, Wang P, Macherey W, Cheng Y, Cao Y (2022) Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation. In: ACL 2022

Barzilay R, Koehn P (2004) Natural Language Processing, Fall 2004, Machine Translation I, Lecture 20. CS an AI Lab, MIT, New York

Bender EM (2019) A typology of ethical risks in language technology with an eye towards where transparent documentation can help. In: Future of artificial intelligence: language, ethics, technology workshop

Benesty J, Chen J, Huang Y, Cohen I (2009) Pearson correlation coefficient. In: Noise reduction in speech processing, Springer, pp 1–4

Bentivogli L, Bisazza A, Cettolo M, Federico M (2016) Neural versus phrase-based machine translation quality: a case study. arXiv preprint arXiv:1608.04631

Besacier L, Blanchon H (2017) Comparing statistical machine translation and neural machine translation performances. https://evaluerlata.hypotheses.org/files/2017/07/Laurent-Besacier-NMTvsSMT.pdf , laboratoire LIG, Université Grenoble Alpes, France

Bick E (2007) Dan2eng: wide-coverage Danish-English machine translation. In: Proceedings of Machine Translation Summit XI: Papers

Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data, pp 1247–1250

Bouchard G (2007) Efficient bounds for the softmax function and applications to approximate inference in hybrid models. In: NIPS 2007 workshop for approximate Bayesian inference in continuous/hybrid systems

Brown PF, Pietra VJD, Pietra SAD, Mercer RL (1993) The mathematics of statistical machine translation: parameter estimation. Comput Linguist 19(2):263–311

Buck C, Heafield K, Van Ooyen B (2014) N-gram counts and language models from the common crawl. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), pp 3579–3584

Canfora C, Ottmann A (2020) Risks in neural machine translation. Trans Spaces 9(1):58–77

Casas N, Costa-jussà MR, Fonollosa JA, Alonso JA, Fanlo R (2021) Linguistic knowledge-based vocabularies for Neural Machine Translation. Nat Lang Eng 27(4):485–506

Caswell I, Liang B (2022) Recent Advances in Google Translate. Tutorial https://ai.googleblog.com/2020/06/recent-advances-in-google-translate.html . Accessed 05 Apr 2022

Chah N (2018) OK Google, what is your ontology? Or: exploring freebase classification to understand Google’s Knowledge Graph. arXiv preprint arXiv:1805.03885

Chapelle O, Weston J, Bottou L, Vapnik V (2000) Vicinal risk minimization. Adv Neural Inf Process Syst 13:1

Cheng Y (2019) Joint training for pivot-based neural machine translation. In: Joint Training for Neural Machine Translation, Springer, pp 41–54

Cheng Y, Jiang L, Macherey W (2019) Robust Neural Machine Translation with Doubly Adversarial Inputs. In: ACL

Cheng Y, Jiang L, Macherey W, Eisenstein J (2020) AdvAug: Robust Adversarial Augmentation for Neural Machine Translation. In: ACL, https://arxiv.org/abs/2006.11834

Chiang D, Knight K (2006) An introduction to synchronous grammars. Tutorial https://www3.nd.edu/~dchiang/papers/synchtut.pdf . Accessed 4 Jan 2022

Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014a) Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078

Cho K, Van Merriënboer B, Bahdanau D, Bengio Y (2014b) On the properties of neural machine translation: encoder-decoder approaches. arXiv preprint arXiv:1409.1259

Colah (2015) Understanding LSTM networks. http://colah.github.io/posts/2015-08-Understanding-LSTMs/ . Accessed 08 Apr 2022

Common Crawl corpus (2022) https://commoncrawl.org/ . Accessed 30 Sep 2022

Corbí-Bellot AM, Forcada ML, Ortiz-Rojas S, Pérez-Ortiz JA, Ramírez-Sánchez G, Sánchez-Martínez F, Alegria I, Mayor A, Sarasola K (2005) An open-source shallow-transfer machine translation engine for the Romance languages of Spain. In: Proceedings of the 10th EAMT conference: practical applications of machine translation

Costa-Jussa MR, Fonollosa JA (2015) Latest trends in hybrid machine translation and its applications. Comput Speech Lang 32(1):3–10

Council NR (1416) Language and Machines: Computers in Translation and Linguistics; a Report. National Research Council, https://books.google.com/books?hl=zh-CN &lr= &id=Q0ErAAAAYAAJ &oi=fnd &pg=PA1 &dq=Languages+and+machines:+computers+in+translation+and+linguistics &ots=NgytafcXa- &sig=Hc733OYAAT89yd4U-3xLdh77gEM#v=onepage &q &f=false . Accessed 08 Apr 2022

Cui Y, Chen Z, Wei S, Wang S, Liu T, Hu G (2016) Attention-over-attention neural networks for reading comprehension. arXiv preprint arXiv:1607.04423

Currey A, Heafield K (2019) Zero-resource neural machine translation with monolingual pivot data. In: Proceedings of the 3rd workshop on neural generation and translation, pp 99–107

Dabre R, Cromieres F, Kurohashi S, Bhattacharyya P (2015) Leveraging small multilingual corpora for smt using many pivot languages. In: Proceedings of the 2015 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 1192–1202

Dabre R, Imankulova A, Kaneko M, Chakrabarty A (2021) Simultaneous multi-pivot neural machine translation. arXiv preprint arXiv:2104.07410

Dajun Z, Yun W (2015) Corpus-based machine translation: Its current development and perspectives. In: International Forum of Teaching and Studies, American Scholars Press, Inc., vol 11, p 90

Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39(1):1–22

MathSciNet   MATH   Google Scholar  

Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805

Doddington G (2002) Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the second international conference on Human Language Technology Research, Morgan Kaufmann Publishers Inc., pp 138–145

Edunov S, Ott M, Auli M, Grangier D (2018) Understanding back-translation at scale. arXiv preprint arXiv:1808.09381

Esplà-Gomis M, Forcada ML, Ramírez-Sánchez G, Hoang H (2019) ParaCrawl: Web-scale parallel corpora for the languages of the EU. In: Proceedings of Machine Translation Summit XVII: Translator, Project and User Tracks, pp 118–119

Europarl (2022) Europarl: European Parliament Proceedings Parallel Corpus. https://www.statmt.org/europarl/ . Accessed 30 Sep 2022

Forcada ML, Ginestí-Rosell M, Nordfalk J, O’Regan J, Ortiz-Rojas S, Pérez-Ortiz JA, Sánchez-Martínez F, Ramírez-Sánchez G, Tyers FM (2011) Apertium: a free/open-source platform for rule-based machine translation. Mach Trans 25(2):127–144

Freitag M, Torres DV, Grangier D, Cherry C, Foster G (2022) A Natural Diet: Towards Improving Naturalness of Machine Translation Output. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Online

Furuse O, Iida H (1992) An example-based method for transfer-driven machine translation. TMI 1992:139–150

Färber M, Ell B, Menne C, Rettinger A (2015) A comparative survey of dbpedia, freebase, opencyc, wikidata, and yago. Semantic Web J 1(1):1–5

Gehring J, Auli M, Grangier D, Yarats D, Dauphin YN (2017) Convolutional sequence to sequence learning. In: Proceedings of the 34th International Conference on Machine Learning-Volume 70, JMLR. org, pp 1243–1252

Gheini M, Ren X, May J (2021a) Cross-attention is all you need: adapting pretrained transformers for machine translation. In: Proceedings of the 2021 conference on Empirical Methods in Natural Language Processing, pp 1754–1765

Gheini M, Ren X, May J (2021b) On the strengths of cross-attention in pretrained transformers for machine translation

Gispert Ramis A (2007) Introducing linguistic knowledge into statistical machine translation. Universitat Politècnica de Catalunya

Graves A (2012) Long short-term memory. In: Supervised sequence labelling with recurrent neural networks. Springer, Berlin. pp 37–45

Guo Z, Huang Z, Zhu KQ, Chen G, Zhang K, Chen B, Huang F (2021) Automatically paraphrasing via sentence reconstruction and round-trip translation. IJCAI

Habash N, Zalmout N, Taji D, Hoang H, Alzate M (2017) A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages. In: Proceedings of the 15th conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, Association for Computational Linguistics, Valencia, Spain, pp 235–241. https://aclanthology.org/E17-2038

Hardmeier C (2012) Discourse in statistical machine translation. a survey and a case study. Discours Revue de linguistique, psycholinguistique et informatique A journal of linguistics, psycholinguistics and computational linguistics (11)

He D, Xia Y, Qin T, Wang L, Yu N, Liu TY, Ma WY (2016) Dual learning for machine translation. Adv Neural Inf Process Syst 29

Hecht-Nielsen R (1992) Theory of the backpropagation neural network. In: Neural networks for perception, Elsevier, pp 65–93

Hill DC, Gombay C, Sanchez O, Woappi B, Romero Vélez AS, Davidson S, Richardson EZ (2022) Lost in machine translation: The promises and pitfalls of machine translation for multilingual group work in global health education. Discov Educ 1(1):1–5

Horváth I (2022) AI in interpreting: ethical considerations. Across Lang Cult 23(1):1–13

Huck M, Birch A (2015) The Edinburgh machine translation systems for IWSLT 2015. In: Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign

Hull DA, Grefenstette G (1996) Querying across languages: a dictionary-based approach to multilingual information retrieval. In: Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, ACM, pp 49–57

IWSLT (2018) International Workshop on Spoken Language Translation. https://workshop2018.iwslt.org/ . Accessed 08 Apr 2022

Imankulova A, Sato T, Komachi M (2019) Filtered pseudo-parallel corpus improves low-resource neural machine translation. ACM Trans Asian Low-Resour Lang Inf Process 19(2):1–16

Islam M, Anik M, Hoque S, Islam A et al (2021) Towards achieving a delicate blending between rule-based translator and neural machine translator. Neural Comput Appl 33(18):12141–12167

Jean S, Cho K, Memisevic R, Bengio Y (2015) On Using Very Large Target Vocabulary for Neural Machine Translation. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp 1–10

Jehl L, Simianer P, HIrschler J, Riezler S (2015) The Heidelberg university English-German translation system for IWSLT 2015. In: Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign

Jian Zhang MZ Ji Wu (2003) The improvement of automatic machine translation evaluation. J Chin Inf Process 17(6):2, http://jcip.cipsc.org.cn/CN/abstract/article_1823.shtml

Johnson M (2020) A scalable approach to reducing gender bias in Google translate, https://ai.googleblog.com/2020/04/a-scalable-approach-to-reducing-gender.html . Google AI Blog Accessed on 30 Sep 2022

Jurafsky D, Martin JH (2022) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, chapter 10: machine translation and encoder-decoder models, 10.9 Bias and Ethical Issues (3rd (draft) ed.). Accessed on 08 Apr 2022

Kalchbrenner N, Espeholt L, Simonyan K, Oord Avd, Graves A, Kavukcuoglu K (2016) Neural machine translation in linear time. arXiv preprint arXiv:1610.10099

Kaljahi RSZ, Rubino R, Roturier J, Foster J, Park BB (2012) A detailed analysis of phrase-based and syntax-based machine translation: The search for systematic differences. In: Proceedings of AMTA

Karim R (2019) Illustrated attention. https://towardsdatascience.com/attn-illustrated-attention-5ec4ad276ee3 . Accessed 08 Apr 2022

Kenny D, Moorkens J, Do Carmo F (2020) Fair MT: towards ethical, sustainable machine translation. Trans Spaces 9(1):1–11

Kharitonova K (2021) Linguistics4fairness: neutralizing Gender Bias in neural machine translation by introducing linguistic knowledge. Master’s thesis, Universitat Politècnica de Catalunya

Klakow D, Peters J (2002) Testing the correlation of word error rate and perplexity. Speech Commun 38(1–2):19–28

MATH   Google Scholar  

Klein G, Kim Y, Deng Y, Senellart J, Rush AM (2017) Opennmt: Open-source toolkit for neural machine translation. arXiv preprint arXiv:1701.02810

Ko WJ, El-Kishky A, Renduchintala A, Chaudhary V, Goyal N, Guzmán F, Fung P, Koehn P, Diab M (2021) Adapting high-resource NMT models to translate low-resource related languages without parallel data. arXiv preprint arXiv:2105.15071

Koehn P (2009) Statistical machine translation. Cambridge University Press

Koehn P, Chiang D (2019) Special interest group of machine translation. http://www.sigmt.org . Accessed 08 Apr 2022

Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Cowan B, Shen W, Moran C, Zens R et al (2007) Moses: Open source toolkit for statistical machine translation. In: Proceedings of the 45th annual meeting of the association for computational linguistics companion volume proceedings of the demo and poster sessions, pp 177–180

Koehn P (2009) Chapter 4 Word-based models - Statistical Machine Translation. Cambridge University Press, Cambridge. Accessed 08 Apr 2022

Koehn P (2009) Statistical Machine Translation Lecture 5 Syntax-Based Models. Cambridge University Press, Cambridge. Accessed 08 Apr 2022

Koehn P, Och FJ, Marcu D (2003) Statistical phrase-based translation. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, Association for Computational Linguistics, pp 48–54

Koehn P (2005) Europarl: A parallel corpus for statistical machine translation. In: Proceedings of machine translation summit x: papers, pp 79–86

Koehn P (2009) Statistical Machine Translation Lecture 6 Decoding. Cambridge University Press, Cambridge. Accessed 08 Apr 2022

Kussul E, Baidyk T, Kasatkina L, Lukovich V (2001) Rosenblatt perceptrons for handwritten digit recognition. In: IJCNN’01. International Joint Conference on Neural Networks. Proceedings (Cat. No. 01CH37222), IEEE, vol 2, pp 1516–1520

Labaka G, España-Bonet C, Màrquez L, Sarasola K (2014) A hybrid machine translation architecture guided by syntax. Mach Trans 28(2):91–125

Lample G, Conneau A (2019) Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291

Lavie A, Sagae K, Jayaraman S (2004) The significance of recall in automatic metrics for MT evaluation. In: Conference of the Association for Machine Translation in the Americas, Springer, pp 134–143

Li Y, Xiong D, Zhang M (2018) A survey of neural machine translation. Chinese Journal of Computers 12:2734

MathSciNet   Google Scholar  

Li Q, Zhang X, Xiong J, Hwu Wm, Chen D (2019) Implementing neural machine translation with bi-directional GRU and attention mechanism on FPGAs using HLS. In: Proceedings of the 24th Asia and South Pacific Design Automation Conference, pp 693–698

Lin CY (2004) Rouge: A package for automatic evaluation of summaries. In: Text summarization branches out, pp 74–81

Lin CY, Hovy E (2003) Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of the 2003 human language technology conference of the North American chapter of the association for computational linguistics, pp 150–157

Lin CY, Och FJ (2004) Automatic evaluation of machine translation quality using longest common subsequence and skip-bigram statistics. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04), pp 605–612

Liu J (2020) Comparing and analyzing cohesive devices of SMT and NMT from Chinese to English: a diachronic approach. Open J Mod Linguist 10(6):765–772

Liu CH, Silva CC, Wang L, Way A (2018) Pivot machine translation using chinese as pivot language. In: China workshop on machine translation. Springer, Berlin. pp 74–85

Liu X, Wang Y, Wang X, Xu H, Li C, Xin X (2021) Bi-directional gated recurrent unit neural network based nonlinear equalizer for coherent optical communication system. Opt Express 29(4):5923–5933

Lu Y, Zhang J, Zong C (2018) Exploiting knowledge graph in neural machine translation. In: China workshop on machine translation. Springer, Berlin. pp 27–38

Luccioni A, Viviano J (2021) What’s in the box? an analysis of undesirable content in the Common Crawl corpus. In: Proceedings of the 59th annual meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp 182–189

Luong MT, Sutskever I, Le QV, Vinyals O, Zaremba W (2014) Addressing the rare word problem in neural machine translation. arXiv preprint arXiv:1410.8206

Luong MT, Pham H, Manning CD (2015) Effective Approaches to Attention-based Neural Machine Translation. In: Proceedings of the 2015 conference on Empirical Methods in Natural Language Processing, pp 1412–1421

Ma S, Sun X, Wang Y, Lin J (2018) Bag-of-words as target for neural machine translation. arXiv preprint arXiv:1805.04871

Marcus MP, Marcinkiewicz MA, Santorini B (1993) Building a large annotated corpus of English: the penn treebank. Comput Linguist 19(2):313–330

Mehandru N, Robertson S, Salehi N (2022) Reliable and Safe Use of Machine Translation in Medical Settings. In: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (Seoul, South Korea)(FAccT’22). Association for Computing Machinery, New York, NY, USA

Meng F, Lu Z, Li H, Liu Q (2016) Interactive attention for neural machine translation. arXiv preprint arXiv:1610.05011

Moon TK (1996) The expectation-maximization algorithm. IEEE Signal Process Mag 13(6):47–60

Moon J, Cho H, Park EL (2020) Revisiting round-trip translation for quality estimation. arXiv preprint arXiv:2004.13937

Moorkens J (2022) Ethics and machine translation. Machine translation for everyone: empowering users in the age of artificial intelligence 18:121

Moussallem D, Ngonga Ngomo AC, Buitelaar P, Arcan M (2019) Utilizing knowledge graphs for neural machine translation augmentation. In: Proceedings of the 10th international conference on knowledge capture, pp 139–146

Mueller V (2022) An introduction to synchronous grammars. Tutorial https://medium.com/towards-data-science/attention-please-85bd0abac41 . Accessed 05 Apr 2022

Nießen S, Ney H (2000) Improving SMT quality with morpho-syntactic analysis. In: COLING 2000 Volume 2: the 18th international conference on computational linguistics

Nyberg EH, Mitamura T (1992) The KANT system: Fast, accurate, high-quality translation in practical domains. In: Proceedings of the 14th conference on Computational linguistics-Volume 3, Association for Computational Linguistics, pp 1069–1073

Och FJ, Ney H (2003) A systematic comparison of various statistical alignment models. Comput Linguist 29(1):19–51

Och FJ, Tillmann C, Ney H (1999) Improved alignment models for statistical machine translation. In: 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora

Och FJ, Ueffing N, Ney H (2001) An efficient A* search algorithm for statistical machine translation. In: Proceedings of the workshop on Data-driven methods in machine translation-Volume 14, Association for Computational Linguistics, pp 1–8

Och FJ, Gildea D, Khudanpur S, Sarkar A, Yamada K, Fraser A, Kumar S, Shen L, Smith D, Eng K et al (2003) Syntax for statistical machine translation. In: Johns Hopkins University 2003 Summer Workshop on Language Engineering, Center for Language and Speech Processing, Baltimore, MD, Tech. Rep

Of Standards NNI, Technology) (2010) Open Machine Translation Evaluation. https://www.nist.gov/itl/iad/mig/open-machine-translation-evaluation . Accessed 08 Apr 2022

Ortega JE, Castro Mamani R, Cho K (2020) Neural machine translation with a polysynthetic low resource language. Mach Trans 34(4):325–346

Pal SK, Mitra S (1992) Multilayer perceptron, fuzzy sets, and classification. IEEE Trans Neural Netw 3(5):683–697

Papadimitriou CH (2003) Computational complexity. Wiley, New York

Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on association for computational linguistics, Association for Computational Linguistics, pp 311–318

ParaCrwal (2022) ParaCrawl: Web-scale parallel corpora for the languages of the EU. https://paracrawl.eu/ . Accessed 30 Sep 2022

Pierce J (1966) Language and machines: computers in translation and linguistics; a Report. National Research Council

Popović M (2017) chrF++: words helping character n-grams. In: Proceedings of the second conference on machine translation, pp 612–618

Prates MO, Avelar PH, Lamb LC (2020) Assessing gender bias in machine translation: a case study with google translate. Neural Comput Appl 32(10):6363–6381

Pryzant R, Chung Y, Jurafsky D, Britz D (2018) JESC: Japanese-English Subtitle Corpus. In: Proceedings of the eleventh International Conference on Language Resources and Evaluation (LREC 2018)

Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M, Zhou Y, Li W, Liu PJ (2020) Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res 21:1–67

Ramnath S, Johnson M, Gupta A, Raghuveer A (2021) HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints. In: EMNLP 2021

Ranathunga S, Lee ESA, Skenduli MP, Shekhar R, Alam M, Kaur R (2021) Neural machine translation for low-resource languages: a survey. arXiv preprint arXiv:2106.15115

Ravikumar D, Kodge S, Garg I, Roy K (2020) Exploring Vicinal Risk Minimization for Lightweight Out-of-Distribution Detection. arXiv preprint arXiv:2012.08398

Rebele T, Suchanek F, Hoffart J, Biega J, Kuzey E, Weikum G (2016) YAGO: A multilingual knowledge base from wikipedia, wordnet, and geonames. In: International semantic web conference, Springer, pp 177–185

Rescigno AA, Vanmassenhove E, Monti J, Way A (2020) A case study of natural gender phenomena in translation. A comparison of Google Translate, Bing Microsoft Translator and DeepL for English to Italian, French and Spanish. In: CLiC-it

Richards C, Bouman WP, Seal L, Barker MJ, Nieder TO, T’Sjoen G (2016) Non-binary or genderqueer genders. Int Rev Psychiatry 28(1):95–102

Ringler D, Paulheim H (2017) One knowledge graph to rule them all? Analyzing the differences between DBpedia, YAGO, Wikidata & co. In: Joint German/Austrian Conference on Artificial Intelligence (Künstliche Intelligenz), Springer, pp 366–372

Rocktäschel T, Grefenstette E, Hermann KM, Kočiskỳ T, Blunsom P (2015) Reasoning about entailment with neural attention. arXiv preprint arXiv:1509.06664

Rothman D (2021) Transformers for Natural Language Processing: build innovative deep neural network architectures for NLP with Python, PyTorch, TensorFlow, BERT, RoBERTa, and more. Packt Publishing Ltd

Sakamoto A (2019) Unintended consequences of translation technologies: from project managers’ perspectives. Perspectives 27(1):58–73

Sara Stymne GT (2022) Phrase Based Machine Translation. Tutorial https://cl.lingfil.uu.se/kurs/MT19/slides/pbsmt.pdf . Accessed 05 Apr 2022

Saunders D, Sallis R, Byrne B (2020) Neural machine translation doesn’t translate gender coreference right unless you make it. In: Proceedings of the second workshop on gender bias in natural language processing, pp 35–43

Savoldi B, Gaido M, Bentivogli L, Negri M, Turchi M (2021) Gender bias in machine translation. Trans Assoc Comput Linguist 9:845–874

Schiebinger L (2014) Scientific research must take gender into account. Nature 507(7490):9–9

Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681

Schwenk H, Chaudhary V, Sun S, Gong H, Guzmán F (2019) Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv preprint arXiv:1907.05791

Sennrich R, Haddow B, Birch A (2015) Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709

Sennrich R, Haddow B, Birch A (2015) Neural machine translation of rare words with subword units. arXiv preprint arXiv:1508.07909

Shannon CE, Weaver W (1949) The mathematical theory of information. University of Illinois Press

Sharma S, Sharma S, Athaiya A (2017) Activation functions in neural networks towards data science 6(12):310–316

Shazeer N, Mirhoseini A, Maziarz K, Davis A, Le Q, Hinton G, Dean J (2017) Outrageously large neural networks: the sparsely-gated mixture-of-experts layer. arXiv preprint arXiv:1701.06538

Shieber SM, Schabes Y (1990) Synchronous tree-adjoining grammars. In: Proceedings of the 13th conference on Computational linguistics-Volume 3, pp 253–258

Shieber SM, Schabes Y (1991) Generation and synchronous tree-adjoining grammars. Comput Intell 7(4):220–228

Singh SP, Kumar A, Darbari H, Singh L, Rastogi A, Jain S (2017) Machine translation using deep learning: An overview. In: 2017 international conference on computer, communications and electronics (comptelix), IEEE, pp 162–167

Snover M, Dorr B, Schwartz R, Micciulla L, Makhoul J (2006) A study of translation edit rate with targeted human annotation. In: Proceedings of association for machine translation in the Americas, vol 200

Somers H (2005) Round-trip translation: What is it good for? Proc Austral Lang Technol Workshop 2005:127–133

Stanovsky G, Smith NA, Zettlemoyer L (2019) Evaluating gender bias in machine translation. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Florence, Italy, pp 1679–1684, https://doi.org/10.18653/v1/P19-1164 , https://aclanthology.org/P19-1164

Stasimioti M, Sosoni V, Kermanidis KL, Mouratidis D (2020) Machine Translation Quality: a comparative evaluation of SMT, NMT and tailored-NMT outputs. In: Proceedings of the 22nd annual conference of the European Association for Machine Translation, pp 441–450

Staudemeyer RC, Morris ER (2019) Understanding LSTM–a tutorial into long short-term memory recurrent neural networks. arXiv preprint arXiv:1909.09586

Steinberger R, Pouliquen B, Widiger A, Ignat C, Erjavec T, Tufis D, Varga D (2006) The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. arXiv preprint arXiv:cs/0609058

Strubell E, Ganesh A, McCallum A (2019) Energy and Policy Considerations for Deep Learning in NLP. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 3645–3650

Sun T, Gaut A, Tang S, Huang Y, ElSherief M, Zhao J, Mirza D, Belding E, Chang KW, Wang WY (2019) Mitigating gender bias in natural language processing: literature review. In: Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pp 1630–1640

Sun T, Shah A, Webster K, Johnson M (eds) (2021) They, them, theirs: rewriting with gender-neutral English, arXiv:2102.06788

Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Advances in neural information processing systems, pp 3104–3112

Tixier AJP (2018) Notes on deep learning for nlp. arXiv preprint arXiv:1808.09772

Toma P (1977) Systran as a multilingual machine translation system. In: Proceedings of the Third European Congress on Information Systems and Networks, overcoming the language barrier, pp 569–581

Tomalin M, Byrne B, Concannon S, Saunders D, Ullmann S (2021) The practical ethics of bias reduction in machine translation: why domain adaptation is better than data debiasing. Ethics Inf Technol 23(3):419–433

Ullmann S (2022) Gender bias in machine translation systems. In: Artificial Intelligence and Its Discontents, Springer, pp 123–144

United Nations Parallel Corpus (2022) https://conferences.unite.un.org/uncorpus . Accessed 30 Sep 2022

van Wynsberghe A (2021) Sustainable AI: AI for sustainability and the sustainability of AI. AI Ethics (2021). https://doi.org/10.1007/s43681-021-00043-6

Vandeghinste V, Martens S, Kotzé G, Tiedemann J, Van den Bogaert J, De Smet K, Van Eynde F, Van Noord G (2013) Parse and corpus-based machine translation. Essential Speech and Language Technology for Dutch. Springer, Berlin, pp 305–319

Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30:1

Vieira LN, O’Hagan M, O’Sullivan C (2021) Understanding the societal impacts of machine translation: a critical review of the literature on medical and legal use cases. Inf Commun Soc 24(11):1515–1532

Wahler ME (2018) A word is worth a thousand words: legal implications of relying on machine translation technology. Stetson L Rev 48:109

Wang S (2005) Computers and translation: a translator’s guide. Language 81(2):544–545

Wang W, Pan SJ, Dahlmeier D, Xiao X (2017) Coupled multi-layer attentions for co-extraction of aspect and opinion terms. In: Thirty-First AAAI Conference on Artificial Intelligence

Weidinger L, Mellor J, Rauh M, Griffin C, Uesato J, Huang PS, Cheng M, Glaese M, Balle B, Kasirzadeh A et al (2021) Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359

White JS (1995) Approaches to black box MT evaluation. Citeseer, proceedings of machine translation summit V, vol 10

Wiki (2019) Automatic Language Processing Advisory Committee. https://en.wikipedia.org/wiki/ALPAC . Accessed 08 Apr 2022

Wiki (2019) Long-short term memory. https://en.wikipedia.org/wiki/Long_short-term_memory . Accessed 08 Apr 2022

Wikimatrix (2022) Mining 135m parallel sentences in 1620 language pairs from wikipedia. https://github.com/facebookresearch/LASER/tree/main/tasks/WikiMatrix . Accessed 30 Sep 2022

Wong YW, Mooney R (2007) Learning synchronous grammars for semantic parsing with lambda calculus. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp 960–967

Wu D (1997) Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Comput Linguist 23(3):377–403

Wu H, Wang H (2007) Pivot language approach for phrase-based statistical machine translation. Mach Trans 21(3):165–181

Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, Krikun M, Cao Y, Gao Q, Macherey K et al (2016) Google’s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144

Xu K, Ba J, Kiros R, Cho K, Courville A, Salakhudinov R, Zemel R, Bengio Y (2015) Show, attend and tell: Neural image caption generation with visual attention. In: International conference on machine learning, pp 2048–2057

Yamada M (2019) The impact of Google Neural Machine Translation on Post-editing by student translators. J Special Trans 31:87–106

Yamada K, Knight K (2001) A syntax-based statistical translation model. In: Proceedings of the 39th annual meeting of the Association for Computational Linguistics, pp 523–530

Yang Z, Yang D, Dyer C, He X, Smola A, Hovy E (2016) Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 1480–1489

Yang B, Wang L, Wong DF, Chao LS, Tu Z (2019) Convolutional Self-Attention Networks. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp 4040–4045

Yun Huang QL Yang Liu (2008) Introduction on machine translation evaluation

Zakir HM, Nagoor MS (2017) A brief study of challenges in machine translation. Int J Comput Sci Issues 14(2):54

Zhang D, Li M, Li CH, Zhou M (2007) Phrase reordering model integrating syntactic knowledge for SMT. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp 533–540

Zhao Y, Xiang L, Zhu J, Zhang J, Zhou Y, Zong C (2020) Knowledge graph enhanced neural machine translation via multi-task learning on sub-entity granularity. In: Proceedings of the 28th International Conference on Computational Linguistics, pp 4495–4505

Zhao Y, Zhang J, Zhou Y, Zong C (2020) Knowledge graphs enhanced neural machine translation. In: IJCAI, pp 4039–4045

Zhou J, Cao Y, Wang X, Li P, Xu W (2016) Deep recurrent models with fast-forward connections for neural machine translation. Trans Assoc Comput Linguist 4:371–383

Zhou W, Ge T, Mu C, Xu K, Wei F, Zhou M (2019) Improving grammatical error correction with machine translation pairs. arXiv preprint arXiv:1911.02825

Ziemski M, Junczys-Dowmunt M, Pouliquen B (2016) The united nations parallel corpus v1. 0. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), pp 3530–3534

Zollmann A, Venugopal A, Och FJ, Ponte J (2008) A Systematic Comparison of Phrase-Based, Hierarchical and Syntax-Augmented Statistical MT. In: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pp 1145–1152

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their quality reviews and suggestions. This work was supported in part by The Science and Technology Development Fund of Macao, Macao SAR, China under Grant 0033/2022/ITP and in part by The Faculty Research Grant Projects of Macau University of Science and Technology, Macao SAR, China under Grant FRG-22-020-FI.

Author information

Authors and affiliations.

School of Computer Science and Engineering, Macau University of Science and Technology, Taipa, 999078, Macao, China

Subrota Kumar Mondal, Haoxi Zhang & Kan Ni

Deakin University, Geelong, Australia

H. M. Dipu Kabir

The Department of Computer Science, Hong Kong Baptist University, Hong Kong, China

Hong-Ning Dai

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Subrota Kumar Mondal .

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Mondal, S.K., Zhang, H., Kabir, H.M.D. et al. Machine translation and its evaluation: a study. Artif Intell Rev 56 , 10137–10226 (2023). https://doi.org/10.1007/s10462-023-10423-5

Download citation

Published : 19 February 2023

Issue Date : September 2023

DOI : https://doi.org/10.1007/s10462-023-10423-5

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Computational linguistics
  • Evaluation methods
  • Find a journal
  • Publish with us
  • Track your research

A Survey on the Application of Online Machine Translation in Thesis Writing by English Majors

New citation alert added.

This alert has been successfully added and will be sent to:

You will be notified whenever a record that you have chosen has been cited.

To manage your alert preferences, click on the button below.

New Citation Alert!

Please log in to your account

Information & Contributors

Bibliometrics & citations, recommendations, english majors’ online learning technology needs in china.

With the development of online education, deficiencies of online learning technology affect students’ learning effect greatly. For detecting more technical problems, aid in the progress of technology and promote the development of online English ...

Dependency-Based Chinese-English Statistical Machine Translation

We present a Chinese-English Statistical Machine Translation (SMT) system based on dependency tree mappings. We use a state-of-the-art dependency parser to parse the English translation of the Penn Chinese Treebank to make it bilingual and then learn a ...

Chinese-Japanese Machine Translation Exploiting Chinese Characters

The Chinese and Japanese languages share Chinese characters. Since the Chinese characters in Japanese originated from ancient China, many common Chinese characters exist between these two languages. Since Chinese characters contain significant semantic ...

Information

Published in.

cover image ACM Other conferences

Association for Computing Machinery

New York, NY, United States

Publication History

Permissions, check for updates, author tags.

  • English majors
  • online machine translation
  • thesis writing
  • Research-article
  • Refereed limited

Contributors

Other metrics, bibliometrics, article metrics.

  • 0 Total Citations
  • 0 Total Downloads
  • Downloads (Last 12 months) 0
  • Downloads (Last 6 weeks) 0

View Options

Login options.

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

View options.

View or Download as a PDF file.

View online with eReader .

HTML Format

View this article in HTML Format.

Share this Publication link

Copying failed.

Share on social media

Affiliations, export citations.

  • Please download or close your previous search result export first before starting a new bulk export. Preview is not available. By clicking download, a status dialog will open to start the export process. The process may take a few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress. Download
  • Download citation
  • Copy citation

We are preparing your search results for download ...

We will inform you here when the file is ready.

Your file of search results citations is now ready.

Your search export query has expired. Please try again.

  • Bibliography
  • More Referencing guides Blog Automated transliteration Relevant bibliographies by topics
  • Automated transliteration
  • Relevant bibliographies by topics
  • Referencing guides

The Impacts and Challenges of Artificial Intelligence Translation Tool on Translation Professionals

  • SHS Web of Conferences 163
  • This person is not on ResearchGate, or hasn't claimed this research yet.

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations

Baleigh Qassim Al-Wasy

  • Khaled Abkar Alkodimi

Osama Osama Abdulrhman Alqahtani

  • Natalina Asi

Akhmad Fauzan

  • Naftali Vanesa

Hamza Fakhar

  • Mohamed Bashir

Muawia Abdelmagid Elsadig

  • Nataliia Yuhan
  • Yuliia Herasymenko
  • Oleksandra Deichakivska
  • Yevhen Kozlov

Ahmed Moneus

  • Awadh Nasser Munassar Awadh
  • Fırat SOYSAL
  • MATH PROBL ENG
  • Lingmei Cao
  • Shuxian Zhu

Robert French

  • Jiayuan Zhang
  • M S Rosemary
  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up

Navigation Menu

Search code, repositories, users, issues, pull requests..., provide feedback.

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly.

To see all available qualifiers, see our documentation .

  • Notifications You must be signed in to change notification settings

Thang Luong's thesis on Neural Machine Translation

lmthang/thesis

Folders and files.

NameName
66 Commits

Repository files navigation

Thang luong's thesis on neural machine translation.

This repository contains the latest version of my thesis .

Motivated by my advisor (Chris Manning)'s suggestion on extreme openness , I started sharing my thesis writing since day 1. While it is not yet clear how useful it is (smile), you can at least find all of the edits made by my advisor, page by page here . Thanks Chris!

Code, data, and models described in this thesis can be found at our Stanford NMT Project Page .

To save your reading time, let me highlight sections that I have put effort in writing beside my published papers:

  • Chapter 7 - Conclusion : this will give you a big picture of what I have achieved in my dissertation and how it influences subsequent work.
  • Chapter 6 - NMT future (especially section 6.3 - Future Outlook ): this is where I highlight potential research directions and speculate on the future.

Additionally, one can also read:

  • Chapter 2 - Background : this section, I hope, will be useful for any reader who wants to implement NMT by hand like me where I give details on things like derivations for LSTM gradients.
  • Chapter 1 - Introduction : I hope to enlighten readers with the history of MT from the 17th century until now.

Acknowledgments

Thanks Gabor Angeli for sharing the thesis setup!

  • PostScript 97.0%

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • CORRESPONDENCE
  • 25 June 2024

AI machine translation tools must be taught cultural differences too

  • Helene Tenzer   ORCID: http://orcid.org/0000-0001-8760-7777 0 ,
  • Stefan Feuerriegel 1 &
  • Rebecca Piekkari   ORCID: http://orcid.org/0000-0002-4026-5850 2

LMU Munich, Munich, Germany.

You can also search for this author in PubMed   Google Scholar

Aalto University, Espoo, Finland

The ‘No Language Left Behind’ project has scaled machine translation to 200 of the world’s 7,000 or so languages ( Nature https://doi.org/m348; 2024 ). But to successfully preserve or revitalize minority languages, the scope of large-language-model (LLM) training needs to be broadened.

Access options

Access Nature and 54 other Nature Portfolio journals

Get Nature+, our best-value online-access subscription

24,99 € / 30 days

cancel any time

Subscribe to this journal

Receive 51 print issues and online access

185,98 € per year

only 3,65 € per issue

Rent or buy this article

Prices vary by article type

Prices may be subject to local taxes which are calculated during checkout

Nature 630 , 820 (2024)

doi: https://doi.org/10.1038/d41586-024-02091-4

Competing Interests

The authors declare no competing interests.

Related Articles

See more letters to the editor

  • Machine learning

Physics solves a training problem for artificial neural networks

Physics solves a training problem for artificial neural networks

News & Views 07 AUG 24

These AI firms publish the world’s most highly cited work

These AI firms publish the world’s most highly cited work

News 01 AUG 24

Cheap light sources could make AI more energy efficient

Cheap light sources could make AI more energy efficient

News & Views 31 JUL 24

A publishing platform that places code front and centre

A publishing platform that places code front and centre

Technology Feature 07 AUG 24

China’s robotaxis need regulation

Correspondence 06 AUG 24

Quantum computing aims for diversity, one qubit at a time

Quantum computing aims for diversity, one qubit at a time

Technology Feature 05 AUG 24

Ultra-detailed brain map shows neurons that encode words’ meaning

Ultra-detailed brain map shows neurons that encode words’ meaning

News 03 JUL 24

Semantic encoding during language comprehension at single-cell resolution

Semantic encoding during language comprehension at single-cell resolution

Article 03 JUL 24

Language is primarily a tool for communication rather than thought

Language is primarily a tool for communication rather than thought

Perspective 19 JUN 24

Recruitment of Talent Positions at Shengjing Hospital of China Medical University

Call for top experts and scholars in the field of science and technology.

Shenyang, Liaoning, China

Shengjing Hospital of China Medical University

thesis on machine translation

The Recruitment of Fuyao University of Science and Technology

This recruitment of Fuyao University Technologyof Science andUcovers 7 departments including the 6 Schools and the Faculty of Fundamental Disciplines.

Fuzhou, Fujian (CN)

Fuzhou FuYao Institute for Advanced Study

thesis on machine translation

Educational Consultant

You will build and maintain strong relationships with local representatives, key distributors, schools, Ministries of Education, etc.

Riyadh - hybrid working model

Springer Nature Ltd

thesis on machine translation

Senior Marketing Manager – Journal Awareness

Job Title: Senior Marketing Manager – Journal Awareness Location(s): London, UK - Hybrid Working Model Closing date: 25th August 2024             A...

London (Central), London (Greater) (GB)

thesis on machine translation

Faculty Positions& Postdoctoral Research Fellow, School of Optical and Electronic Information, HUST

Job Opportunities: Leading talents, young talents, overseas outstanding young scholars, postdoctoral researchers.

Wuhan, Hubei, China

School of Optical and Electronic Information, Huazhong University of Science and Technology

thesis on machine translation

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies
  • Share full article

Candice Miller, in a white summer dress, poses for a portrait with her husband, Brandon Miller, in a light blue shirt. Both are holding wine glasses.

How an Instagram-Perfect Life in the Hamptons Ended in Tragedy

Candice and Brandon Miller showed the public a world of glittering parties and vacations. The money to sustain it did not exist.

Candice and Brandon Miller. In photographs shared online, their lives were full of parties and luxurious vacations. Credit... Joe Schildhorn/BFA.com, via Shutterstock

Supported by

Katherine Rosman

By Katherine Rosman

  • Published Aug. 8, 2024 Updated Aug. 9, 2024

In the modern Gilded Age of New York, where Instagram is awash in unrestrained displays of wealth, Brandon and Candice Miller were royalty.

At their 10th wedding anniversary “Midsummer Night’s Dream” party, they celebrated with a few dozen friends in the backyard of their 5,500-square-foot vacation home in the Hamptons.

Beautiful women in gowns watched with their handsome husbands as the couple renewed their vows near a swimming pool strewn with peonies and rose petals beneath a canopy of lights.

It was a grand public display of their perfect life and marriage. Ms. Miller told a lifestyle blogger who wrote about the party that her husband’s speech “made me cry by the end with his authentic, raw emotion and romantic words.”

It all culminated in the kind of envy-inducing images anticipated by the roughly 80,000 followers of “Mama and Tata,” Ms. Miller’s popular Instagram feed, which featured a near-constant stream of photographs and videos of her glittering life.

The Midsummer Night party was in 2019. Five years later, the glamorous image that Ms. Miller cultivated and promoted has disappeared, replaced with heartbreak, anger and a mountain of once-secret debt.

We are having trouble retrieving the article content.

Please enable JavaScript in your browser settings.

Thank you for your patience while we verify access. If you are in Reader mode please exit and  log into  your Times account, or  subscribe  for all of The Times.

Thank you for your patience while we verify access.

Already a subscriber?  Log in .

Want all of The Times?  Subscribe .

Advertisement

  • Departments & Programs
  • Majors & Minors
  • Fields of Study

New 2024-2025 faculty

Angie bautista-chavez      .

Assistant Professor, Department of Political Science 

As a scholar of American politics, Angie Bautista-Chavez examines the politics of migration, borders, bureaucracy, race, and citizenship. She is interested in the dynamic interplay between states and racialized migrants—at one level, how Latinx immigrants are regulated by and contend with the American state, and at another level, how the United States has expanded its regulatory reach beyond its borders. She is committed to creating more inclusive systems of knowledge production, as indicated by her research, teaching, and mentorship record. 

Maria Betto  

Assistant Professor, Department of Economics 

Maria Betto’s research interests are in microeconomic theory, with a particular focus on topics related to information economics and decision theory. 

Gira Bhabha     

Associate Research Professor, Department of Biology 

Gira Bhabha’s research focuses on the structural mechanisms and cell biology of microbes and their interactions with hosts. The lab uses integrative approaches including X-ray crystallography, cryo-electron microscopy, cryo-electron tomography, optical microscopy, biochemistry, microbiology, and cell biology techniques, to uncover the unique biology of eukaryotic and prokaryotic pathogens. 

Alessandra Corsi

Research Professor, William H. Miller III Department of Physics and Astronomy 

Alessandra Corsi’s research focuses on multi-messenger time-domain astronomy, with emphasis on relativistic radio transients and gravitational wave physics. She has vast experience with observations of astrophysical transients and follow-up of gravitational wave (GW) events and works on detection algorithms for GW data.  

Ross Doppelt

Senior Lecturer and Program Coordinator, Applied Economics (Advanced Academic Programs) 

Ross Doppelt teaches in the applied economics program, based out of the Washington, D.C., campus. As a macroeconomist, his primary research interests are labor markets and time series, and he has experience teaching classes in macro, micro, and econometrics. Doppelt has a PhD from New York University and a BA from the University of Chicago, both in economics. 

Damian Ekiert

Damian Ekiert is a microbiologist and biochemist whose research focuses on bacterial and parasite cell biology. He studies fundamental cellular and molecular processes ranging from lipid transport and virulence factors in bacteria, to invasion mechanisms in microsporidian parasites and the evolution of a minimal eukaryotic cell. 

Brahim El Guabli

Brahim El Guabli  

Associate Professor, Department of Comparative Thought and Literature 

Brahim El Guabli is an interdisciplinary scholar who teaches a variety of topics in Tamazghan and Middle Eastern literatures from the lens of environmentalism, indigeneity, and multilingualism. His work engages issues of race and racism, deserts and Saharanism, joint authorship practices, translation and transitional justice, Jews in Amazigh and Arabic cultural production, and revitalization of Indigenous cultures. 

Harris Feinsod

Harris Feinsod

Research Professor, Department of English 

Harris Feinsod is a literary and cultural historian of the United States, Latin America, and the Atlantic world. He is the author of a literary history, The Poetry of the Americas: from Good Neighbors to Countercultures , and the co-translator of Oliverio Girondo’s Decals: Complete Early Poems . His recent academic essays and public writings focus especially on port cities, and on maritime labor, environment, and culture under conditions of globalization. 

Kandyce Fernandez

Kandyce Fernandez   

Senior Lecturer, Non-Profit Management (Advanced Academic Programs) 

Kandyce Fernandez’s expertise in the nonprofit sector includes philanthropy and foundations, advocacy and the policy process, civic health and engagement and program evaluation. Her research addresses the interactions that take place among communities and organizations across sectors, with a focus on civic health in the nonprofit sector. 

Lauren Fusilier

Senior Lecturer, University Writing Program  

Lauren Fusilier’s research interests include digital and multimodal pedagogy, the intersection of digital literacies and new media campus resources, and issues of institutional equity and accessibility, especially as they impact non-dominant student populations.  

Andrew Gallup

Andrew Gallup 

Teaching Professor, Department of Psychological and Brain Sciences 

Andrew Gallup’s research areas include the evolution and functional significance of yawning, sports and athletics in evolutionary perspective, and threat detection and group vigilance. He teaches courses in behavioral biology and provides advising and research mentoring to undergraduate students. 

Maia Gil’Adí  

Assistant Professor, Department of Modern Languages and Literatures 

Maia Gil’Adí’s work takes up speculative fiction as a site for theorizing Latinx identity across national and ethnic borders and shows the vital role of historical trauma in its formation. Arguing against reparative modes of reading, she analyzes how literary portrayals of violence, destruction, and pain paradoxically elicit pleasurable affective and aesthetic experiences. Examining novels by established Latinx authors such as Junot Díaz and Cristina García as well as multiethnic writers such as Colson Whitehead and Sesshu Foster, Gil’Adí challenges definitions of what constitutes Latinx literature and notions of the speculative by dismantling generic boundaries and entrenched definitions of race, ethnicity, and nationhood. 

Nune Grigoryan

Senior Lecturer and Program Coordinator, Communication (Advanced Academic Programs) 

Nune Grigoryan’s research is focused on the intersection of media and democracy. Her scholarly work includes studies of media ethics and digital media use for political participation, as well as social media use for political dissent and political campaigns. 

Christopher Grobe

Associate Research Professor, Department of English 

Christopher Grobe specializes in the entanglement of literature, performance, media, and technology in U.S. culture. Understanding “performance” as both a mode of artistic practice and a source of social knowledge, he has studied such topics as the history of “confessional performance” in American culture, the theatricality of U.S. electoral politics, and the role artists play in constructing new technologies (from the telegraph to AI) and in determining their cultural significance.     

David Guggenheim

Lecturer, Program Coordinator, Academic Advisor, Environmental Programs (Advanced Academic Programs) 

David Guggenheim is a marine scientist, conservation policy specialist, educator, award-winning author, ocean explorer and manned submersible pilot. He has worked in Cuba for more than 20 years, leading collaborative education, research and conservation efforts focused on coral reef ecosystems. 

John Hale

Research Professor, Department of Cognitive Science 

John Hale’s expertise lies in computational linguistics. His research centers on language comprehension, asking questions such as how are we able to understand one another, just by hearing a sequence of words? He uses cognitive modeling, analyzing the human mind/brain via computer simulation.  

Simon D. Halliday

  • Associate Research Professor and Associate Director at the Center for Economy and Society, SNF Agora Institute 
  • Faculty affiliate of the Center for Advanced Study in the Behavioral Sciences, Stanford University (2023-2025) 

Simon Halliday conducts research in behavioral and experimental economics, with a focus on social preferences (reciprocity and social norms) and institutions (ratings, punishment, communication). He also does economics education, and he has co-authored (with Samuel Bowles) an intermediate-level microeconomics textbook: Microeconomics: Competition, Conflict, and Coordination (OUP, 2022). Halliday is also the co-leader of the enCOREage Project. In a new textbook for introductory economics students, Understanding the Economy , the enCOREage team will introduce content that draws students in because it addresses societal problems that students care about while building employability skills, belonging, and inclusion into the curriculum. 

Yuan He

Research Professor, Departments of Biophysics and Biology 

Yuan He is interested in understanding the molecular mechanisms by which large, multi-subunit complexes engage in DNA-centric processes. His research uses cryo-electron microscopy and other biophysical and biochemical approaches.  

William Huang

William Y. C. Huang

Assistant Professor, Department of Biophysics  

William Huang is broadly interested in biochemical reactions at the cell membrane, especially those involved in signal transduction. The research combines optical methods and kinetic modeling to analyze biochemically reconstituted systems and living cells. The integrated approach has enabled the invention of extensive imaging-based membrane assays, oftentimes revealing unexpected dynamic characteristics unique to membrane signaling configurations. 

Paul Johnson  

Associate Research Professor, Department of Modern Languages and Literatures 

A specialist in 16th- and 17th-century Spanish literature, Paul Johnson draws on the history of emotion, the senses, the body, and performance in his research. His writing on race, gender, translation, and popular culture also places pre-modern Iberia into conversation with urgent contemporary debates, while crossing borders to encompass the Global Hispanophone and larger early modern world. 

Sujung Kim

Sujung Kim  

Associate Professor, Department of Anthropology 

Sujung Kim’s research focuses on the transnational interactions of Buddhist practices in East Asia by engaging a variety of networks that connect people, places, and praxis in the Buddhist world. After her first monograph, Shinra Myojin and Buddhist Networks of the East Asian “Mediterranean” (University of Hawaii Press, 2019), she is currently working on her second monograph, Korean Magical Medicine: Buddhist Healing Talismans in Choson Korea (under advance contract with the University of Hawai’i Press). 

Chris Kromphardt

Chris Kromphardt  

Lecturer, Assistant Program Director, Data Analytics and Policy (Advanced Academic Programs) 

Chris Kromphardt’s research focuses on how citizens view the performance and legitimacy of judicial institutions, how judicial institutions make decisions, and the design and evaluation of experiential learning activities. He has taught courses on public policy, research methods and research design, constitutional law, and American politics. 

Jason Ludden 

Jason Ludden’s research interests include rhetoric of science, environmental communication, composition pedagogy, and creative writing. His work focuses on the role of experts in public policy discourse, and he has examined how forest management discourse shapes public perception of environmental issues.  

Diego Luis

Diego Luis     

Rohrbaugh Family Assistant Professor, Department of History 

Diego Luis is a historian of Latin America specializing in the connections between Mexico and the Philippines during the Manila galleon period (1565-1815). His work focuses on the global scope of the early modern Spanish empire by examining the movement of people across the Pacific Ocean and how that movement transformed societies at both the eastern and western termini of the galleon trade. 

Julie K. Lundquist    

Bloomberg Distinguished Professor of Atmospheric Science and Wind Energy, Departments of Earth and Planetary Sciences and Mechanical Engineering (Whiting School of Engineering) 

Julie Lundquist uses observational and computational approaches to understand the atmospheric boundary layer, with an emphasis on atmosphere-wind energy interactions. Her research engages in both the atmospheric influences on wind energy production as well as the atmospheric consequences of wind energy deployment.

Vikash Morar

Lecturer, Biotechnology (Advanced Academic Programs) 

Vikash Morar has a background in research in clinical diagnostics, cellular neuroscience, and machine learning. His interest in learning as an abstraction brought him toward pedagogy, where he has extensively trained and worked to improve student outcomes in the classroom.

Marcelo Nogueira  

Assistant Professor of Spanish and Portuguese, Department of Modern Languages and Literatures

Marcelo Nogueira’s research focuses on modern and contemporary Latin American poetry, popular music, and visual culture, with a special emphasis on Brazil. Drawing on literary history, media theory, sound studies, and ethnomusicology, he investigates the Latin American avant-gardes, Brazilian modernism, concrete poetry, and the art of songwriting. He earned his PhD in Romance Studies from Duke University in 2022. 

Danielle Norcini

Danielle Norcini  

Assistant Professor, William H. Miller III Department of Physics and Astronomy 

Danielle Norcini’s research interests are in particle physics and cosmology. Her main focus is building detectors to discover dark matter and measure neutrinos. Currently, her group is developing single-electron sensors called skipper CCDs to make precision measurements of particle interactions at very low energy thresholds.

Grace Panetti   

Assistant Professor, Department of Chemistry 

Grace Panetti investigates challenges at the interface of photochemistry and inorganic chemistry. She uses inorganic synthesis coupled with photophysical techniques like transient absorption spectroscopy to investigate these new systems.

Mladen Petkov

Mladen Petkov     

Lecturer, Communication (Advanced Academic Programs) 

Mladen Petkov is interested in journalistic roles and practices, media freedom, disinformation, and Artificial Intelligence. He has several years of newsroom experience in the United States and Bulgaria. 

Allison Pugh

Allison Pugh  

Research Professor, Department of Sociology 

Allison Pugh’s research speaks to central concerns in the sociology of gender, investigating how people forge connections and find meaning and dignity at work and at home. Her work uses qualitative methods to investigate how gender, race, and class inequalities affect the way people negotiate dignity and connection amidst socioeconomic trends such as rationalization, insecurity, and commodification.

Lakshmi Rajkumar

Lakshmi Rajkumar   

Senior Lecturer, Program Coordinator, Lab Manager, Biotechnology (Advanced Academic Programs) 

Lakshmi Rajkumar is an alumni of Johns Hopkins University’s M.S. in Biotechnology program specializing in microbiology. She has contributed to cancer research at the University of Maryland’s Translational Core Lab and has worked in the industry as a food microbiologist.

Margaret Renwick    

Associate Research Professor, Department of Cognitive Science 

Margaret Renwick’s research incorporates variation into models of spoken language to answer questions about the nature of phonological contrast, the origins and realization of phonological patterns, regional accents of U.S. English, and linguistic change across generations.

Mona Khadem Sameni 

Senior Lecturer and Program Coordinator, Applied Economics (Advanced Academic Programs)  

Mona Khadem Sameni’s research focuses on health economics, health policy, healthcare systems, and labor economics. She has done studies on global disparities in the impacts of the COVID-19 pandemic on healthcare systems, the application of AI in healthcare worldwide, and the regulation of AI. 

Hale Sirin 

  • Assistant Research Professor, Alexander Grass Humanities Institute – Center for Digital Humanities 
  • Affiliated faculty, Center for Languages and Speech Processing

Hale Sirin’s research and teaching bring together computational and critical methods to explore questions about narrative, language, and translation across historic and modern languages. She teaches courses on computational methods in the humanities and narrative theory. Sirin’s work has appeared in numerous venues including the Alliance of Digital Humanities Organizations and has been published in technical conferences including the Association for Computational Linguistics.

Sarah Sowden  

Lecturer, Computational Biology (Advanced Academic Programs) 

Sarah Sowden has expertise in data science, research administration, and the ethical and legal implications of data practices. She lectures in bioinformatics and individualized genomics and health.  

Frederick Tan

Frederick Tan 

Assistant Research Professor, Biology 

Frederick Tan researches mechanisms to effectively train and support diverse populations in genomic data science. He focuses on strategies that increase persistence among undergraduates, graduate students, and faculty. Through collaborative educational programs, he aims to build inclusive communities that advance research using the latest genomic technologies.

Heiko ter Haseborg  

Associate Teaching Professor, Department of Modern Languages and Literatures 

Heiko ter Haseborg’s research interests include curriculum design, curriculum evaluation, assessment, world language pedagogy, teacher, and learner role in language education, second language acquisition, and metacognition.   

Karina A. Vado 

Senior Lecturer, Medicine, Science, and the Humanities Program 

Karina Vado’s research lies at the intersections of Latinx and Latin American literary and cultural studies, science and technology studies, and science fiction studies. She is currently working on her first book project, Latinx DNA: Race, Latinidad, and Gene(ome ). In it, she interrogates scientized representations of Latinx identity across multiple cultural forms—popular science writing, music, speculative memoirs, documentary film, and visual art—and considers the vexed de- and re-codings of Latinidad that these “texts” forward or foreclose. 

Jasmina Wiemann

Jasmina Wiemann

Assistant Professor, Department of Earth and Planetary Sciences 

Jasmina Wiemann’s research develops new chemical methods and instrument components that comparatively probe biologically and geologically informative patterns in the macromolecular composition of modern and fossil organismal tissues to characterize past, present, and predictable future interactions between life and its changing environments and translates these insights into bio-inspired climate solutions.

Frances Wu

Senior Lecturer, Organizational Leadership (Advanced Academic Programs) 

Frances Wu’s research focuses on cross-cultural leadership for global challenges and community-engaged pedagogies for leadership education. She has been awarded research grants on leadership for sustainability in business education in Singapore, higher education innovations in the U.S., and change processes for joint-venture universities between the U.S. and other countries.

IMAGES

  1. (PDF) A Review on Machine Translation Approaches

    thesis on machine translation

  2. PPT

    thesis on machine translation

  3. Machine Translation and Translation Theory (English) Hardcover Book

    thesis on machine translation

  4. (PDF) The Feasibility of Chinese–English Machine Translation Applied to

    thesis on machine translation

  5. (PDF) An Overall Perspective of Machine Translation with its

    thesis on machine translation

  6. 6 steps for successful machine translation

    thesis on machine translation

COMMENTS

  1. PDF Machine Translationness: a Concept for Machine Translation ...

    Machine translationness (MTness) is the linguistic phenomena that make machine trans-lations distinguishable from human translations. This thesis intends to present MT-ness as a research object and suggests an MT evaluation method based on determining whether the translation is machine-like instead of determining its human-likeness as in

  2. Machine translation and its evaluation: a study

    Machine translation (namely MT) has been one of the most popular fields in computational linguistics and Artificial Intelligence (AI). As one of the most promising approaches, MT can potentially break the language barrier of people from all over the world. Despite a number of studies in MT, there are few studies in summarizing and comparing MT ...

  3. PDF Machine Translation with Transformers

    translation approach. Neural machine translation is the art of using arti cial neu-ral networks (ANN) models to learn a statistical model for machine translation. The phrase-based translation systems (e.g. Marcu and Wong (2002), Koehn et al. (2003), Setiawan et al. (2005)) require the pipeline of specialized components such as language model ...

  4. A scientometric study of three decades of machine translation research

    1. Introduction. Machine translation (MT), a subfield of computational linguistics, is defined by the European Association of Machine Translation as any type of automated translation from one natural language to another (European Association of Machine Translation, Citation n.d.).It is a form of language processing technology that enables the automatic translation of text and speech from one ...

  5. (PDF) The Effectiveness of Machine Translation

    The. Effectiveness of. Machine. Translation. Saleh M. Al-Salman. Abu Dhabi University. Vol, 5, 2000. Abstract: Insofar as machine translation is based on computerized natural. language processing ...

  6. PDF Interactive and Adaptive Neural Machine Translation

    NEURAL MACHINE TRANSLATION by Rebecca Knowles A dissertation submitted to The Johns Hopkins University in conformity with the ... My undergraduate mathematics thesis was advised by Lynne Butler, who always set high expectations for me and backed them up with good advice and guidance. Nathan Sanders advised my linguistics thesis

  7. (PDF) The Machine Translation of Literature: Implications for

    Machine translation (MT) has been recently experiencing an upsurge in popularity within the realm of second language (L2) and foreign language (FL) education.This paper aims to present the ...

  8. Multimodal machine translation through visuals and speech

    Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data. The most prominent tasks in this area are spoken language translation, image-guided translation, and video-guided translation, which exploit ...

  9. PDF On the correctness of machine translation: A machine translation

    ore detail in Koponen and Salmi (2012).3.1 Correctness analysisFigure 1 shows the overall correctness evaluat. on of the raw machine translations and the post-edited versions. In the raw MT state (left column of Figure 1), only six of the 120 sentences (5%) were evaluated as fully correct, and a further 24 sentences (20%) were eval.

  10. PDF Machine Translation for Professional Translators

    Machine Translation for Professional Translators. ZurichOpenRepositoryand Archive UniversityofZurich MainLibrary Strickhofstrasse39 CH-8057Zurich www.zora.uzh.ch Year: 2020. Machine Translation for Professional Translators.

  11. Towards efficient universal neural machine translation

    Machine translation (MT) aims to overcome such barriers by automatically transforming information from one language to another. With the rapid development of deep neural networks, neural machine translation (NMT) - especially Transformer - has achieved great success in recent years, delivering state-of-the-art and even near human ...

  12. PDF From Feature to Paradigm: Deep Learning in Machine Translation

    In such a context, Machine Translation (MT), which is the task of automatically translating a text from a source language into a target language, is gaining more and more relevance. Both industry and academy are strongly investigating in the eld which is progressing at an incredible speed.

  13. Machine translation and its evaluation: a study

    Machine translation (namely MT) has been one of the most popular fields in computational linguistics and Artificial Intelligence (AI). As one of the most promising approaches, MT can potentially break the language barrier of people from all over the world. Despite a number of studies in MT, there are few studies in summarizing and comparing MT methods. To this end, in this paper, we ...

  14. PDF NEURAL MACHINE TRANSLATION ADISSERTATION ...

    field of Statistical Machine Translation (SMT) over the past two decades, the translation quality has not yet been satisfactory; at the same time, SMT systems become increasing complex with many different components built separately, rendering it extremely difficult to make further advancement. Recently, Neural Machine Translation (NMT ...

  15. PDF Master Thesis Using Machine Learning Methods for Evaluating the ...

    sifying sentences as automated translation and professional translation. The research is based on a database containing 22;327 sentences and 32 translation evaluation attributes, which are used for optimizations of five different machine learning approaches. An op-timization process consisting of 795;000 evaluations shows a prediction accuracy ...

  16. Comparing Machine Translation and Human Translation: A Case Study

    Comparing Machine Translation and Human Translation: A Case Study. November 2017. DOI: 10.26615/978-954-452-042-7_003. Conference: RANLP 2017 - Workshop on Human-Informed Translation and ...

  17. PDF MIXED-INITIATIVE NATURAL LANGUAGE TRANSLATION A ...

    translation system. e machine suggests future translations based on previous interactions. v. For example, if the user has typed part of a translation for a given input sentence, PTM can proposeacompletion. WealsoshowhowPTMcanself-correctitsmodelviaincremental machine learning.

  18. Simulating the Machine Translation of Low-Resource Languages by

    written—are left with no way to effectively benefit from machine translation. As a step toward better translation processors for low-resource languages, this thesis examined the possibility of machine translation between high resource languages and low resource languages through an analysis of different machine learning techniques, and ultimately

  19. Neural Machine Translation

    NEURAL MACHINE TRANSLATION. Senior Thesis by. Quinn Lanners. Dr. Thomas Laurent, Thesis Director Neural Machine Translation is the primary algorithm used in industry to perform machine translation. This state-of-the-art algorithm is an application of deep learning in which massive datasets of translated sentences are used to train a model ...

  20. A Survey on the Application of Online Machine Translation in Thesis

    The advantages and limitations of online machine translation tools as an aid in thesis writing, the impact of their use on the quality of the thesis, as well as the views of English majors on their use were analyzed, thus providing enlightenment for all stakeholders involved in the completion of graduation thesis including students, teachers ...

  21. Dissertations / Theses: 'Machine translating'

    In this thesis a prototype machine translation system is presented. This system is designed to translate English text into a gloss based representation of South African Sign Language (SASL). In order to perform the machine translation, a transfer based approach was taken. English text is parsed into an intermediate representation.

  22. PDF Attention-based Neural Machine Translation from English-Wolaytta

    nslation system for English-Wolaytta using attention mechanism. The English-Wolaytta machine translation system has been trained on parallel corpus covering the religious, and frequently used s. ntences or phrases which can be used in day to day communication. A total of 27351 parallel English-Wolaytta sentences wer.

  23. The Impacts and Challenges of Artificial Intelligence Translation Tool

    From early rule-based systems to advanced Neural Machine Translation (NMT), these technologies have enhanced translation quality but still highlight the indispensable role of human translators in ...

  24. Thang Luong's thesis on Neural Machine Translation

    Thang Luong's Thesis on Neural Machine Translation. This repository contains the latest version of my thesis. Motivated by my advisor (Chris Manning)'s suggestion on extreme openness, I started sharing my thesis writing since day 1. While it is not yet clear how useful it is (smile), you can at least find all of the edits made by my advisor ...

  25. AI machine translation tools must be taught cultural ...

    Physics solves a training problem for artificial neural networks. News & Views 07 AUG 24. These AI firms publish the world's most highly cited work. News 01 AUG 24. Cheap light sources could ...

  26. How an Instagram-Perfect Life in the Hamptons Ended in Tragedy

    It all culminated in the kind of envy-inducing images anticipated by the roughly 80,000 followers of "Mama and Tata," Ms. Miller's popular Instagram feed, which featured a near-constant ...

  27. New 2024-2025 faculty

    His recent academic essays and public writings focus especially on port cities, and on maritime labor, environment, and culture under conditions of globalization. ... and performance in his research. His writing on race, gender, translation, and popular culture also places pre-modern Iberia into conversation with urgent contemporary debates ...