Phrase-level? Generate from meaning? Reinforcement learning? Re-ranking?

What kinds of resources are available to MT?
- Translation lexicon: bilingual dictionary
- Templates, transfer rules: grammar books
- Parallel data, comparable data
- Thesaurus, WordNet, FrameNet, …
- NLP tools: tokenizer, morphological analyzer, parser, …
- More resources exist for major languages, fewer for "minor" languages.

Major approaches
- Transfer-based
- Interlingua
- Example-based (EBMT)
- Statistical MT (SMT)
- Hybrid approaches

The MT triangle
[Figure: a triangle running from "word" at the base to "meaning" at the apex; word-based SMT/EBMT sit at the bottom, phrase-based SMT/EBMT and transfer-based MT higher up, and interlingua (meaning) at the top.]

Transfer-based MT
- Analysis, transfer, generation:
  1. Parse the source sentence.
  2. Transform the parse tree with transfer rules.
  3. Translate the source words.
  4. Read the target sentence off the tree.
- Resources required:
  – a source-language parser
  – a translation lexicon
  – a set of transfer rules
- An example: "Mary bought a book yesterday."

Transfer-based MT (cont)
- Parsing: a linguistically motivated grammar or a formal grammar?
- Transfer: context-free rules? Additional constraints on the rules? Apply at most one rule at each level? How are the rules created?
- Translating words: word-to-word translation?
- Generation: using a language model or other additional knowledge?
- How can the needed resources be created automatically?

Interlingua
- For n languages, we need n(n−1) MT systems.
- Interlingua uses a language-independent representation.
- Conceptually, interlingua is elegant: we only need n analyzers and n generators.
- Resources needed:
  – a language-independent representation
  – sophisticated analyzers
  – sophisticated generators

Interlingua (cont)
- Questions:
  – Does a language-independent meaning representation really exist? If so, what does it look like?
  – It requires deep analysis: how do we get such an analyzer (e.g., a semantic analyzer)?
  – It requires non-trivial generation: how is that done?
  – It forces disambiguation at various levels: lexical, syntactic, semantic, and discourse.
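The four-step analysis–transfer–generation pipeline above can be sketched on the slide's own example, "Mary bought a book yesterday." Everything here is a made-up toy (the hand-built parse, the SVO-to-verb-final transfer rules, and the small English-to-Japanese lexicon), not any real system's grammar:

```python
# Toy transfer-based MT: analysis -> transfer -> generation.

# Step 1 (analysis): a hand-built parse of the source sentence;
# a real system would run a source-language parser here.
parse = ("S",
         ("NP", "Mary"),
         ("VP", ("V", "bought"), ("NP", "a", "book")),
         ("ADV", "yesterday"))

# Step 2 (transfer): rules that reorder the tree, e.g. English
# SVO order into a verb-final order as in Japanese.
def transfer(tree):
    if isinstance(tree, tuple) and tree[0] == "S":
        label, np, vp, adv = tree
        return (label, transfer(np), transfer(adv), transfer(vp))  # S -> NP ADV VP
    if isinstance(tree, tuple) and tree[0] == "VP":
        label, v, np = tree
        return (label, transfer(np), transfer(v))                  # VP -> NP V
    if isinstance(tree, tuple):
        return tuple(transfer(t) for t in tree)
    return tree

# Step 3 (lexical translation): a tiny bilingual lexicon
# (romanized Japanese; "a" has no counterpart and maps to "").
lexicon = {"Mary": "Mary", "bought": "katta", "a": "",
           "book": "hon-o", "yesterday": "kinou"}

# Step 4 (generation): read the target sentence off the tree.
def generate(tree):
    if isinstance(tree, tuple):
        words = [generate(t) for t in tree[1:]]  # skip the node label
        return " ".join(w for w in words if w)
    return lexicon.get(tree, tree)

print(generate(transfer(parse)))  # -> Mary kinou hon-o katta
```

This also makes the slide's open questions concrete: each `transfer` branch is one hand-written context-free rule, and step 4 uses no language model, only the tree order.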
  – It cannot take advantage of similarities between a particular language pair.

Example-based MT
- Basic idea: translate a sentence by using the closest match in the parallel data.
- First proposed by Nagao (1981).
- Ex:
  – Training data:
    w1 w2 w3 w4 ↔ w1′ w2′ w3′ w4′
    w5 w6 w7 ↔ w5′ w6′ w7′
    w8 w9 ↔ w8′ w9′
  – Test sentence:
    w1 w2 w6 w7 w9 → w1′ w2′ w6′ w7′ w9′

EBMT (cont)
- Types of EBMT:
  – lexical (shallow)
  – morphological / POS analysis
  – parse-tree based (deep)
- Types of data required by EBMT systems:
  – parallel text
  – bilingual dictionary
  – thesaurus for computing semantic similarity
  – syntactic parser, dependency parser, etc.

EBMT (cont)
- Word alignment: using a dictionary and heuristics
- Exact match
- Generalization:
  – Clusters: dates, numbers, colors, shapes, etc.
  – Clusters can be built by hand or learned automatically.
- Ex:
  – Exact match: "12 players met in Paris last Tuesday" → "12 Spieler trafen sich letzten Dienstag in Paris"
  – Templates: "$num players met in $city $time" → "$num Spieler trafen sich $time in $city"

Statistical MT
- Basic idea: learn all the parameters from parallel data.
- Major types:
  – word-based
  – phrase-based
- Strengths:
  – easy to build; it requires no human knowledge
  – good performance when a large amount of training data is available
- Weaknesses:
  – How to express linguistic generalizations?

Comparison of resource requirements

                     Transfer-based   Interlingua      EBMT        SMT
  dictionary               +               +             +
  transfer rules           +
  parser                   +               +           + (?)
  semantic analyzer                        +
  parallel data                                          +           +
  others                              universal      thesaurus
                                      representation

Hybrid MT
- Basic idea: combine the strengths of the different approaches:
  – Syntax-based: generalization at the syntactic level
  – Interlingua: conceptually elegant
  – EBMT: memorizing translations of n-grams; generalization at various levels
  – SMT: fully automatic; optimizing some objective function
- Type
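The EBMT generalization step above (replacing cluster members with variables like $num, $city, $time) can be sketched as template matching. The clusters, the template pair, and the slot lexicon below are hand-built illustrations; a real system would induce them from parallel data:

```python
import re

# Clusters generalize an exact match: each variable matches one word class.
clusters = {
    "num":  r"\d+",
    "city": r"(?:Paris|Berlin|London)",
    "time": r"(?:last \w+|yesterday|today)",
}

# The translation template pair from the slide.
src_template = "$num players met in $city $time"
tgt_template = "$num Spieler trafen sich $time in $city"

# Translations for slot fillers that are not invariant
# (numbers and city names often pass through unchanged).
slot_lexicon = {"last Tuesday": "letzten Dienstag"}

def template_to_regex(template):
    """Turn a template into a regex with one named group per $variable."""
    parts = []
    for tok in template.split():
        if tok.startswith("$"):
            name = tok[1:]
            parts.append(f"(?P<{name}>{clusters[name]})")
        else:
            parts.append(re.escape(tok))
    return re.compile(r"\s+".join(parts) + r"$")

def translate(sentence):
    """Match the source template and fill the target template's slots."""
    m = template_to_regex(src_template).match(sentence)
    if m is None:
        return None  # no template applies; fall back to the closest example
    out = tgt_template
    for name, value in m.groupdict().items():
        out = out.replace("$" + name, slot_lexicon.get(value, value))
    return out

print(translate("12 players met in Paris last Tuesday"))
# -> 12 Spieler trafen sich letzten Dienstag in Paris
```

One template now covers every sentence whose slot fillers belong to the right clusters, which is exactly the gain over the single exact-match pair.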