painting
  – Free search: landscape painting; travelers, huge mountain, Nature; scenery; Shensi province

Hsin-Hsi Chen 57

[Figure: ECIR in NPDM. Diagram labels: English Names, Chinese Names, Machine Transliteration, English Titles, Chinese Titles, Document Translation, Name Search, Title Search, English Query, Query Disambiguation, Specific Bilingual Dictionary, Generic Bilingual Dictionary, Chinese Query, Query Translation, Chinese IR System, NPDM Collection, Results]

Specific Topic Search
• Proper names are important query terms
  – Creators such as “林逋” (Lin P'u), “李建中” (Li Chien-chung), “歐陽脩” (Ouyang Hsiu), etc.
  – Emperors such as “康熙” (K'ang-hsi), “乾隆” (Ch'ien-lung), “徽宗” (Hui-tsung), etc.
  – Dynasties such as “宋” (Sung), “明” (Ming), “清” (Ch'ing), etc.

Name Transliteration
• The alphabets of Chinese and English are totally different
• Wade-Giles (WG) and Pinyin are two famous systems used to romanize Chinese in libraries
• Backward transliteration
  – Transliterate target-language terms back to source-language ones
  – Chen, Huang, and Tsai (COLING, 1998)
  – Lin and Chen (ROCLING, 2021)

Name Mapping Table
• Divide a name into a sequence of Chinese characters, and transform each character into phonemes
• Look up the phoneme-to-WG (Pinyin) mapping table, and derive a canonical form for the name
• Example
  – “林逋” → “ㄌㄧㄣ ㄆㄨ” → “Lin P'u” (WG)

Name Similarity
• Extract the named entity from the query
• Select the most similar named entity from the name mapping table
• Naming sequence/scheme
  – LastName FirstName1, e.g., Chu Hsi (朱熹)
  – FirstName1 LastName, e.g., Hsi Chu (朱熹)
  – LastName FirstName1-FirstName2, e.g., Hsu Tao-ning (許道寧)
  – FirstName1-FirstName2 LastName, e.g., Tao-ning Hsu (許道寧)
  – Any order, e.g., Tao Ning Hsu (許道寧)
  – Any transliteration, e.g., Ju Shi (朱熹)

Title
• “谿山行旅圖” → “Travelers among Mountains and Streams”
• Travelers, mountains, and streams are the basic components
• Users can express their information need through the description of a desired art work
• The system will measure the similarity between art titles (descriptions) and a query

Free Search
• A query is composed of several concepts.
• Concepts are either transliterated or translated.
• Query translation is similar to a small-scale IR system.
• Resources
  – Name-mapping table
  – Title-mapping table
  – Specific English-Chinese dictionary
  – Generic English-Chinese dictionary
  – …

Algorithm
• (1) For each resource, select the Chinese translations whose scores are larger than a specific threshold.
• (2) Merge the Chinese translations identified from the different resources, and sort them by score.
• (3) Consider the Chinese translation with the highest score in the sorted sequence.
  – If the intersection of the corresponding English description and the query is not empty, select the translation and delete from the query the English terms it has in common with the English description.
  – Otherwise, skip the Chinese translation.

Algorithm (Continued)
• (4) Repeat step (3) until the query is empty or all the Chinese translations in the sorted sequence have been considered.
• (5) If the query is not empty, look up the remaining words in the general dictionary. The Chinese query is composed of all the translated results.

Summary
• Users can select English input and Chinese output when they are unfamiliar with Chinese input or lack a Chinese input device, but can read Chinese.
• Images or videos are transparent to users who cannot read or write Chinese.
• The integration of different knowledge sources will be studied in the future.
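
Steps (1) to (5) of the query translation algorithm can be sketched roughly as follows. This is only an illustrative sketch, not the NPDM implementation: the slides do not say how scores are computed, so a Dice coefficient over term sets is assumed here, and the names `Candidate`, `overlap_score`, `translate_query`, the threshold value, and the toy tables are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    chinese: str        # Chinese translation offered by a resource
    english: frozenset  # terms of the corresponding English description
    score: float        # similarity between the description and the query


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple assumption)."""
    return text.lower().split()


def overlap_score(query_terms, description_terms):
    """Assumed scoring function: Dice coefficient over the two term sets."""
    q, d = set(query_terms), set(description_terms)
    if not q or not d:
        return 0.0
    return 2 * len(q & d) / (len(q) + len(d))


def translate_query(query, resources, general_dictionary, threshold=0.2):
    """Greedy English-to-Chinese query translation following steps (1)-(5).

    resources: list of dicts mapping an English description to a Chinese
    translation (e.g. a name-mapping table and a title-mapping table).
    """
    query_terms = tokenize(query)

    # Step 1: per resource, keep translations scoring above the threshold.
    candidates = []
    for resource in resources:
        for english, chinese in resource.items():
            description = frozenset(tokenize(english))
            score = overlap_score(query_terms, description)
            if score > threshold:
                candidates.append(Candidate(chinese, description, score))

    # Step 2: merge the candidates from all resources and sort by score.
    candidates.sort(key=lambda c: c.score, reverse=True)

    # Steps 3-4: walk the sorted list; select a candidate only if it still
    # shares terms with the remaining query, then drop those shared terms.
    remaining = list(query_terms)
    selected = []
    for candidate in candidates:
        if not remaining:
            break
        if candidate.english & set(remaining):
            selected.append(candidate.chinese)
            remaining = [t for t in remaining if t not in candidate.english]

    # Step 5: look up leftover words in the general dictionary.
    for term in remaining:
        if term in general_dictionary:
            selected.append(general_dictionary[term])

    return selected


# Toy data for illustration only.
name_table = {"chu hsi": "朱熹"}
title_table = {"travelers among mountains and streams": "谿山行旅圖"}
general = {"painting": "繪畫"}

print(translate_query("Chu Hsi travelers mountains painting",
                      [name_table, title_table], general))
# → ['朱熹', '谿山行旅圖', '繪畫']
```

In this toy run the name entry scores highest and is selected first, the title entry then consumes "travelers" and "mountains", and the leftover word "painting" falls through to the general dictionary. Words found in no resource are silently dropped in this sketch; a real system would need a policy for them.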