• Title/Summary/Keyword: synonymy

Search Result 33, Processing Time 0.018 seconds

An Artificial Intelligence Approach for Word Semantic Similarity Measure of Hindi Language

  • Younas, Farah;Nadir, Jumana;Usman, Muhammad;Khan, Muhammad Attique;Khan, Sajid Ali;Kadry, Seifedine;Nam, Yunyoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.6
    • /
    • pp.2049-2068
    • /
    • 2021
  • AI combined with NLP techniques has promoted the use of Virtual Assistants and have made people rely on them for many diverse uses. Conversational Agents are the most promising technique that assists computer users through their operation. An important challenge in developing Conversational Agents globally is transferring the groundbreaking expertise obtained in English to other languages. AI is making it possible to transfer this learning. There is a dire need to develop systems that understand secular languages. One such difficult language is Hindi, which is the fourth most spoken language in the world. Semantic similarity is an important part of Natural Language Processing, which involves applications such as ontology learning and information extraction, for developing conversational agents. Most of the research is concentrated on English and other European languages. This paper presents a Corpus-based word semantic similarity measure for Hindi. An experiment involving the translation of the English benchmark dataset to Hindi is performed, investigating the incorporation of the corpus, with human and machine similarity ratings. A significant correlation to the human intuition and the algorithm ratings has been calculated for analyzing the accuracy of the proposed similarity measures. The method can be adapted in various applications of word semantic similarity or module for any other language.

Revisiting the Parvilucifera infectans / P. sinerae (Alveolata, Perkinsozoa) species complex, two parasitoids of dinoflagellates

  • Jeon, Boo Seong;Nam, Seung Won;Kim, Sunju;Park, Myung Gil
    • ALGAE
    • /
    • v.33 no.1
    • /
    • pp.1-19
    • /
    • 2018
  • Members of the family Parviluciferaceae (Alveolata, Perkinsozoa) are the well-known dinoflagellate parasitoids along with Amoebophrya ceratii species complex and parasitic chytrid Dinomyces arenysensis and contain six species across three genera (i.e., Parvilucifera infectans, P. sinerae, P. rostrata, and P. corolla, Dinovorax pyriformis, and Snorkelia prorocentri) so far. Among Parvilucifera species, the two species, P. infectans and P. sinerae, are very similar or almost identical each other morphologically and genetically, thereby make it difficult to distinguish between the two. The only main difference between the two species known so far is the number of sporangium wall (i.e., 2 layers in P. infectans vs. 3 layers in P. sinerae). During sampling in Masan bay, Korea during the spring season of 2015, the dinoflagellate Akashiwo sanguinea cells infected by the parasite Parvilucifera were observed and this host-parasite system was established in culture. Using this culture, its morphological and ultrastructural features with special emphasis on the variation in the number of sporangium wall over developmental times, were investigated. In addition, the sequences of rDNA regions and ${\beta}-tubulin$ genes were determined. The result clearly demonstrated that the trophocyte at 36 h was covered with 4 layers, and then outer layer of the sporocyte gradually degraded over time, resulting in wall structure consisting of two layers, with even processes being detached from 7-day-old sporangium with smooth surface, indicating that the difference in the number of layers seems not to be an appropriate ultrastructural character for distinguishing P. infectans and P. sinerae. While pairwise comparison of the large subunit rDNA sequences showed 100% identity among P. infectans / P. sinerae species complex, genetic differences were found in the small subunit (SSU) rDNA sequences but the differences were relatively small (11-13 nucleotides) compared with those (190-272 nucleotides) found among the rest of Parvilucifera species (P. rostrata and P. corolla). Those small differences in SSU rDNA sequences of P. infectans / P. sinerae species complex may reflect the variations within inter- strains of the same species from different geographical areas. Taken together, all morphological, ultrastructural, and molecular data from the present study suggest that they are the same species.

On Doublets (쌍형어에 대하여)

  • Yi, Eun-Gyeong
    • Cross-Cultural Studies
    • /
    • v.50
    • /
    • pp.425-451
    • /
    • 2018
  • In this paper, we examined the issues of the discussions on the subject of doublets. In general, as a definition, the use of doublets refer to a pair of words which have a common etymon, but also to a pair of words or grammatical morphemes that have the same meaning and similar forms of the word. In this paper, we have seen that a typical pairing word is a pair of words with a common etymology. Generally speaking, it is possible to divide doublets into subtypes depending on the identified similarities or differences in the meaning or form. The most distant type from the typical type of doublets is a pair of words that do not have a common etymon, but have the same meaning and are similar in form. The second issue about doublets is whether doublets include only words. For example, if some josas (postpositions or particles) have a common etymon, then it is noted that they can be accepted as a kind of doublets. In the case of suffixes, it may be possible to recognize the suffixes as doublets if they have a common etymon. In other words, it is not necessary to recognize the suffixes as doublets because the derivatives which are derived by the suffixes can be accepted as doublets. In the case of endings, it may be possible to recognize a pair of endings which have the same meaning and the common etymon as a doublet. Otherwise, the word forms to which the endings are combined can be accepted likewise as doublets. However, considering the fact that the endings typically in use in the Korean language may have syntactic properties, the endings should be considered as doublets rather than the words which have the endings. Finally, we conclude that there may be some debate as to whether stem doublets or ending doublets belong to a lexical item in the lexicon. It can be said that they are plural underlying forms and may be deserving of further research.