• Title/Summary/Keyword: 이중언어

Search Result 248, Processing Time 0.035 seconds

Bilingual Word Embedding using Subtitle Parallel Corpus (자막 병렬 코퍼스를 이용한 이중 언어 워드 임베딩)

  • Lee, Seolhwa;Lee, Chanhee;Lim, Heuiseok
    • Proceedings of The KACE
    • /
    • 2017.08a
    • /
    • pp.157-160
    • /
    • 2017
  • 최근 자연 언어 처리 분야에서는 단어를 실수벡터로 임베딩하는 워드 임베딩(Word embedding) 기술이 많은 각광을 받고 있다. 최근에는 서로 다른 두 언어를 이용한 이중 언어 위드 임베딩(Bilingual word embedding) 방법을 사용하는 연구가 많이 이루어지고 있는데, 이중 언어 워드 임베딩에서 임베딩 절과의 질은 학습하는 코퍼스의 정렬방식에 따라 많은 영향을 받는다. 본 논문은 자막 병렬 코퍼스를 이용하여 밑바탕 어휘집(Seed lexicon)을 구축하여 번역 연결 강도를 향상시키고, 이중 언어 워드 임베딩의 사천(Vocabulary) 확장을 위한 언어별 연결 함수(Language-specific mapping function)을 학습하는 새로운 방식의 모델을 제안한다. 제안한 모델은 기존 모델과의 성능비교에서 비교할만한 수준의 결과를 얻었다.

  • PDF

Research trends of bilingual education for children from multicultural families : focused on foreigner parent's language education (국내 다문화가정 자녀를 위한 이중언어교육 연구동향 -이주부모나라 언어교육을 중심으로-)

  • Chen, Jia;Huang, Hai-Ying
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.6 no.10
    • /
    • pp.107-120
    • /
    • 2016
  • The purpose of this study is to analyze research trends of bilingual education which are focused on foreigner parent's language education for children from multicultural families in South Korea. After searching, total 36 research papers were selected and analyzed. Results from this study showed that, (1) relevant researches started from 2009, later then the number of researches kept about 4-6 yearly and reached the peak of 11 (31%) in 2004; (2) a large majority (75%) of the analyzed researches published in the magazines relative to a field of language education or children education; (3) diverse research methods were employed, and the number of researches applied on qualitative research methods is similar with on quantitative ; (4) the researches mainly studied on the members of multicultural families (67%), and 10 analyzed researches investigated parents and children together 28%, and 9 targeted on parents or one of them (25%), and 5 just on the children from multicultural families (14%). and, (5) The research topics mainly divided into four fields current bilingual education programs (31%), bilingual education (31%), interaction related to bilingual education (22%), suggestion on policy improvement(17%). Based on the results, discussions and implications for further research and practice on bilingual education and foreigner parent's language education will be provided.

The study of bilingual learning of mathematics at International High School in South Korea (한국 국제학교에서의 이중 언어를 활용한 수학학습에 관한 연구)

  • Gong, Mina;Kim, Yunghwan
    • Journal of the Korean School Mathematics Society
    • /
    • v.22 no.2
    • /
    • pp.115-132
    • /
    • 2019
  • This study investigated results of bilingual learning of mathematics in teaching quadratic functions at an international high school in South Korea. Unlike a Korean traditional public school, this international school has provided bilingual learning. 10th grade students enrolled two math classes, Algebra II that was taught in English through CCSS and High School Mathematics I that was taught in Korean through the Korean National Math Curriculum. In order to collect information on students' behaviors and math achievement, we analyzed students' academic back grounds, mathematical abilities, results of interviews, observations, questionnaires and assessments. The results of this study include specific benefits. Bilingual learning of mathematics is effective as a method to improve Korean students' mathematical abilities and attitudes as well as positive influence on Korean mathematics education.

A Study on the Experiences of Multi-cultural Family Mothers in Children's Bilingual Education (다문화가정 어머니가 경험한 자녀의 이중언어교육에 관한 연구)

  • Cho, Seung-Suk;Kim, Hee-Soon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.11
    • /
    • pp.5549-5558
    • /
    • 2013
  • This study is aimed at qualitatively looking into multi-cultual family mothers' experiences in children's bilingual education. The subjects for the study were seven married immigrant Vietnamese women with elementary school children who have recently finished bilingual education. After recording their interviews and then transcribing them, the data was analyzed by utilizing comparative analysis method. The results were divided into three categories of experiences: emotional, identical, and relational aspects. The results of this study will offer information about necessary support for bilingual education to multi-cultural family members.

Developing a Korean sentiment lexicon through BPE (BPE를 활용한 한국어 감정사전 제작)

  • Park, Ho-Min;Cheon, Min-Ah;Nam-Goong, Young;Choi, Min-Seok;Yoon, Ho;Kim, Jae-Kyun;Kim, Jae-Hoon
    • Annual Conference on Human and Language Technology
    • /
    • 2019.10a
    • /
    • pp.510-513
    • /
    • 2019
  • 감정분석은 텍스트에서 나타난 저자 혹은 발화자의 태도, 의견 등과 같은 주관적인 정보를 추출하는 기술이며, 여론 분석, 시장 동향 분석 등 다양한 분야에 두루 사용된다. 감정분석 방법은 사전 기반 방법, 기계학습 기반 방법 등이 있다. 본 논문은 사전 기반 감정분석에 필요한 한국어 감정사전 자동 구축 방법을 제안한다. 본 논문은 영어 감정사전으로부터 한국어 감정사전을 자동으로 구축하는 방법이며, 크게 세 단계로 구성된다. 첫 번째는 한영 병렬 말뭉치를 이용한 한영 이중언어 사전을 구축하는 단계이고, 두 번째는 한영 이중언어 사전을 통한 한영 이중언어 그래프를 생성하는 단계이며, 세 번째는 영어 단어의 감정값을 한국어 BPE의 감정값으로 전파하는 단계이다. 본 논문에서는 제안된 방법의 유효성을 보이기 위해 사전 기반 한국어 감정분석 시스템을 구축하여 평가하였으며, 그 결과 제안된 방법이 합리적인 방법임을 확인할 수 있었으며 향후 연구를 통해 개선한다면 질 좋은 한국어 감정사전을 효과적인 방법으로 구축할 수 있을 것이다.

  • PDF

Utilizing Local Bilingual Embeddings on Korean-English Law Data (한국어-영어 법률 말뭉치의 로컬 이중 언어 임베딩)

  • Choi, Soon-Young;Matteson, Andrew Stuart;Lim, Heui-Seok
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.10
    • /
    • pp.45-53
    • /
    • 2018
  • Recently, studies about bilingual word embedding have been gaining much attention. However, bilingual word embedding with Korean is not actively pursued due to the difficulty in obtaining a sizable, high quality corpus. Local embeddings that can be applied to specific domains are relatively rare. Additionally, multi-word vocabulary is problematic due to the lack of one-to-one word-level correspondence in translation pairs. In this paper, we crawl 868,163 paragraphs from a Korean-English law corpus and propose three mapping strategies for word embedding. These strategies address the aforementioned issues including multi-word translation and improve translation pair quality on paragraph-aligned data. We demonstrate a twofold increase in translation pair quality compared to the global bilingual word embedding baseline.

The Semantic System in Late Korean-English Bilinguals (후기 한국어-영어 이중언어자의 의미체계)

  • Jeong, Woo-Rim;Kim, Min-Jung;Lee, Seung-Bok
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.2
    • /
    • pp.177-203
    • /
    • 2008
  • The present study was aimed to compare the semantic systems represented by the lexicon between L1 and L2 in late Korean-English bilinguals. The participants performed the word-picture matching task. the task was to decide whether the pictures represent the previously presented words' meaning. The words were the basic level categories. The stimuli were consisted of common object belonged to two different semantic categories (natural and artificial). To control the translation strategies, the SOA were manipulated as 650ms(Exp. 1) and 250ms(Exp. 2). No translation effort was found in the comparison of the two experiments. In both experiment, the RTs were faster in L1 rendition, and it took longer to decide the stimuli in natural categories than with artificial ones in L1. However, this category effect was not observed in L2. The results showed the differences in the organization of semantic representations in the brain through the bilinguals' two languages. While L1 semantic knowledge might be more systematically organized, that of L2 seems to be less well organized, at least by late bilinguals who participated in the present study.

  • PDF

Language Games between Donald Trump and Gloria Anzaldúa (도널드 트럼프와 글로리아 안살두아의 '언어' 게임)

  • Park, Jungwon
    • Cross-Cultural Studies
    • /
    • v.46
    • /
    • pp.85-112
    • /
    • 2017
  • Donald Trump, the $45^{th}$ president of the United States, has revived the 'English Only' policy since the beginning of his presidential campaign. The monolingualism not only underscores his extremely conservative ideas, but it also reflects the nativist tendency that prevents the demographic and cultural transformation of the US, which is accelerated by globalization and transnational migration. In particular, Donald Trump tries to reconfirm the mainstream American culture that is now thought to have been threatened by Hispanization and the growing number of Spanish speakers. This paper examines the effects of "code-switching" and the possibility of a bilingual community by contrasting Donald Trump with Gloria $Anzald{\acute{u}}a$, one of the representative Latina writers who created a "border language." Borderlands/La Frontera (1987) includes Spanish glossaries and expressions to represent her bilingual realities, while attempting to translate from English to Spanish, and vice versa. However, the text occasionally demonstrates the impossibility of translation. In doing so, $Anzald{\acute{u}}a$ indirectly states that it is indispensable to present both languages at the stage; she also invites monolingual readers to make more efforts to learn and better understand the Other's language. A "border language" she attempts to embody throughout the text is created in the process of encounters, conflicts, and negotiations among languages of different ethnicities, classes and generations. It does not signify an established form: rather it appears as a constantly transforming language, which can provide us with new perspectives and an alternative way of communication beyond monolingualism.

The Correlation of the Diction of Korean Lyrics Employed by Bilingual Users and the General Diction of K-Pop Lyrics : Focusing on a Korean-American Hip-hop Singers in the 1990s (이중언어 사용자와 K-Pop 노랫말 딕션과의 연관성 : 90년대 후반 재미교포 힙합가수를 중심으로)

  • Seo, Keun-Young
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.8
    • /
    • pp.267-280
    • /
    • 2019
  • The hybrid phenomenon of pronunciation of K-Pop lyrics, which is the diction fusion phenomenon of lyrics, is not a strategic measure for an overseas expansion but a natural formation in the course of history of K-Pop. With the debut of Seo Tai-ji and Boys in 1992, the course of K-Pop changed and settled from ballad to dance music. After three years, the success of a hip hop dance music, 'Come Back Home', opened up a door to Korean Americans who ascertained the potential competitiveness of a hip hop music in K-Pop market. And the writer insists that such advance of Korean Americans into K-Pop market is the origin of the bilingual usage and diction fusion phenomenon of K-Pop lyrics. Therefore, this research will look into the pronunciation system of Korean and English in order to understand the bilingual pronunciation system shown in the lyrics sung by Korean American hip hop singers in the 1990s. Moreover, through Korean pronunciation system, this analysis will discover the main cause of continual acceptance and application of Korean American's broken Korean in the diction of lyrics. Last but not least, this exploration will show the correlation of English pronunciation method of Korean lyrics appeared in K-Pop and the diction of Korean lyrics used by Korean American hip hop singers in late 90s. This research will identify the primary cause of the diction fusion phenomenon in the course of history of K-Pop.

Construction of bilingually pre-trained language model from large-scaled Korean and English corpus (KE-T5: 한국어-영어 대용량 텍스트를 활용한 이중언어 사전학습기반 대형 언어모델 구축)

  • Shin, Saim;Kim, San;Seo, Hyeon-Tae
    • Annual Conference on Human and Language Technology
    • /
    • 2021.10a
    • /
    • pp.419-422
    • /
    • 2021
  • 본 논문은 한국어와 영어 코퍼스 93GB를 활용하여 구축한 대형 사전학습기반 언어모델인 KE-T5를 소개한다. KE-T5는 한국어와 영어 어휘 64,000개를 포함하는 대규모의 언어모델로 다양한 한국어처리와 한국어와 영어를 모두 포함하는 번역 등의 복합언어 태스크에서도 높은 성능을 기대할 수 있다. KE-T5의 활용은 대규모의 언어모델을 기반으로 영어 수준의 복잡한 언어처리 태스크에 대한 연구들을 본격적으로 시작할 수 있는 기반을 마련하였다.

  • PDF