• Title/Summary/Keyword: Conversation

Search Result 827, Processing Time 0.027 seconds

Framework Switching of Speaker Overlap Detection System (화자 겹침 검출 시스템의 프레임워크 전환 연구)

  • Kim, Hoinam;Park, Jisu;Cha, Shin;Son, Kyung A;Yun, Young-Sun;Park, Jeon Gue
    • Journal of Software Assessment and Valuation
    • /
    • v.17 no.1
    • /
    • pp.101-113
    • /
    • 2021
  • In this paper, we introduce a speaker overlap system and look at the process of converting the existed system on the specific framework of artificial intelligence. Speaker overlap is when two or more speakers speak at the same time during a conversation, and can lead to performance degradation in the fields of speech recognition or speaker recognition, and a lot of research is being conducted because it can prevent performance degradation. Recently, as application of artificial intelligence is increasing, there is a demand for switching between artificial intelligence frameworks. However, when switching frameworks, performance degradation is observed due to the unique characteristics of each framework, making it difficult to switch frameworks. In this paper, the process of converting the speaker overlap detection system based on the Keras framework to the pytorch-based system is explained and considers components. As a result of the framework switching, the pytorch-based system showed better performance than the existing Keras-based speaker overlap detection system, so it can be said that it is valuable as a fundamental study on systematic framework conversion.

A Multi-speaker Speech Synthesis System Using X-vector (x-vector를 이용한 다화자 음성합성 시스템)

  • Jo, Min Su;Kwon, Chul Hong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.4
    • /
    • pp.675-681
    • /
    • 2021
  • With the recent growth of the AI speaker market, the demand for speech synthesis technology that enables natural conversation with users is increasing. Therefore, there is a need for a multi-speaker speech synthesis system that can generate voices of various tones. In order to synthesize natural speech, it is required to train with a large-capacity. high-quality speech DB. However, it is very difficult in terms of recording time and cost to collect a high-quality, large-capacity speech database uttered by many speakers. Therefore, it is necessary to train the speech synthesis system using the speech DB of a very large number of speakers with a small amount of training data for each speaker, and a technique for naturally expressing the tone and rhyme of multiple speakers is required. In this paper, we propose a technology for constructing a speaker encoder by applying the deep learning-based x-vector technique used in speaker recognition technology, and synthesizing a new speaker's tone with a small amount of data through the speaker encoder. In the multi-speaker speech synthesis system, the module for synthesizing mel-spectrogram from input text is composed of Tacotron2, and the vocoder generating synthesized speech consists of WaveNet with mixture of logistic distributions applied. The x-vector extracted from the trained speaker embedding neural networks is added to Tacotron2 as an input to express the desired speaker's tone.

The Uncertainty of Logical Time The Time of Lacan's Psychoanalysis Flows Backwards (논리적 시간의 균열 라캉 정신분석의 시간은 거꾸로 흐른다)

  • Lee, Dong Seok
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.4
    • /
    • pp.113-122
    • /
    • 2021
  • This study begins on the basis of Jacques Lacan's article 『Logical Time and Assertions of Preemptive Certainty: A New Sophism』 published in the reissue of 『Art Note Les Cahiers d'Art』 in March 1945. In this paper, a guard presents an esoteric problem to three prisoners. If the problem is solved, the prisoner is released. A condition is given to solve a problem. Conversation between prisoners is prohibited, and the disc behind them cannot be seen. In this time and space, prisoners place themselves in logical time through the 'time of understanding' in order to become the chosen ones. We always live in logical time. We will argue the point at which Lacan destroys logical time in psychoanalysis. Time in Lacanian psychoanalysis transcends time divisions of the past, present, and future. Our time is always the past in the present. In Lacanian psychoanalysis, logical time is the time in the Other. The transcendence of the Lacanian psychoanalysis concept of time shows the deviation of logical time. In this text, We try to prove how Lacan contrasts psychoanalysis and the problem of time with time in the other. First, we will examine how logical time and impulse are related in psychoanalysis. Second, the postmortemity of the signifient (signifier) will be discussed. Third, Lacan psychoanalysis will present the transcendence of time. In conclusion, We will present the view that the time of Lacan psychoanalysis is flowing backwards. In Lacanian psychoanalysis, we try to prove that logical time is in the territory of the Other and is infinite time.

Signifiant and Lacan Psychoanalysis Narcissism of Repetition and Reflection (시니피앙과 라캉 정신분석 반복과 반영의 나르시시즘)

  • Lee, Dong Seok
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.4
    • /
    • pp.75-83
    • /
    • 2021
  • This study will analyze the meaning of the signifiant, which occupies an absolute position in Lacanian psychoanalysis, and will prove the slip of meaning and signification that are accompanied together at the same time when the signifiant was utter through the subject. By directly citing the part where Lacan explained signification in his seminars and Écrits, I would like to examine how signifiant is carried out in everyday conversation. In addition, the dialogue that takes place in our discourse has the purposefulness and groundless purposelessness of the signifiant. Understanding this purpose is the core part that Lacanian psychoanalysis aims to pursue, and it discovers the cracks of the hidden meaning in the relationship between the signifiant and the signifiant connected to the next, presenting that the signifiant which arouses unrelenting fantasy of the subject is the practical ruler of body and mind. This thesis is aimed to present the above points mentioned above, and as an alternative to overcome the limitations of the signifiant, the ruler in discourse, this study would like to suggest the autonomy as a subject resisting against "Where it was, I must come into being," pursued by Lacan psychoanalysis.

A Convergence Study on the Meaningful Activity Experience of the Elderly (노인의 의미 있는 활동에 관한 융합 연구)

  • Hwang, Hey-Jeong;Kang, Kyung-hee;Kim, Doo-Ree;Chang, Kyung-Hee;Kim, Kwang-Hwan
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.45-52
    • /
    • 2022
  • The purpose of this study, a self-written questionnaire was conducted on 110 people aged 65 or older to analyze the factors of meaningful activities in elderly. As a research methods were student t-test and one-way ANOVA were conducted using the SPSS statistical program, and hierarchical regression analysis was performed to identify factors affecting meaningful activities of the elderly. As a result of the study, the highest score was 3.95±0.64 in "I think my work (activity) with my family is rewarding." As a result of hierarchical regression analysis, the factors affecting the meaningful activities of the elderly were 'resident' in both stages 1(β=-.308, p=.002), 2(β=-.330, p=<.001), and 3(β=-.281, p=<.001), and 'age(β=-.215, p=.026)' in the second stage, indicating that the factors affecting the meaningful activities of the elderly were 'resident' and 'age'. In conclusion, it's necessary to develop and apply a systematic program that prioritizes conversation and communication while working with families for the younger elderly(65-74). In the future, it will be necessary to systematically apply various programs for meaningful activities in old age.

Christian Education Aiming for Homo Creators (호모 크레토스를 지향하는 기독교교육)

  • Kim, Hyung Hee
    • Journal of Christian Education in Korea
    • /
    • v.70
    • /
    • pp.141-173
    • /
    • 2022
  • The purpose of this study is to illuminate depersonalization in the flow of technological revolution and to present a Christian SARAMDAUM education that aims for a new human image. It represents the Christian SARAMDAUM education that adapts to, mediates, and offers alternatives to the technological and human evolutionary flow of the machine age. The purpose of education for this purpose is to aim for 'Homo Creators', creative human beings presented as a new human image in the age of technological revolution. The educational goal is to nurture creative human beings through creative interpretation, creative integration between disciplines, and personal dialogue in the post-mechanical/ post-conventional paradigm. The content of the education is a conversation with the SARAMDAUM that consiliences the characteristics of post-machine and post-convention. The educational method utilizes Edu-Tech and AIED(Artificial Intelligence in Education) to realize systemic thinking and SARAMDAUM dialogue of technology. In addition, the composition of teachers and learners, educational environment and educational evaluation is presented. The significance of this study is that from the point of view of Christian education, the identity of human beings in the era of the technological revolution has been identified, and research on the creative image of the human being is newly attempted, and the direction of Christian SARAMDAUM education aimed at this is presented. This can be said to be a Christian education that emphasizes the essential characteristics of human beings while accommodating the era of technological revolution.

A Study on Eye Tracking Techniques using Wearable Devices (웨어러블향(向) 시선추적 기법에 관한 연구)

  • Jaehyuck Jang;Jiu Jung;Junghoon Park
    • Smart Media Journal
    • /
    • v.12 no.3
    • /
    • pp.19-29
    • /
    • 2023
  • The eye tracking technology is widespread all around the society, and is demonstrating great performances in both preciseness and convenience. Hereby we can glimpse new possibility of an interface's conduct without screen-touching. This technology can become a new way of conversation for those including but not limited to the patients suffering from Lou Gehrig's disease, who are paralyzed each part by part of the body and finally cannot help but only moving eyes. Formerly in that case, the patients were given nothing to do but waiting for the death, even being unable to communicate with there families. A new interface that harnesses eyes as a new means of communication, although it conveys great difficulty, can be helpful for them. There surely are some eye tracking systems and equipment for their exclusive uses on the market. Notwithstanding, several obstacles including the complexity of operation and their high prices of over 12 million won($9,300) are hindering universal supply to people and coverage for the patients. Therefore, this paper suggests wearable-type eye tracking device that can support minorities and vulnerable people and be occupied inexpensively and study eye tracking method in order to maximize the possibility of future development across the world, finally proposing the way of designing and developing a brought-down costed eye tracking system based on high-efficient wearable device.

Developing a New Algorithm for Conversational Agent to Detect Recognition Error and Neologism Meaning: Utilizing Korean Syllable-based Word Similarity (대화형 에이전트 인식오류 및 신조어 탐지를 위한 알고리즘 개발: 한글 음절 분리 기반의 단어 유사도 활용)

  • Jung-Won Lee;Il Im
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.267-286
    • /
    • 2023
  • The conversational agents such as AI speakers utilize voice conversation for human-computer interaction. Voice recognition errors often occur in conversational situations. Recognition errors in user utterance records can be categorized into two types. The first type is misrecognition errors, where the agent fails to recognize the user's speech entirely. The second type is misinterpretation errors, where the user's speech is recognized and services are provided, but the interpretation differs from the user's intention. Among these, misinterpretation errors require separate error detection as they are recorded as successful service interactions. In this study, various text separation methods were applied to detect misinterpretation. For each of these text separation methods, the similarity of consecutive speech pairs using word embedding and document embedding techniques, which convert words and documents into vectors. This approach goes beyond simple word-based similarity calculation to explore a new method for detecting misinterpretation errors. The research method involved utilizing real user utterance records to train and develop a detection model by applying patterns of misinterpretation error causes. The results revealed that the most significant analysis result was obtained through initial consonant extraction for detecting misinterpretation errors caused by the use of unregistered neologisms. Through comparison with other separation methods, different error types could be observed. This study has two main implications. First, for misinterpretation errors that are difficult to detect due to lack of recognition, the study proposed diverse text separation methods and found a novel method that improved performance remarkably. Second, if this is applied to conversational agents or voice recognition services requiring neologism detection, patterns of errors occurring from the voice recognition stage can be specified. The study proposed and verified that even if not categorized as errors, services can be provided according to user-desired results.

The Study on the Representation of the Times in the Sports Films of the 1980s (1980년대 스포츠영화의 시대적 표상 연구)

  • Im, Jeong-Sig
    • Journal of Popular Narrative
    • /
    • v.25 no.1
    • /
    • pp.315-347
    • /
    • 2019
  • (1986) and (1987) represent the society of 1980s in which the professional baseball game was initiated to cover the irrational military culture. The love and marriage of sports players were the headlines of the media, and the yearly salary of the players was the hottest issue of conversation. The military culture is represented in the scenes where the coaches train the failures and inapt players in extreme drills. The films pinpoint the absurdity of military culture and win-at-all-costs mentality. The collapse of the dictatorial leadership at the end of the films is a metaphor for the collapse of the fifth Republic of Korea. The episodes where the players talk about contract money, and the trade of players and sports business were a new phenomenon of the 1980's. The fact that Oh Hyesung of chooses love instead of victory deals a big blow to the secular ambition for money, victory and dictatorial leadership. His option provides catharsis for an audience oppressed under military leadership and success driven ideology. On the other hand, Oh Hyesung of dies right at the moment of winning the world champion. He achieves neither love nor success. While Oh Hyesung of is a symbol of pure love and gives spiritual comfort to the audience, Oh Hyesung of gives a sense of hopelessness to the audience. Both of the two sports films reflect the representation of the 1980's but received opposing reviews from audiences.

Exploring the nature and direction of early childhood science education for sustainable development (지속가능발전지향 유아과학교육의 본질과 실천방향 탐색)

  • Cho, BooKyung;Seo, Hyunjung
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.3
    • /
    • pp.407-418
    • /
    • 2018
  • Science and technology have led the development of mankind, but have created problems such as natural depletion, climate change, economic inequality and poverty. The purpose of this study is to explore the meaning of early childhood science for sustainable development to solve these problems and to contribute to the harmony of nature and human beings. In order to accomplish this research objectives, 18 experts and 15 teachers were interviewed on the meaning of sustainable development and the directions of early childhood science education for sustainable development. Early childhood science education for sustainable development was categorized as follows. 'Mutual respect between child-teacher-organism', 'developing individual inquiry-based on community consciousness', 'looking at the world with child's eyes', 'deepening and expanding on topics of interest', 'continuous inquiry and commitment', 'conversation and sharing-centered exploration'. By these results, it was concluded that early childhood science education for sustainable development should start from the perspective of children, and was a meaningful process in which children constantly learn about the nature surrounding themselves based on mutual respect.