• Title/Summary/Keyword: Speech recognition.


Users' Preference and Acceptance of Smart Home Technologies (사용자의 스마트 주거 기술 선호와 수용에 관한 연구)

  • Cho, Myung Eun;Kim, Mi Jeong
    • Journal of the Architectural Institute of Korea Planning & Design / v.34 no.11 / pp.75-84 / 2018
  • This study analyzed users' acceptance of and intention to use smart home technologies, together with their needs and preferences, and identified how technology preference and acceptance differ across factors. The subjects were residents in their 40s and 60s living in Seoul or its suburbs; respondents in their 40s completed questionnaires, while those in their 60s were interviewed using the questionnaire. A total of 105 questionnaires were used as data, and frequency, mean, cross-tabulation, independent-samples t-test, one-way ANOVA, and multiple regression analyses were performed using SPSS 23. The results are as follows. First, hypertension, hyperlipidemia, and hypercholesterolemia were the most common diseases among respondents, and respondents wished to continue living in their current homes as long as they experienced no discomfort. Therefore, smart home development should support daily living and health care so that residents can live healthily in their own living space for a long time. Second, the technologies residents needed most were control of the residential environment and monitoring of residents' health and physiological changes. The most preferred sensor types were motion sensors and speech recognition, while video cameras had a very low preference. Third, technology anxiety was the most significant factor influencing the intention to accept smart home technology: the greater the technology anxiety, the weaker the acceptance. Fourth, various resident characteristics should be considered when applying smart home technology in homes. Age and familiarity with technology were the most influential variables, and technology preference and acceptance differed accordingly. Therefore, user-friendly smart home planning should take these results into account.

Search for Optimal Data Augmentation Policy for Environmental Sound Classification with Deep Neural Networks (심층 신경망을 통한 자연 소리 분류를 위한 최적의 데이터 증대 방법 탐색)

  • Park, Jinbae;Kumar, Teerath;Bae, Sung-Ho
    • Journal of Broadcast Engineering / v.25 no.6 / pp.854-860 / 2020
  • Deep neural networks have shown remarkable performance in various areas, including image classification and speech recognition. The variety of data generated by augmentation plays an important role in improving network performance: transforming the data during augmentation exposes the network to more diverse inputs and helps it generalize. In image processing, prior work has proposed not only new augmentation methods but also methods for searching for an optimal augmentation policy that varies with the dataset and the network structure. Inspired by this prior work, this paper searches for an optimal augmentation policy for sound data. We carried out many experiments that randomly combined augmentation methods such as adding noise, pitch shift, and time stretch to determine empirically which combination is most effective. By applying the resulting data augmentation policy, we achieve improved classification accuracy on the environmental sound classification dataset (ESC-50).
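
The random combination of augmentation methods described in this abstract could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, parameter ranges, and the `evaluate` callback (which would train a classifier and return validation accuracy) are all assumptions, and `change_speed` is a crude NumPy-only stand-in for proper pitch-shift and time-stretch operations.

```python
import numpy as np

def add_noise(x, rng, snr_db=20.0):
    """Add white Gaussian noise at an assumed signal-to-noise ratio (dB)."""
    signal_power = np.mean(x ** 2) + 1e-12
    noise_power = signal_power / (10 ** (snr_db / 10))
    return x + rng.normal(0.0, np.sqrt(noise_power), size=x.shape)

def change_speed(x, rng, low=0.9, high=1.1):
    """Resample by a random factor. This changes duration AND pitch together,
    so it only approximates separate pitch-shift / time-stretch operations."""
    rate = rng.uniform(low, high)
    n_out = max(1, int(round(len(x) / rate)))
    positions = np.linspace(0, len(x) - 1, n_out)
    return np.interp(positions, np.arange(len(x)), x)

# Hypothetical pool of augmentation operations to combine.
OPS = {"noise": add_noise, "speed": change_speed}

def sample_policy(rng, max_ops=2):
    """Pick a random, non-empty, ordered combination of operations."""
    k = int(rng.integers(1, max_ops + 1))
    names = rng.choice(list(OPS), size=k, replace=False)
    return tuple(str(name) for name in names)

def apply_policy(x, policy, rng):
    """Apply each operation of the policy to the waveform in order."""
    for name in policy:
        x = OPS[name](x, rng)
    return x

def random_search(evaluate, n_trials=20, seed=0):
    """Keep the policy with the best score; evaluate(policy) is user-supplied."""
    rng = np.random.default_rng(seed)
    best_policy, best_score = None, -np.inf
    for _ in range(n_trials):
        policy = sample_policy(rng)
        score = evaluate(policy)
        if score > best_score:
            best_policy, best_score = policy, score
    return best_policy, best_score
```

In practice `evaluate` would be the expensive part, since each candidate policy requires training and validating a model on the augmented data.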

Language Variation and World Englishes (언어변이와 세계영어들)

  • Kim, Yangsoon
    • The Journal of the Convergence on Culture Technology / v.7 no.1 / pp.234-239 / 2021
  • The purpose of this paper is to clarify the nature of language variation by exploring how it progresses to produce all English-lects, i.e., the World Englishes. The study of language variation in linguistics is a hybrid enterprise, and the study of World Englishes has led to the recognition of a highly diverse set of English-lects, encompassing regional dialects, sociolects, ethnolects, and (post-)colonial dialects. In this paper, we propose a hybrid language variation model with three interacting factors, namely social distancing, on/off-contact, and linguistic diversity, to examine the characteristics of language variation. In the context of World Englishes, social distance is typically low with respect to local location (country/speech) for local purposes. Social distance also varies with online/offline communication modes and with other social factors such as gender, age, and ethnic group, resulting in all English-lects. To clarify the nature of World Englishes, the core Englishes, BrE, AmE, and CanE, are discussed here.

A Study on the Linkage Model Between Institutions Related to Lifelong Education for People with Developmental Disabilities Based on the K-PACE Center of Daegu University: A Perspective on the Whole Life Cycle for People with Developmental Disabilities

  • Kim, Young-Jun;Kim, Wha-Soo;Rhee, Kun-Yong
    • International Journal of Advanced Culture Technology / v.10 no.1 / pp.24-35 / 2022
  • The purpose of this study was to form a linkage model in which local institutions related to lifelong education for people with disabilities can cooperate, based on the Daegu University K-PACE Center. The study started from recognizing the problem that the adult-centered lifelong education support system does not respond effectively to the independent living of people with developmental disabilities, even though independent living is a major factor determining their quality of life. In response to this problem, the study primarily emphasized the view that educational support for the independent living of people with developmental disabilities should establish the context of the school foundation. The context of the school foundation is established for adulthood-centered lifelong education for people with developmental disabilities because the curriculum is embodied through the standards of subject matter education. In this regard, the Daegu University K-PACE Center, which established a curriculum that supports the independent living of people with developmental disabilities by linking higher education and lifelong education, reflects the context of the school foundation in practice. Accordingly, this study prepared a strategy that can serve as a transition for advancing the curriculum organized by the Daegu University K-PACE Center, and this strategy was secondarily reflected in a procedure that can be linked to local institutions related to lifelong education for people with disabilities. Finally, this study presented a form of transition in which people with developmental disabilities can access the lifelong education curriculum throughout adulthood through the connection of these local institutions.

An analysis study on the quality of article to improve the performance of hate comments discrimination (악성댓글 판별의 성능 향상을 위한 품사 자질에 대한 분석 연구)

  • Kim, Hyoung Ju;Min, Moon Jong;Kim, Pan Koo
    • Smart Media Journal / v.10 no.4 / pp.71-79 / 2021
  • One social change brought about by widespread Internet use is communication in online spaces. In the past, remote conversation was possible only one-on-one unless the parties were physically in the same space, but technology now enables remote communication with large numbers of people through bulletin boards, communities, and social network services. As information and communication networks have developed, life has become more convenient, but the damage caused by rapid information exchange has also steadily increased. Recently, cybercrimes such as sending sexual messages to or making personal attacks on people well known on the Internet, including influencers as well as entertainers, have occurred, and some of those exposed to these cybercrimes have committed suicide. To reduce the damage caused by malicious comments, this paper studies a method for improving the performance of malicious comment discrimination through feature extraction based on parts of speech.

Application of Information Technologies to Improve the Quality of Services Provided to the Tourism Industry Under the COVID-19 Restrictions

  • Iudina, Elena Vladimirovna;Balova, Suzana L.;Maksimov, Dmitrij Vasilievich;Skoromets, Elena Klimentinovna;Ponyaeva, Tatyana Anatolyevna;Ksenofontova, Ekaterina Andreevna
    • International Journal of Computer Science & Network Security / v.22 no.6 / pp.7-12 / 2022
  • The modern stage of society's development is characterized by the rapid penetration of information technologies into all spheres of life. Their use contributes to improving the quality of tourism services, as well as the competitiveness of tourism industry enterprises. The role of information technology in tourism is growing more and more every year, which determines the relevance of the study of modern trends in the use of information technology in the tourism sector. The purpose of the study is to determine the possibilities of using information technologies to improve the quality of services provided to the tourism industry under the COVID-19 restrictions. The article systematizes the main approaches to the "cluster" category and provides an original definition of the "regional tourist cluster" concept. Based on an expert survey, the main trends in the introduction of information technologies in the tourism industry under the COVID-19 restrictions have been identified, which include virtual reality and augmented reality, speech recognition technologies, photo, video, audio (contactless control technologies), mobile IT applications and Big Data technologies. It has been concluded that the vast majority of improvements in the organization of tourism services under restrictions will be based on the organization of virtual solutions and online activities. The types of tourism services will also change, and information technology will help their development and dissemination.

Efficient Thread Allocation Method of Convolutional Neural Network based on GPGPU (GPGPU 기반 Convolutional Neural Network의 효율적인 스레드 할당 기법)

  • Kim, Mincheol;Lee, Kwangyeob
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.7 no.10 / pp.935-943 / 2017
  • The convolutional neural network (CNN), a neural network that learns from positive data and is used for image classification and speech recognition, has been continuously developed toward high-performance structures. However, it is difficult to use in embedded systems with limited resources. To address this, we use pre-learned weights and GPGPU (General-Purpose Computing on Graphics Processing Units), which applies the GPU to general-purpose computation, but limitations remain. Because a CNN performs simple, iterative operations, computation speed varies greatly with how threads are allocated and utilized on a Single Instruction Multiple Thread (SIMT) based GPGPU. When convolution and pooling operations are performed with threads, some threads are left idle. We increased operation speed by assigning these remaining threads to the subsequent feature map and kernel calculations.

Research on Developing a Conversational AI Callbot Solution for Medical Counselling

  • Won Ro LEE;Jeong Hyon CHOI;Min Soo KANG
    • Korean Journal of Artificial Intelligence / v.11 no.4 / pp.9-13 / 2023
  • In this study, we explored the potential of integrating interactive AI callbot technology into the medical consultation domain as part of a broader service development initiative. Aimed at enhancing patient satisfaction, the AI callbot was designed to efficiently address queries from hospitals' primary users, especially the elderly and those using phone services. By incorporating an AI-driven callbot into the hospital's customer service center, routine tasks such as appointment modifications and cancellations were managed efficiently by the AI Callbot Agent, while tasks requiring more detailed attention or specialization were handled by Human Agents, ensuring a balanced and collaborative approach. The deep learning model for voice recognition was based on the Transformer model and fine-tuned from a pre-trained model to fit the medical field. Existing recording files were converted into training data to perform self-supervised learning (SSL), and the model was implemented. An artificial neural network (ANN) model was used to analyze voice signals and transcribe them as text, and after deployment, intent recognition was enriched through reinforcement learning to continuously improve accuracy. For text-to-speech (TTS), the Transformer model was applied to text analysis, the acoustic model, and the vocoder, and Google's Natural Language API was applied to recognize intent. As the research progresses, challenges remain, such as interconnection between various EMR providers, doctors' time slots, appointments at two or more hospitals, and patient usage. However, for specialized cases in which reservations are easy to make, the callbot service appears to be immediately applicable in hospitals.

Robust Real-time Pose Estimation to Dynamic Environments for Modeling Mirror Neuron System (거울 신경 체계 모델링을 위한 동적 환경에 강인한 실시간 자세추정)

  • Jun-Ho Choi;Seung-Min Park
    • The Journal of the Korea institute of electronic communication sciences / v.19 no.3 / pp.583-588 / 2024
  • With the emergence of Brain-Computer Interface (BCI) technology, analyzing mirror neurons has become more feasible. However, evaluating the accuracy of BCI systems that rely on human thoughts poses challenges due to their qualitative nature. To harness the potential of BCI, we propose a new approach to measure accuracy based on the characteristics of mirror neurons in the human brain that are influenced by speech speed, depending on the ultimate goal of movement. In Chapter 2 of this paper, we introduce mirror neurons and provide an explanation of human posture estimation for mirror neurons. In Chapter 3, we present a powerful pose estimation method suitable for real-time dynamic environments using the technique of human posture estimation. Furthermore, we propose a method to analyze the accuracy of BCI using this robotic environment.

A Study of Psychometric Function Curve for Korean Standard Monosyllabic Word Lists for Preschoolers (KS-MWL-P) (한국표준 학령전기용 단음절어표 (Korean Standard Monosyllabic Word Lists for Preschoolers, KS-MWL-P)의 심리음향기능곡선 연구)

  • Shin, Hyun-Wook;Kim, Jin-Sook
    • The Journal of the Acoustical Society of Korea / v.28 no.6 / pp.534-541 / 2009
  • The word recognition test (WRT) for children can be useful for diagnosing the degree of communication disability, prescribing hearing instruments, planning aural rehabilitation and speech therapy, and determining the site of lesion. The Korean standard monosyllabic word lists for preschoolers (KS-MWL-P) were developed according to criteria given in the literature. However, because only 8 children participated in the development process, the authors of KS-MWL-P suggested that more children be included to verify the homogeneity of the lists using psychometric function curves. The purpose of this study was to examine the homogeneity of KS-MWL-P through psychometric analysis, supplementing this limitation of the lists. The 100 monosyllabic KS-MWL-P words were presented with pictures to 23 preschoolers with normal hearing. Psychometric function curves, with linear slopes between the 20% and 80% correct rates, were obtained from the recognition scores of each monosyllabic word at intensities from -10 to 40 dB HL and analyzed. As a result, the psychometric function curves were S-shaped, with the correct rate increasing with intensity, and showed no statistically significant differences among words or lists. The congruent graph shapes among lists also indicated good homogeneity, and the average slopes of lists 1, 2, 3, and 4 were 4.48, 3.86, 4.65, and 4.50, respectively. Homogeneity was judged suitable because the analysis of variance showed no statistically significant difference among lists (p>0.05). However, the slopes for item ranges 1-10, 1-20, and 1-25 of KS-MWL-P showed no difference, with p-values of 0.93, 0.59, 0.91, and 0.70 for lists 1, 2, 3, and 4, respectively. Although KS-MWL-P assumed that the lower-numbered items were easier and thus suited to testing younger children, the results of this study do not support that assumption. Accordingly, the items should be reordered according to the slope analysis suggested by this study so that younger children can be tested with easier items. Apart from this, KS-MWL-P proved useful as a clinical and rehabilitative evaluation and training tool for preschoolers.
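
One plausible reading of the linear slope mentioned in this abstract is the rise in percent correct per dB between the 20% and 80% points of the psychometric curve. The sketch below illustrates that calculation under this assumption; the function name and the sample scores are made up for illustration and are not data from the study.

```python
import numpy as np

def slope_20_80(levels_db, percent_correct):
    """Linear slope (percent per dB) between the 20% and 80% correct points.

    Assumes percent_correct rises strictly with level, so the curve can be
    inverted by linear interpolation to find the levels at 20% and 80%.
    """
    levels = np.asarray(levels_db, dtype=float)
    scores = np.asarray(percent_correct, dtype=float)
    if np.any(np.diff(scores) <= 0):
        raise ValueError("percent_correct must be strictly increasing")
    level_at_20 = np.interp(20.0, scores, levels)
    level_at_80 = np.interp(80.0, scores, levels)
    return (80.0 - 20.0) / (level_at_80 - level_at_20)

# Made-up recognition scores at the intensities named in the abstract (dB HL).
levels = [-10, 0, 10, 20, 30, 40]
scores = [0, 10, 40, 70, 90, 100]
slope = slope_20_80(levels, scores)
```

A steeper slope means the correct rate climbs faster with intensity, so comparing slopes across lists is one way to check whether the lists behave alike.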