• Title/Summary/Keyword: Audio Technology

Search Result 634, Processing Time 0.034 seconds

Hate Speech Detection Using Modified Principal Component Analysis and Enhanced Convolution Neural Network on Twitter Dataset

  • Majed, Alowaidi
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.1
    • /
    • pp.112-119
    • /
    • 2023
  • Traditionally used for networking computers and communications, the Internet has been evolving from the beginning. Internet is the backbone for many things on the web including social media. The concept of social networking which started in the early 1990s has also been growing with the internet. Social Networking Sites (SNSs) sprung and stayed back to an important element of internet usage mainly due to the services or provisions they allow on the web. Twitter and Facebook have become the primary means by which most individuals keep in touch with others and carry on substantive conversations. These sites allow the posting of photos, videos and support audio and video storage on the sites which can be shared amongst users. Although an attractive option, these provisions have also culminated in issues for these sites like posting offensive material. Though not always, users of SNSs have their share in promoting hate by their words or speeches which is difficult to be curtailed after being uploaded in the media. Hence, this article outlines a process for extracting user reviews from the Twitter corpus in order to identify instances of hate speech. Through the use of MPCA (Modified Principal Component Analysis) and ECNN, we are able to identify instances of hate speech in the text (Enhanced Convolutional Neural Network). With the use of NLP, a fully autonomous system for assessing syntax and meaning can be established (NLP). There is a strong emphasis on pre-processing, feature extraction, and classification. Cleansing the text by removing extra spaces, punctuation, and stop words is what normalization is all about. In the process of extracting features, these features that have already been processed are used. During the feature extraction process, the MPCA algorithm is used. It takes a set of related features and pulls out the ones that tell us the most about the dataset we give itThe proposed categorization method is then put forth as a means of detecting instances of hate speech or abusive language. It is argued that ECNN is superior to other methods for identifying hateful content online. It can take in massive amounts of data and quickly return accurate results, especially for larger datasets. As a result, the proposed MPCA+ECNN algorithm improves not only the F-measure values, but also the accuracy, precision, and recall.

A Study on the Implementation of a Community-based LIS Capstone Course: Developing the 21st Century Skills of Preservice Librarians through Human Library Projects (지역사회협력 기반 문헌정보학 캡스톤 교과목 개발과 운영에 관한 연구 - 휴먼라이브러리 프로젝트 수행을 통한 21세기 학습 기술 강화를 중심으로 -)

  • Jisue Lee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.2
    • /
    • pp.379-408
    • /
    • 2023
  • This case study reports on the redevelopment of a course, Local Culture Information Theory offered by the Department of Library and Information Science at C University, into a capstone design course using a project-based learning approach. In collaboration with a local community youth organization, the redesigned course provided an opportunity for LIS students to develop and implement a digital literacy program that enabled high school students to use a variety of digital multimedia technologies to complete a project of digital Human Library featuring video, audio, and digital are such as webtoons. Through semi-structured interviews with 5 students and 3 staff from partner organizations, this study reports on course development process, the establishment of local partnerships, project outcome, as well as suggestions for improvements. In addition, a qualitative analysis of the participating students' interview responses using the Framework for 21st Century Learning (P21) found they developed and improved 11 skills across three core areas: life and career skills including self-direction, project management, collaboration with diverse teams, flexibility, responsibility, leadership; learning and innovation skills including communication and collaboration, problem-solving, creativity, and critical thinking; and information, media, and technology skills through media creation. Lessons learned and recommendations from this case study may be useful for other LIS programs and faculty interested in implementing project-based learning or developing capstone design courses.

A Brief Analysis of the Application of Chinese Traditional Culture in Big Fish and Begonia (<대어해당> 중 중국전통문화의 응용에 대한 간략 분석)

  • Xiaoli, Wang
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.5
    • /
    • pp.67-72
    • /
    • 2019
  • Animation is a comprehensive audio-visual art, animation literature, painting, music, architecture, photography and other art forms are integrated. China's animation industry has made some achievements in the new century, but on the whole, with the globalization of China, China's animation industry has been influenced by Japan and the United States. China has a history and culture of five thousand years, with profound social deposits and cultural foundation. Of the four ancient civilizations in the world, the Chinese civilization is the only one that has survived. China has too many stories to tell. From the development history of Chinese and foreign animation, we can see that many Chinese traditional cultural elements are used for reference. Since the 1980s, Chinese animation has been on the road of national revival. Chinese animation has begun to draw close to traditional culture in terms of themes, characters and scenes, and integrate Chinese traditional cultural elements. The theme of big fish and begonia is to repay kindness by sacrificing one's own life for the sake of justice and friendship. This fearless spirit of sacrificing one's life for justice is the concentrated embodiment of the fine qualities of the Chinese nation over the past several thousand years. Kun to save chun and give up his life, chun in order to repay rather give up half of his life, and qiushui in order to help their beloved, also would rather give up all of their own. These three protagonists are very distinctive personality characteristics, are to "righteousness" and give up their most precious things. At the same time, big fish and begonia combines many traditional Chinese cultural elements to form an animated film with Chinese characteristics.

Research on Generative AI for Korean Multi-Modal Montage App (한국형 멀티모달 몽타주 앱을 위한 생성형 AI 연구)

  • Lim, Jeounghyun;Cha, Kyung-Ae;Koh, Jaepil;Hong, Won-Kee
    • Journal of Service Research and Studies
    • /
    • v.14 no.1
    • /
    • pp.13-26
    • /
    • 2024
  • Multi-modal generation is the process of generating results based on a variety of information, such as text, images, and audio. With the rapid development of AI technology, there is a growing number of multi-modal based systems that synthesize different types of data to produce results. In this paper, we present an AI system that uses speech and text recognition to describe a person and generate a montage image. While the existing montage generation technology is based on the appearance of Westerners, the montage generation system developed in this paper learns a model based on Korean facial features. Therefore, it is possible to create more accurate and effective Korean montage images based on multi-modal voice and text specific to Korean. Since the developed montage generation app can be utilized as a draft montage, it can dramatically reduce the manual labor of existing montage production personnel. For this purpose, we utilized persona-based virtual person montage data provided by the AI-Hub of the National Information Society Agency. AI-Hub is an AI integration platform aimed at providing a one-stop service by building artificial intelligence learning data necessary for the development of AI technology and services. The image generation system was implemented using VQGAN, a deep learning model used to generate high-resolution images, and the KoDALLE model, a Korean-based image generation model. It can be confirmed that the learned AI model creates a montage image of a face that is very similar to what was described using voice and text. To verify the practicality of the developed montage generation app, 10 testers used it and more than 70% responded that they were satisfied. The montage generator can be used in various fields, such as criminal detection, to describe and image facial features.

A Study of Performance Analysis on Effective Multiple Buffering and Packetizing Method of Multimedia Data for User-Demand Oriented RTSP Based Transmissions Between the PoC Box and a Terminal (PoC Box 단말의 RTSP 운용을 위한 사용자 요구 중심의 효율적인 다중 수신 버퍼링 기법 및 패킷화 방법에 대한 성능 분석에 관한 연구)

  • Bang, Ji-Woong;Kim, Dae-Won
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.1
    • /
    • pp.54-75
    • /
    • 2011
  • PoC(Push-to-talk Over Cellular) is an integrated technology of group voice calls, video calls and internet based multimedia services. If a PoC user can not participate in the PoC session for various reasons such as an emergency situation, lack of battery capacity, then the user can use the PoC Box which has a similar functionality to the MM Box in the MMS(Multimedia Messaging Service). The RTSP(Real-Time Streaming Protocol) method is recommended to be used when there is a transmission session between the PoC box and a terminal. Since the existing VOD service uses a wired network, the packet size of RTSP-based VOD service is huge, however, the PoC service has wireless communication environments which have general characteristics to be used in RTSP method. Packet loss in a wired communication environments is relatively less than that in wireless communication environment, therefore, a buffering latency occurs in PoC service due to a play-out delay which means an asynchronous play of audio & video contents. Those problems make a user to be difficult to find the information they want when the media contents are played-out. In this paper, the following techniques and methods were proposed and their performance and superiority were verified through testing: cross-over dual reception buffering technique, advance partition multi-reception buffering technique, and on-demand multi-reception buffering technique, which are designed for effective picking up of information in media content being transmitted in short amount of time using RTSP when a user searches for media, as well as for reduction in playback delay; and same-priority packetization transmission method and priority-based packetization transmission method, which are media data packetization methods for transmission. From the simulation of functional evaluation, we could find that the proposed multiple receiving buffering and packetizing methods are superior, with respect to the media retrieval inclination, to the existing single receiving buffering method by 6-9 points from the viewpoint of effectiveness and excellence. Among them, especially, on-demand multiple receiving buffering technology with same-priority packetization transmission method is able to manage the media search inclination promptly to the requests of users by showing superiority of 3-24 points above compared to other combination methods. In addition, users could find the information they want much quickly since large amount of informations are received in a focused media retrieval period within a short time.

A Study on the Improvement for Medical Service Using Video Promotion Materials for PET/CT Scans (PET/CT 검사에서 동영상 홍보물을 통한 의료서비스 향상에 관한 연구)

  • Kim, Woo Hyun;Kim, Jung Seon;Ko, Hyun Soo;Sung, Ji Hye;Lee, Jeoung Eun
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.17 no.1
    • /
    • pp.30-35
    • /
    • 2013
  • Purpose: One of the current services, providing information to the patients and their guardians by using promotion materials induces positive responses and contributes to the improvement of the hospital reliability. Therefore, the objective of this study is to evaluate the effectiveness of audio visual materials, one of the means of promotion, as a way to give accurate medical information to resolve patient's curiosity about purpose and procedure of their examination and deplete complains about waiting which attributes negative effect to service quality assessment. Materials and Methods: 60 patients(mean age $53.97{\pm}12.24$, male : female = 26 : 34) who had $^{18}F-FDG PET/CT$ scan from July 2012 to August 2012 in Seoul Asan Medical Center were referred to the study. All of the patients having PET/CT scan were asked to watch an informative video material before the injection of radiopharmaceutical ($^{18}F-FDG$) and to fill in a questionnaire. Results: As a result of analyzing the contents of questionnaire, 52% of 60 patients had PET/CT scan for the first time and 72.4% of the patients read the PET/CT guidebook offered from their outpatient department or inpatient wards before their scan. After we searched the level of previous knowledge of the purpose and method of PET/CT scan, the patients answered 25.1% "know well", 34% "not sure", 40.9% "don't know" respectively. And 84.7% of the patients answered that watching the PET/CT guide video before the injection helps understanding what exam they were having and 15.3% of the patients did not. For the question asking ever the patients have experienced using our homepage or smart phone QR code to see the guide video before they visit out PET center, only 3.3% of them answered "yes". Lastly, the patients answered 60.1% "yes", 31.4% "so so" and 8.5% "no" respectively for the question asking whether watching the video makes the patients to fill the waiting time short. Conclusion: It is found that understanding of objective and method of the PET/CT scan and level of satisfaction was improved after the patients watched the guide video whether they had PET/CT scan before and read the PET/CT guidebook or not. Also, watching the video was effective for the reduction of perceptible waiting time. But while displaying the PET/CT guide video is useful for providing information about the scan and shortening the waiting time as one of the medical service, utilization of service was actually very poor because of the passive promotion and indifference of the patients about their examination. Therefore, from now on, it is necessary to construct the healthcare system which can be offered to more patients through the active promotion.

  • PDF

Automatic Speech Style Recognition Through Sentence Sequencing for Speaker Recognition in Bilateral Dialogue Situations (양자 간 대화 상황에서의 화자인식을 위한 문장 시퀀싱 방법을 통한 자동 말투 인식)

  • Kang, Garam;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.2
    • /
    • pp.17-32
    • /
    • 2021
  • Speaker recognition is generally divided into speaker identification and speaker verification. Speaker recognition plays an important function in the automatic voice system, and the importance of speaker recognition technology is becoming more prominent as the recent development of portable devices, voice technology, and audio content fields continue to expand. Previous speaker recognition studies have been conducted with the goal of automatically determining who the speaker is based on voice files and improving accuracy. Speech is an important sociolinguistic subject, and it contains very useful information that reveals the speaker's attitude, conversation intention, and personality, and this can be an important clue to speaker recognition. The final ending used in the speaker's speech determines the type of sentence or has functions and information such as the speaker's intention, psychological attitude, or relationship to the listener. The use of the terminating ending has various probabilities depending on the characteristics of the speaker, so the type and distribution of the terminating ending of a specific unidentified speaker will be helpful in recognizing the speaker. However, there have been few studies that considered speech in the existing text-based speaker recognition, and if speech information is added to the speech signal-based speaker recognition technique, the accuracy of speaker recognition can be further improved. Hence, the purpose of this paper is to propose a novel method using speech style expressed as a sentence-final ending to improve the accuracy of Korean speaker recognition. To this end, a method called sentence sequencing that generates vector values by using the type and frequency of the sentence-final ending appearing in the utterance of a specific person is proposed. To evaluate the performance of the proposed method, learning and performance evaluation were conducted with a actual drama script. The method proposed in this study can be used as a means to improve the performance of Korean speech recognition service.

A Study on the Relationship Between Online Community Characteristics and Loyalty : Focused on Mediating Roles of Self-Congruency, Consumer Experience, and Consumer to Consumer Interactivity (온라인 커뮤니티 특성과 충성도 간의 관계에 대한 연구: 자아일치성, 소비자 체험, 상호작용성의 매개적 역할을 중심으로)

  • Kim, Moon-Tae;Ock, Jung-Won
    • Journal of Global Scholars of Marketing Science
    • /
    • v.18 no.4
    • /
    • pp.157-194
    • /
    • 2008
  • The popularity of communities on the internet has captured the attention of marketing scholars and practitioners. By adapting to the culture of the internet, however, and providing consumer with the ability to interact with one another in addition to the company, businesses can build new and deeper relationships with customers. The economic potential of online communities has been discussed with much hope in the many popular papers. In contrast to this enthusiastic prognostications, empirical and practical evidence regarding the economic potential of the online community has shown a little different conclusion. To date, even communities with high levels of membership and vibrant social arenas have failed to build financial viability. In this perspective, this study investigates the role of various kinds of influencing factors to online community loyalty and basically suggests the framework that explains the process of building purchase loyalty. Even though the importance of building loyalty in an online environment has been emphasized from the marketing theorists and practitioners, there is no sufficient research conclusion about what is the process of building purchase loyalty and the most powerful factors that influence to it. In this study, the process of building purchase loyalty is divided into three levels; characteristics of community site such as content superiority, site vividness, navigation easiness, and customerization, the mediating variables such as self congruency, consumer experience, and consumer to consumer interactivity, and finally various factors about online community loyalty such as visit loyalty, affect, trust, and purchase loyalty are those things. And the findings of this research are as follows. First, consumer-to-consumer interactivity is an important factor to online community purchase loyalty and other loyalty factors. This means, in order to interact with other people more actively, many participants in online community have the willingness to buy some kinds of products such as music, content, avatar, and etc. From this perspective, marketers of online community have to create some online environments in order that consumers can easily interact with other consumers and make some site environments in order that consumer can feel experience in this site is interesting and self congruency is higher than at other community sites. It has been argued that giving consumers a good experience is vital in cyber space, and websites create an active (rather than passive) customer by their nature. Some researchers have tried to pin down the positive experience, with limited success and less empirical support. Web sites can provide a cognitively stimulating experience for the user. We define the online community experience as playfulness based on the past studies. Playfulness is created by the excitement generated through a website's content and measured using three descriptors Marketers can promote using and visiting online communities, which deliver a superior web experience, to influence their customers' attitudes and actions, encouraging high involvement with those communities. Specially, we suggest that transcendent customer experiences(TCEs) which have aspects of flow and/or peak experience, can generate lasting shifts in beliefs and attitudes including subjective self-transformation and facilitate strong consumer's ties to a online community. And we find that website success is closely related to positive website experiences: consumers will spend more time on the site, interacting with other users. As we can see figure 2, visit loyalty and consumer affect toward the online community site didn't directly influence to purchase loyalty. This implies that there may be a little different situations here in online community site compared to online shopping mall studies that shows close relations between revisit intention and purchase intention. There are so many alternative sites on web, consumers do not want to spend money to buy content and etc. In this sense, marketers of community websites must know consumers' affect toward online community site is not a last goal and important factor to influnece consumers' purchase. Third, building good content environment can be a really important marketing tool to create a competitive advantage in cyberspace. For example, Cyworld, Korea's number one community site shows distinctive superiority in the consumer evaluations of content characteristics such as content superiority, site vividness, and customerization. Particularly, comsumer evaluation about customerization was remarkably higher than the other sites. In this point, we can conclude that providing comsumers with good, unique and highly customized content will be urgent and important task directly and indirectly impacting to self congruency, consumer experience, c-to-c interactivity, and various loyalty factors of online community. By creating enjoyable, useful, and unique online community environments, online community portals such as Daum, Naver, and Cyworld are able to build customer loyalty to a degree that many of today's online marketer can only dream of these loyalty, in turn, generates strong economic returns. Another way to build good online community site is to provide consumers with an interactive, fun, experience-oriented or experiential Web site. Elements that can make a dot.com's Web site experiential include graphics, 3-D images, animation, video and audio capabilities. In addition, chat rooms and real-time customer service applications (which link site visitors directly to other visitors, or with company support personnel, respectively) are also being used to make web sites more interactive. Researchers note that online communities are increasingly incorporating such applications in their Web sites, in order to make consumers' online shopping experience more similar to that of an offline store. That is, if consumers are able to experience sensory stimulation (e.g. via 3-D images and audio sound), interact with other consumers (e.g., via chat rooms), and interact with sales or support people (e.g. via a real-time chat interface or e-mail), then they are likely to have a more positive dot.com experience, and develop a more positive image toward the online company itself). Analysts caution, however, that, while high quality graphics, animation and the like may create a fun experience for consumers, when heavily used, they can slow site navigation, resulting in frustrated consumers, who may never return to a site. Consequently, some analysts suggest that, at least with current technology, the rule-of-thumb is that less is more. That is, while graphics etc. can draw consumers to a site, they should be kept to a minimum, so as not to impact negatively on consumers' overall site experience.

  • PDF

Prediction of field failure rate using data mining in the Automotive semiconductor (데이터 마이닝 기법을 이용한 차량용 반도체의 불량률 예측 연구)

  • Yun, Gyungsik;Jung, Hee-Won;Park, Seungbum
    • Journal of Technology Innovation
    • /
    • v.26 no.3
    • /
    • pp.37-68
    • /
    • 2018
  • Since the 20th century, automobiles, which are the most common means of transportation, have been evolving as the use of electronic control devices and automotive semiconductors increases dramatically. Automotive semiconductors are a key component in automotive electronic control devices and are used to provide stability, efficiency of fuel use, and stability of operation to consumers. For example, automotive semiconductors include engines control, technologies for managing electric motors, transmission control units, hybrid vehicle control, start/stop systems, electronic motor control, automotive radar and LIDAR, smart head lamps, head-up displays, lane keeping systems. As such, semiconductors are being applied to almost all electronic control devices that make up an automobile, and they are creating more effects than simply combining mechanical devices. Since automotive semiconductors have a high data rate basically, a microprocessor unit is being used instead of a micro control unit. For example, semiconductors based on ARM processors are being used in telematics, audio/video multi-medias and navigation. Automotive semiconductors require characteristics such as high reliability, durability and long-term supply, considering the period of use of the automobile for more than 10 years. The reliability of automotive semiconductors is directly linked to the safety of automobiles. The semiconductor industry uses JEDEC and AEC standards to evaluate the reliability of automotive semiconductors. In addition, the life expectancy of the product is estimated at the early stage of development and at the early stage of mass production by using the reliability test method and results that are presented as standard in the automobile industry. However, there are limitations in predicting the failure rate caused by various parameters such as customer's various conditions of use and usage time. To overcome these limitations, much research has been done in academia and industry. Among them, researches using data mining techniques have been carried out in many semiconductor fields, but application and research on automotive semiconductors have not yet been studied. In this regard, this study investigates the relationship between data generated during semiconductor assembly and package test process by using data mining technique, and uses data mining technique suitable for predicting potential failure rate using customer bad data.

Utility Estimation of the Application of Auditory-Visual-Tactile Sense Feedback in Respiratory Gated Radiation Therapy (호흡동조방사선치료 시 Real Time Monitor와 Ventilator의 유용성 평가)

  • Jo, Jung Hun;Kim, Byeong Jin;Roh, Shi Won;Lee, Hyeon Chan;Jang, Hyeong Jun;Kim, Hoi Nam;Song, Jae Hun;Kim, Young Jae
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.25 no.1
    • /
    • pp.33-40
    • /
    • 2013
  • Purpose: The purpose of this study was to evaluate the possibility to optimize the gated treatment delivery time and maintenance of stable respiratory by the introduction of breath with the assistance of auditory-visual-tactile sense. Materials and Methods: The experimenter's respiration were measured by ANZAI 4D system. We obtained natural breathing signal, monitor-induced breathing signal, monitor & ventilator-induced breathing signal, and breath-hold signal using real time monitor during 10 minutes beam-on-time. In order to check the stability of respiratory signals distributed in each group were compared with means, standard deviation, variation value, beam_time of the respiratory signal. Results: The stability of each respiratory was measured in consideration of deviation change studied in each respiratory time lapse. As a result of an analysis of respiratory signal, all experimenters has showed that breathing signal used both Real time monitor and Ventilator was the most stable and shortest time. Conclusion: In this study, it was evaluated that respiratory gated radiation therapy with auditory-visual-tactual sense and without auditory-visual-tactual sense feedback. The study showed that respiratory gated radiation therapy delivery time could significantly be improved by the application of video feedback when this is combined with audio-tactual sense assistance. This delivery technique did prove its feasibility to limit the tumor motion during treatment delivery for all patients to a defined value while maintaining the accuracy and proved the applicability of the technique in a conventional clinical schedule.

  • PDF