• Title/Summary/Keyword: Mobile technology

Search Result 6,584, Processing Time 0.035 seconds

Accelerometer-based Gesture Recognition for Robot Interface (로봇 인터페이스 활용을 위한 가속도 센서 기반 제스처 인식)

  • Jang, Min-Su;Cho, Yong-Suk;Kim, Jae-Hong;Sohn, Joo-Chan
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.53-69
    • /
    • 2011
  • Vision and voice-based technologies are commonly utilized for human-robot interaction. But it is widely recognized that the performance of vision and voice-based interaction systems is deteriorated by a large margin in the real-world situations due to environmental and user variances. Human users need to be very cooperative to get reasonable performance, which significantly limits the usability of the vision and voice-based human-robot interaction technologies. As a result, touch screens are still the major medium of human-robot interaction for the real-world applications. To empower the usability of robots for various services, alternative interaction technologies should be developed to complement the problems of vision and voice-based technologies. In this paper, we propose the use of accelerometer-based gesture interface as one of the alternative technologies, because accelerometers are effective in detecting the movements of human body, while their performance is not limited by environmental contexts such as lighting conditions or camera's field-of-view. Moreover, accelerometers are widely available nowadays in many mobile devices. We tackle the problem of classifying acceleration signal patterns of 26 English alphabets, which is one of the essential repertoires for the realization of education services based on robots. Recognizing 26 English handwriting patterns based on accelerometers is a very difficult task to take over because of its large scale of pattern classes and the complexity of each pattern. The most difficult problem that has been undertaken which is similar to our problem was recognizing acceleration signal patterns of 10 handwritten digits. Most previous studies dealt with pattern sets of 8~10 simple and easily distinguishable gestures that are useful for controlling home appliances, computer applications, robots etc. Good features are essential for the success of pattern recognition. To promote the discriminative power upon complex English alphabet patterns, we extracted 'motion trajectories' out of input acceleration signal and used them as the main feature. Investigative experiments showed that classifiers based on trajectory performed 3%~5% better than those with raw features e.g. acceleration signal itself or statistical figures. To minimize the distortion of trajectories, we applied a simple but effective set of smoothing filters and band-pass filters. It is well known that acceleration patterns for the same gesture is very different among different performers. To tackle the problem, online incremental learning is applied for our system to make it adaptive to the users' distinctive motion properties. Our system is based on instance-based learning (IBL) where each training sample is memorized as a reference pattern. Brute-force incremental learning in IBL continuously accumulates reference patterns, which is a problem because it not only slows down the classification but also downgrades the recall performance. Regarding the latter phenomenon, we observed a tendency that as the number of reference patterns grows, some reference patterns contribute more to the false positive classification. Thus, we devised an algorithm for optimizing the reference pattern set based on the positive and negative contribution of each reference pattern. The algorithm is performed periodically to remove reference patterns that have a very low positive contribution or a high negative contribution. Experiments were performed on 6500 gesture patterns collected from 50 adults of 30~50 years old. Each alphabet was performed 5 times per participant using $Nintendo{(R)}$ $Wii^{TM}$ remote. Acceleration signal was sampled in 100hz on 3 axes. Mean recall rate for all the alphabets was 95.48%. Some alphabets recorded very low recall rate and exhibited very high pairwise confusion rate. Major confusion pairs are D(88%) and P(74%), I(81%) and U(75%), N(88%) and W(100%). Though W was recalled perfectly, it contributed much to the false positive classification of N. By comparison with major previous results from VTT (96% for 8 control gestures), CMU (97% for 10 control gestures) and Samsung Electronics(97% for 10 digits and a control gesture), we could find that the performance of our system is superior regarding the number of pattern classes and the complexity of patterns. Using our gesture interaction system, we conducted 2 case studies of robot-based edutainment services. The services were implemented on various robot platforms and mobile devices including $iPhone^{TM}$. The participating children exhibited improved concentration and active reaction on the service with our gesture interface. To prove the effectiveness of our gesture interface, a test was taken by the children after experiencing an English teaching service. The test result showed that those who played with the gesture interface-based robot content marked 10% better score than those with conventional teaching. We conclude that the accelerometer-based gesture interface is a promising technology for flourishing real-world robot-based services and content by complementing the limits of today's conventional interfaces e.g. touch screen, vision and voice.

Electronic Roll Book using Electronic Bracelet.Child Safe-Guarding Device System (전자 팔찌를 이용한 전자 출석부.어린이 보호 장치 시스템)

  • Moon, Seung-Jin;Kim, Tae-Nam;Kim, Pan-Su
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.143-155
    • /
    • 2011
  • Lately electronic tagging policy for the sexual offenders was introduced in order to reduce and prevent sexual offences. However, most sexual offences against children happening these days are committed by the tagged offenders whose identities have been released. So, for the crime prevention, we need measures with which we could minimize the suffers more promptly and actively. This paper suggests a new system to relieve the sexual abuse related anxiety of the children and solve the problems that electronic bracelet has. Existing bracelets are only worn by serious criminals, and it's only for risk management and positioning, there is no way to protect the children who are the potential victims of sexual abuse and there actually happened some cases. So we suggest also letting the students(children) wear the LBS(Location Based Service) and USN(Ubiquitous Sensor Network) technology based electronic bracelets to monitor and figure out dangerous situations intelligently, so that we could prevent sexual offences against children beforehand, and while a crime is happening, we could judge the situation of the crime intelligently and take swift action to minimize the suffer. And by checking students' attendance and position, guardians could know where their children are in real time and could protect the children from not only sexual offences but also violent crimes against children like kidnapping. The overall system is like follows : RFID Tag for children monitors the approach of offenders. While an offender's RFID tag is approaching, it will transmit the situation and position as the first warning message to the control center and the guardians. When the offender is going far away, it turns to monitoring mode, and if the tag of the child or the offender is taken off or the child and offender stay at one position for 3~5 minutes or longer, then it will consider this as a dangerous situation, then transmit the emergency situations and position as the second warning message to the control center and the guardians, and ask for the dispatch of police to prevent the crime at the initial stage. The RFID module of criminals' electronic bracelets is RFID TAG, and the RFID module for the children is RFID receiver(reader), so wherever the offenders are, if an offender is at a place within 20m from a child, RFID module for children will transmit the situation every certain periods to the control center by the automatic response of the receiver. As for the positioning module, outdoors GPS or mobile communications module(CELL module)is used and UWB, WI-FI based module is used indoors. The sensor is set under the purpose of making it possible to measure the position coordinates even indoors, so that one could send his real time situation and position to the server of central control center. By using the RFID electronic roll book system of educational institutions and safety system installed at home, children's position and situation can be checked. When the child leaves for school, attendance can be checked through the electronic roll book, and when school is over the information is sent to the guardians. And using RFID access control turnstiles installed at the apartment or entrance of the house, the arrival of the children could be checked and the information is transmitted to the guardians. If the student is absent or didn't arrive at home, the information of the child is sent to the central control center from the electronic roll book or access control turnstiles, and look for the position of the child's electronic bracelet using GPS or mobile communications module, then send the information to the guardians and teacher so that they could report to the police immediately if necessary. Central management and control system is built under the purpose of monitoring dangerous situations and guardians' checking. It saves the warning and pattern data to figure out the areas with dangerous situation, and could help introduce crime prevention systems like CCTV with the highest priority. And by DB establishment personal data could be saved, the frequency of first and second warnings made, the terminal ID of the specific child and offender, warning made position, situation (like approaching, taken off of the electronic bracelet, same position for a certain time) and so on could be recorded, and the data is going to be used for preventing crimes. Even though we've already introduced electronic tagging to prevent recurrence of child sexual offences, but the crimes continuously occur. So I suggest this system to prevent crimes beforehand concerning the children's safety. If we make electronic bracelets easy to use and carry, and set the price reasonably so that many children can use, then lots of criminals could be prevented and we can protect the children easily. By preventing criminals before happening, it is going to be a helpful system for our safe life.

Evaluation of the Usefulness of Exactrac in Image-guided Radiation Therapy for Head and Neck Cancer (두경부암의 영상유도방사선치료에서 ExacTrac의 유용성 평가)

  • Baek, Min Gyu;Kim, Min Woo;Ha, Se Min;Chae, Jong Pyo;Jo, Guang Sub;Lee, Sang Bong
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.32
    • /
    • pp.7-15
    • /
    • 2020
  • Purpose: In modern radiotherapy technology, several methods of image guided radiation therapy (IGRT) are used to deliver accurate doses to tumor target locations and normal organs, including CBCT (Cone Beam Computed Tomography) and other devices, ExacTrac System, other than CBCT equipped with linear accelerators. In previous studies comparing the two systems, positional errors were analysed rearwards using Offline-view or evaluated only with a Yaw rotation with the X, Y, and Z axes. In this study, when using CBCT and ExacTrac to perform 6 Degree of the Freedom(DoF) Online IGRT in a treatment center with two equipment, the difference between the set-up calibration values seen in each system, the time taken for patient set-up, and the radiation usefulness of the imaging device is evaluated. Materials and Methods: In order to evaluate the difference between mobile calibrations and exposure radiation dose, the glass dosimetry and Rando Phantom were used for 11 cancer patients with head circumference from March to October 2017 in order to assess the difference between mobile calibrations and the time taken from Set-up to shortly before IGRT. CBCT and ExacTrac System were used for IGRT of all patients. An average of 10 CBCT and ExacTrac images were obtained per patient during the total treatment period, and the difference in 6D Online Automation values between the two systems was calculated within the ROI setting. In this case, the area of interest designation in the image obtained from CBCT was fixed to the same anatomical structure as the image obtained through ExacTrac. The difference in positional values for the six axes (SI, AP, LR; Rotation group: Pitch, Roll, Rtn) between the two systems, the total time taken from patient set-up to just before IGRT, and exposure dose were measured and compared respectively with the RandoPhantom. Results: the set-up error in the phantom and patient was less than 1mm in the translation group and less than 1.5° in the rotation group, and the RMS values of all axes except the Rtn value were less than 1mm and 1°. The time taken to correct the set-up error in each system was an average of 256±47.6sec for IGRT using CBCT and 84±3.5sec for ExacTrac, respectively. Radiation exposure dose by IGRT per treatment was measured at 37 times higher than ExacTrac in CBCT and ExacTrac at 2.468mGy and 0.066mGy at Oral Mucosa among the 7 measurement locations in the head and neck area. Conclusion: Through 6D online automatic positioning between the CBCT and ExacTrac systems, the set-up error was found to be less than 1mm, 1.02°, including the patient's movement (random error), as well as the systematic error of the two systems. This error range is considered to be reasonable when considering that the PTV Margin is 3mm during the head and neck IMRT treatment in the present study. However, considering the changes in target and risk organs due to changes in patient weight during the treatment period, it is considered to be appropriately used in combination with CBCT.

Validation of nutrient intake of smartphone application through comparison of photographs before and after meals (식사 전후의 사진 비교를 통한 스마트폰 앱의 영양소섭취량 타당도 평가)

  • Lee, Hyejin;Kim, Eunbin;Kim, Su Hyeon;Lim, Haeun;Park, Yeong Mi;Kang, Joon Ho;Kim, Heewon;Kim, Jinho;Park, Woong-Yang;Park, Seongjin;Kim, Jinki;Yang, Yoon Jung
    • Journal of Nutrition and Health
    • /
    • v.53 no.3
    • /
    • pp.319-328
    • /
    • 2020
  • Purpose: This study was conducted to evaluate the validity of the Gene-Health application in terms of estimating energy and macronutrients. Methods: The subjects were 98 health adults participating in a weight-control intervention study. They recorded their diets in the Gene-Health application, took photographs before and after every meal on the same day, and uploaded them to the Gene-Health application. The amounts of foods and drinks consumed were estimated based on the photographs by trained experts, and the nutrient intakes were calculated using the CAN-Pro 5.0 program, which was named 'Photo Estimation'. The energy and macronutrients estimated from the Gene-Health application were compared with those from a Photo Estimation. The mean differences in energy and macronutrient intakes between the two methods were compared using paired t-test. Results: The mean energy intakes of Gene-Health and Photo Estimation were 1,937.0 kcal and 1,928.3 kcal, respectively. There were no significant differences in intakes of energy, carbohydrate, fat, and energy from fat (%) between two methods. The protein intake and energy from protein (%) of the Gene-Health were higher than those from the Photo Estimation. The energy from carbohydrate (%) for the Photo Estimation was higher than that of the Gene-Health. The Pearson correlation coefficients, weighted Kappa coefficients, and adjacent agreements for energy and macronutrient intakes between the two methods ranged from 0.382 to 0.607, 0.588 to 0.649, and 79.6% to 86.7%, respectively. Conclusion: The Gene-Health application shows acceptable validity as a dietary intake assessment tool for energy and macronutrients. Further studies with female subjects and various age groups will be needed.

Simultaneous Determination and Monitoring of Three Macrolide Antibiotics in Foods by HPLC (Macrolide계 항생물질 동시분석법 확립 및 모니터링)

  • Park, Sang-Ouk;Lee, Sang-Ho;Ahn, Jong-Hoon;Jung, Young-Ji;Kim, Seong-Cheol;Kim, Ji-Yeon;Keum, Eun-Hee;Sung, Ju-Hyun;Kim, Sang-Yub;Jang, Young-Mi;Kang, Chan-Soon
    • Korean Journal of Food Science and Technology
    • /
    • v.42 no.3
    • /
    • pp.287-291
    • /
    • 2010
  • In this study, a simple and rapid pre-treatment method based on liquid extraction was applied for the simultaneous determination of three macrolides (spiramycin, tylosin, and tilmicosin) residues. In these studies, the stock farm products was used as a matrix sample. When the liquid extraction method was compared with the solid phase extraction (SPE) method, the former showed higher recovery percentages and simpler steps than the latter. The macrolids were separated using a reverse-phase C18 ($250\;mm{\times}4.6\;mm$, $5\;{\mu}m$) column and a gradient elution with mobile phases consisting of phosphate buffer (pH 2.5) and acetonitrile. Tylosin and tilmicosin were detected at 288 nm and spiramycin was detected at 232 nm. The average recovery percentage ranged between 83.0-90.2% for samples spiked with the three macrolids at 50 and 100 ng/g The validation results showed that the limit of detection (7 (spiramycin), 12 (tilmiconsin), 12 (tylosin) ng/g)) was under the regulatory tolerances and the linearity from calibration curves was satisfactory for determining the multi-residue of three macrolids in farm products. Monitoring samples were collected at the main cities in Korea as Seoul, Busan, Deajeon, Incheon, Deagu, and Gwangju. Microlide antibiotics were not detected in most samples.

Radiolysis Assessment of $^{18}F$-FDG According to Automatic Synthesis Module (자동합성장치에 따른 $^{18}F$-FDG의 방사선분해 평가)

  • Kim, Si-Hwal;Kim, Dong-Il;Chi, Yong-Gi;Choi, Sung-Wook;Choi, Choon-Ki;Seok, Jae-Dong
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.16 no.1
    • /
    • pp.8-11
    • /
    • 2012
  • Purpose : Among quality control items, the radiochemical impurity must be below 10% of total radioactivity. In this regard, as the recently commercialized automatic synthesis module produces a large amount of 18F-FDG, radiolysis of radiopharmaceuticals is very likely to occur. Thus, this study compared the changes in radiochemical purity regarding radiolysis of $^{18}F$-FDG according to automatic synthesis module. Materials and methods : Cyclotron (PETtrace, GE Healthcare) was used to produce $^{18}F$ and automatic synthesis module (FASTlab, Tracerlab MX, GE Healthcare) was used to achieve synthesis into FDG. For radiochemical purity, Radio-TLC Scanner (AR 2000, Bioscan), GC (Gas Chromatograph, Agilent 7890A) was used to measure the content of ethanol included in $^{18}F$-FDG. Glass board applied with silica gel ($1{\times}10cm$) was used for stationary phase while a mixed liquid formed of acetonitrile and water (ratio 19:1) was used for mobile phase. High-concentration and low-concentration $^{18}F$-FDG were produced in each synthesis module and the radiochemical purity was measured every 2 hours. Results : The purity in low-concentration (below 2.59 GBq/mL) was measured as 99.26%, 98.69%, 98.25%, 98.09% in Tracerlab MX and as 99.09%, 97.83%, 96.89%, 96.62% in FASTlab according to 0, 2, 4, 6 hours changes, respectively. The purity in high-concentration (above 3.7 GBq/mL) was measured as 99.54%, 96.08%, 93.77%, 92.54% in Tracerlab MX and as 99.53%, 95.65%, 92.39%, 89.82% in FASTlab according to 0, 2, 4, 6 hours changes, respectively. Also, ethanol was not detected in GC of $^{18}F$-FDG produced in FASTlab, while 100~300 ppm ethanol was detected in Tracerlab MX. Conclusion : Whereas the change of radiochemical purity was only 3% in low-concentration $^{18}F$-FDG, the change was rapidly increased to 10% in high-concentration. Also, higher radiolysis were observed in $^{18}F$-FDG produced in FASTlab than Tracerlab MX. This is because ethanol is included in the synthesis stage of Tracerlab MX but not in the synthesis stage of FASTlab. Thus, radiolysis is influenced by radioactivity concentration than the inclusion of ethanol, which is the radioprotector. Therefore, after producing high-concentration $^{18}F$-FDG, the content must be diluted through saline to lower concentration.

  • PDF

The Usefulness Evaluation of Radiation Shielding Devices in PET Scan Procedures (PET 검사 프러시저별 방사선 차폐기구의 유용성 평가)

  • Kim, Yeong-Seon;Seo, Myeong-Deok;Lee, Wan-Kyu;Jeong, Yo-Cheon;Kim, Sang-Wook;Seo, Il-Teak;Song, Jae-Beom
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.14 no.2
    • /
    • pp.65-76
    • /
    • 2010
  • Purpose: he use of PET scanners and the number of patient in Korea have been increased for recent several years dramatically. For this reason, technologists have more possibilities to be exposed to the radiation. The hospitals using PET scanners should make an effort to reduce the radiation exposure dose. The purpose of this study was to evaluate the radiation exposure does when using radiation shielding devices. The evaluation was performed through questionnaire survey and experiment. Materials and Methods: First, the technologists who had experience working in PET center in 2008-2009 were surveyed with questionnaire and TLD Figures, personal opinion of utilization of radiation shielding devices are analyzed. Second, we measured the shielding rate of shielding devices which have been using in PET study procedures. We divided the procedures into four steps; distribution, moving, injection of $^{18}F$-FDG and patient setup. Results: First, the results of this survey, using of L-block+Syringe shield, L-block, Syringe shield, No shield during the injection, were each 58.5%, 20%, 9%, 12.3%. The TLD values according to utilization of radiation shield, using both L-block+Syringe Shield and L-block showed the lower TLD values, and Syringe shield only or No shield showed the higher TLD values. Second, the results of experiments according to PET study procedures measured the shielding rates as follows. The shielding rates during the distribution using L-block, L-block+Apron shield were measured 97.4%, 97.7%. The shielding rates during the $^{18}F$-FDG delivery to the injection room using mobile Syringe shield, Syringe holder, Syringe shield carrier were each 81.7%, 98.9%, 99.7%. The shielding rates during the injection using Syringe shield, L-block, L-block+Syringe shield were measured each 51.9%, 98.3%, 98.7%. The shielding rates of Apron were measured in each 30, 60, 90, 120, 150 cm distance. The measurement were each 16.9%, 14.2%, 16.6%, 17.1%, 18.1%, 18.6%. Conclusion: The most effective method for radiation shielding is to using L-block during the $^{18}F$-FDG distribution and Syringe shield carrier during in moving $^{18}F$-FDG. For the $^{18}F$-FDG injection, L-block+Syringe shield have to be used. The shielding effect of Apron has shown average 16.4%. According to the survey of questionnaire, the operators recognized well risk of the radiation exposure but, tended ignore in working. The radiation dose according to recognition of radiation exposure risk was not relevant. but radiation dose according to utilization of radiation shield lower the more use it. The main reason of no use of shielding devices is cumbersome, 55% of the respondents answered. I'm sure, by use of radiation shield in all PET procedure, radiation exposure will be reduced considerably.

  • PDF

A Study of Performance Analysis on Effective Multiple Buffering and Packetizing Method of Multimedia Data for User-Demand Oriented RTSP Based Transmissions Between the PoC Box and a Terminal (PoC Box 단말의 RTSP 운용을 위한 사용자 요구 중심의 효율적인 다중 수신 버퍼링 기법 및 패킷화 방법에 대한 성능 분석에 관한 연구)

  • Bang, Ji-Woong;Kim, Dae-Won
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.1
    • /
    • pp.54-75
    • /
    • 2011
  • PoC(Push-to-talk Over Cellular) is an integrated technology of group voice calls, video calls and internet based multimedia services. If a PoC user can not participate in the PoC session for various reasons such as an emergency situation, lack of battery capacity, then the user can use the PoC Box which has a similar functionality to the MM Box in the MMS(Multimedia Messaging Service). The RTSP(Real-Time Streaming Protocol) method is recommended to be used when there is a transmission session between the PoC box and a terminal. Since the existing VOD service uses a wired network, the packet size of RTSP-based VOD service is huge, however, the PoC service has wireless communication environments which have general characteristics to be used in RTSP method. Packet loss in a wired communication environments is relatively less than that in wireless communication environment, therefore, a buffering latency occurs in PoC service due to a play-out delay which means an asynchronous play of audio & video contents. Those problems make a user to be difficult to find the information they want when the media contents are played-out. In this paper, the following techniques and methods were proposed and their performance and superiority were verified through testing: cross-over dual reception buffering technique, advance partition multi-reception buffering technique, and on-demand multi-reception buffering technique, which are designed for effective picking up of information in media content being transmitted in short amount of time using RTSP when a user searches for media, as well as for reduction in playback delay; and same-priority packetization transmission method and priority-based packetization transmission method, which are media data packetization methods for transmission. From the simulation of functional evaluation, we could find that the proposed multiple receiving buffering and packetizing methods are superior, with respect to the media retrieval inclination, to the existing single receiving buffering method by 6-9 points from the viewpoint of effectiveness and excellence. Among them, especially, on-demand multiple receiving buffering technology with same-priority packetization transmission method is able to manage the media search inclination promptly to the requests of users by showing superiority of 3-24 points above compared to other combination methods. In addition, users could find the information they want much quickly since large amount of informations are received in a focused media retrieval period within a short time.

Improving Curing Rate and Physical Properties of Korean Dendropanax Lacquer with Thermal and Photo Initiator by Dual Curing (이중경화법을 이용한 열개시제 및 광개시제가 배합된 황칠도료의 경화속도 촉진 및 물성향상 연구)

  • Hwang, Hyeon-Deuk;Moon, Je-Ik;Park, Cho-Hee;Kim, Hyun-Joong;Hwang, Baik
    • Journal of the Korean Wood Science and Technology
    • /
    • v.38 no.4
    • /
    • pp.333-340
    • /
    • 2010
  • The Korean Dendropanax lacquer, made from a natural resinous sap from Dendropanax orbifera Lev., was used as a golden and transparent varnish for the traditional artifacts (armor uits, helmets, arrowheads, etc.) to make them be brilliant golden color. The cured film of the acquer has excellent protective properties such as weatherability, water resistance, and nticorrosive. But, one of disadvantages is that takes a long time and much energy to fulfill curing the lacquer. The chemical constituents of the lacquer contained conjugated diene compounds s the photopolymerizable monomers. These monomers easily polymerized in sunlight to form olden-colored, hard-coating films in a short time. Photooxidation may be one of the most mportant reactions in the chemistry of the lacquer. Although the Korean Dendropanax Lacquer hould be dried to a thoroughly dry stage to achieve optimal film properties, curing with elevated emperatures may be required for the protracted curing time at atmospheric temperature. So we ntended to accelerate the curing rate of the lacquer by dual curing of thermal and radiation uring. The effect of thermal initiator on the thermal curing reaction was evaluated by monitoring he changes in double bond peak with FT-IR. Then the curing rate of the lacquer blended with hermal initiator and photoinitiator together was measured during dual curing using a RPT with V spot curing machine. Thermal initiator not only accelerated the curing rate but also improved he physical property. And the curing rate of the Korean Dendropanax lacquer was improved by ual curing method of thermal and UV curing. According to these results, the application area of he Korean Dendropanax lacquer could be expanded to surface coatings for electronic devices uch as mobile phones or electronics.

An Embedding /Extracting Method of Audio Watermark Information for High Quality Stereo Music (고품질 스테레오 음악을 위한 오디오 워터마크 정보 삽입/추출 기술)

  • Bae, Kyungyul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.21-35
    • /
    • 2018
  • Since the introduction of MP3 players, CD recordings have gradually been vanishing, and the music consuming environment of music users is shifting to mobile devices. The introduction of smart devices has increased the utilization of music through music playback, mass storage, and search functions that are integrated into smartphones and tablets. At the time of initial MP3 player supply, the bitrate of the compressed music contents generally was 128 Kbps. However, as increasing of the demand for high quality music, sound quality of 384 Kbps appeared. Recently, music content of FLAC (Free License Audio Codec) format using lossless compression method is becoming popular. The download service of many music sites in Korea has classified by unlimited download with technical protection and limited download without technical protection. Digital Rights Management (DRM) technology is used as a technical protection measure for unlimited download, but it can only be used with authenticated devices that have DRM installed. Even if music purchased by the user, it cannot be used by other devices. On the contrary, in the case of music that is limited in quantity but not technically protected, there is no way to enforce anyone who distributes it, and in the case of high quality music such as FLAC, the loss is greater. In this paper, the author proposes an audio watermarking technology for copyright protection of high quality stereo music. Two kinds of information, "Copyright" and "Copy_free", are generated by using the turbo code. The two watermarks are composed of 9 bytes (72 bits). If turbo code is applied for error correction, the amount of information to be inserted as 222 bits increases. The 222-bit watermark was expanded to 1024 bits to be robust against additional errors and finally used as a watermark to insert into stereo music. Turbo code is a way to recover raw data if the damaged amount is less than 15% even if part of the code is damaged due to attack of watermarked content. It can be extended to 1024 bits or it can find 222 bits from some damaged contents by increasing the probability, the watermark itself has made it more resistant to attack. The proposed algorithm uses quantization in DCT so that watermark can be detected efficiently and SNR can be improved when stereo music is converted into mono. As a result, on average SNR exceeded 40dB, resulting in sound quality improvements of over 10dB over traditional quantization methods. This is a very significant result because it means relatively 10 times improvement in sound quality. In addition, the sample length required for extracting the watermark can be extracted sufficiently if the length is shorter than 1 second, and the watermark can be completely extracted from music samples of less than one second in all of the MP3 compression having a bit rate of 128 Kbps. The conventional quantization method can extract the watermark with a length of only 1/10 compared to the case where the sampling of the 10-second length largely fails to extract the watermark. In this study, since the length of the watermark embedded into music is 72 bits, it provides sufficient capacity to embed necessary information for music. It is enough bits to identify the music distributed all over the world. 272 can identify $4*10^{21}$, so it can be used as an identifier and it can be used for copyright protection of high quality music service. The proposed algorithm can be used not only for high quality audio but also for development of watermarking algorithm in multimedia such as UHD (Ultra High Definition) TV and high-resolution image. In addition, with the development of digital devices, users are demanding high quality music in the music industry, and artificial intelligence assistant is coming along with high quality music and streaming service. The results of this study can be used to protect the rights of copyright holders in these industries.