• Title/Summary/Keyword: 2D Video

Search Result 910, Processing Time 0.032 seconds

Development of a Emergency Situation Detection Algorithm Using a Vehicle Dash Cam (차량 단말기 기반 돌발상황 검지 알고리즘 개발)

  • Sanghyun Lee;Jinyoung Kim;Jongmin Noh;Hwanpil Lee;Soomok Lee;Ilsoo Yun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.4
    • /
    • pp.97-113
    • /
    • 2023
  • Swift and appropriate responses in emergency situations like objects falling on the road can bring convenience to road users and effectively reduces secondary traffic accidents. In Korea, current intelligent transportation system (ITS)-based detection systems for emergency road situations mainly rely on loop detectors and CCTV cameras, which only capture road data within detection range of the equipment. Therefore, a new detection method is needed to identify emergency situations in spatially shaded areas that existing ITS detection systems cannot reach. In this study, we propose a ResNet-based algorithm that detects and classifies emergency situations from vehicle camera footage. We collected front-view driving videos recorded on Korean highways, labeling each video by defining the type of emergency, and training the proposed algorithm with the data.

Hardware Implementation of Real-Time Blind Watermarking by Substituting Bitplanes of Wavelet DC Coefficients (웨이블릿 DC 계수의 비트평면 치환방법에 의한 실시간 블라인드 워터마킹 및 하드웨어 구현)

  • 서영호;김동욱
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.3C
    • /
    • pp.398-407
    • /
    • 2004
  • In this paper, a blind watermarking method which is suitable to the video compression using 2-D discrete wavelet transform was proposed and implemented into the hardware using VHDL(VHSIC Hardware Description Language). The goal of the proposed watermarking algorithm is the authentication about the manipulation of the watermark embedded image and the detection of the error positions. Considering the compressed video image, the proposed watermarking scheme is unrelated to the quantization and is able to concurrently embed or extract the watermark. We experimentally verified that the lowest frequency subband(LL4) is not sensitive to the change in the spatial domain, so LL4 subband was selected for the mark space. And the combination of the bitplanes which has the properties of both the minimum degradation of the image and the robustness was chosen as the embedded Point in the mark space in LL4 subband. Since we know the watermark embedded positions and the watermark is embedded by not varying the value but changing the value, the watermark can be extracted without the original image. Also, for the security when exposing the watermark embedded position, we embed the encrypted watermark by the block cipher. The proposed watermark algorithm shows the robustness against the general image manipulation and is easily transplanted into the image or video compressor with the minimal changing in the structure. The designed hardware has 4037 LABs(24%) and 85 ESBs(3%) in APEX20KC EP20K400CF672C7 FPGA of Altera and stably operates in 82MHz clock frequency.

Efficacy of sucrose application in minimizing pain perception related to dental injection in children aged 3 to 9 years: a randomized control trial

  • Ishani Ratnaparkhi;Jasmin Winnier;Divya Shetty;Sanjana R. Kodical;Reema Manoj;Shilpa S Naik
    • Journal of Dental Anesthesia and Pain Medicine
    • /
    • v.24 no.2
    • /
    • pp.109-117
    • /
    • 2024
  • Background: Dental fear and anxiety are significant challenges in managing behavior in children. Oral administration of sucrose or sweet-tasting solutions has shown effectiveness in reducing procedural pain in infants and neonates. This study aimed to investigate whether pre-application of sucrose solution had an effect on minimizing pain perception during injection and to assess the potential impact of the child's age and sweet preference. Methods: A randomized control clinical trial was conducted on 60 children aged 3-9 years requiring buccal infiltration injections. Following parental consent, demographic data of the children were recorded. Sweet preferences was assessed using a modified forced-choice test. Children were equally and randomly allocated into study (sucrose) and control groups using a lottery method. Sucrose solution or distilled water, respectively, was applied to the lateral surface of the tongue for 2 min. Topical anesthetic was applied at the site of injection, followed by local anesthesia administration. The children rinsed their mouths thrice with water immediately after anesthetic injection. A video was recorded during injection which was then scored by three blinded examiners on the Sound Eye Motor (SEM) scale. The children also self-evaluated using Wong-Baker Faces Pain Rating Scale (WBFPS). Results: The mean SEM scores and WBFPS scores were analyzed using the Kruskall-Wallis test. The mean SEM score in the study group was 1.37 ± 0.61, compared to 3.17 ± 0.87 in the control group, showing a statistically significant difference (P < 0.001). Mean pain scores assessed by WBFPS in the study group were 0.60 ± 1.4, while in the control group, they were 6.27 ± 2.33, also showing a statistically significant difference (P < 0.001). Children with a sweet preference demonstrated a subjective reduction in pain perception. Conclusion: Application of sucrose before dental injections in children helps to minimize pain upon injection across all age groups.

Dual Codec Based Joint Bit Rate Control Scheme for Terrestrial Stereoscopic 3DTV Broadcast (지상파 스테레오스코픽 3DTV 방송을 위한 이종 부호화기 기반 합동 비트율 제어 연구)

  • Chang, Yong-Jun;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.16 no.2
    • /
    • pp.216-225
    • /
    • 2011
  • Following the proliferation of three-dimensional video contents and displays, many terrestrial broadcasting companies have been preparing for stereoscopic 3DTV service. In terrestrial stereoscopic broadcast, it is a difficult task to code and transmit two video sequences while sustaining as high quality as 2DTV broadcast due to the limited bandwidth defined by the existing digital TV standards such as ATSC. Thus, a terrestrial 3DTV broadcasting with a heterogeneous video codec system, where the left image and right images are based on MPEG-2 and H.264/AVC, respectively, is considered in order to achieve both high quality broadcasting service and compatibility for the existing 2DTV viewers. Without significant change in the current terrestrial broadcasting systems, we propose a joint rate control scheme for stereoscopic 3DTV service based on the heterogeneous dual codec systems. The proposed joint rate control scheme applies to the MPEG-2 encoder a quadratic rate-quantization model which is adopted in the H.264/AVC. Then the controller is designed for the sum of the left and right bitstreams to meet the bandwidth requirement of broadcasting standards while the sum of image distortions is minimized by adjusting quantization parameter obtained from the proposed optimization scheme. Besides, we consider a condition on maintaining quality difference between the left and right images around a desired level in the optimization in order to mitigate negative effects on human visual system. Experimental results demonstrate that the proposed bit rate control scheme outperforms the rate control method where each video coding standard uses its own bit rate control algorithm independently in terms of the increase in PSNR by 2.02%, the decrease in the average absolute quality difference by 77.6% and the reduction in the variance of the quality difference by 74.38%.

4-way Search Window for Improving The Memory Bandwidth of High-performance 2D PE Architecture in H.264 Motion Estimation (H.264 움직임추정에서 고속 2D PE 아키텍처의 메모리대역폭 개선을 위한 4-방향 검색윈도우)

  • Ko, Byung-Soo;Kong, Jin-Hyeung
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.46 no.6
    • /
    • pp.6-15
    • /
    • 2009
  • In this paper, a new 4-way search window is designed for the high-performance 2D PE architecture in H.264 Motion Estimation(ME) to improve the memory bandwidth. While existing 2D PE architectures reuse the overlapped data of adjacent search windows scanned in 1 or 3-way, the new window utilizes the overlapped data of adjacent search windows as well as adjacent multiple scanning (window) paths to enhance the reusage of retrieved search window data. In order to scan adjacent windows and multiple paths instead of single raster and zigzag scanning of adjacent windows, bidirectional row and column window scanning results in the 4-way(up. down, left, right) search window. The proposed 4-way search window could improve the reuse of overlapped window data to reduce the redundancy access factor by 3.1, though the 1/3-way search window redundantly requires $7.7{\sim}11$ times of data retrieval. Thus, the new 4-way search window scheme enhances the memory bandwidth by $70{\sim}58%$ compared with 1/3-way search window. The 2D PE architecture in H.264 ME for 4-way search window consists of $16{\times}16$ pe array. computing the absolute difference between current and reference frames, and $5{\times}16$ reusage array, storing the overlapped data of adjacent search windows and multiple scanning paths. The reference data could be loaded upward and downward into the new 2D PE depending on scanning direction, and the reusage array is combined with the pe array rotating left as well as right to utilize the overlapped data of adjacent multiple scan paths. In experiments, the new implementation of 4-way search window on Magnachip 0.18um could deal with the HD($1280{\times}720$) video of 1 reference frame, $48{\times}48$ search area and $16{\times}16$ macroblock by 30fps at 149.25MHz.

Comparison of Nasalance Score Between Glottal and Oral Articulation in Children with Velopharyngeal Insufficiency (연인두 폐쇄부전 아동의 보상조음과 정조음에서의 비음치 비교)

  • Lee, Eun-Kyung;Son, Young-Ik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.18 no.2
    • /
    • pp.129-133
    • /
    • 2007
  • Background and Objectives: Nasometry is an easy, noninvasive method to obtain objective data regarding the function of velopharynx. However, because articulation errors may affect the results of nasometry, the examiner should interpret the nasalance score based on appropriate speech stimuli. The purpose of this study is to examine the difference of nasalance score between glottal and oral articulations in patients with velopharyngeal insufficiency (VPI). Materials and Method: Nineteen children between 3.4 and 12.1 years of age (mean age 5.7 years) with a confirmed VPl showing hypernasality and articulation errors (glottal stops) were included. Nasalance scores were obtained for two speech patterns of glottal and oral stops. In addition, the velopharyngeal functions were analyzed in four subjects using video nasopharyngoscopy. Results: The $mean{\pm}S.D$ nasalance scores of the glottal stops and oral stops were $42.54{\pm}16.26%$ and $25.47{\pm}16.51%$ respectively (p=.000). Six of 19 patients achieved normal nasalance scores when glottal stops changed to oral stops by the trial speech therapy. Video nasopharyngoscope confirmed that large velopharyngeal gaps can be decreased into tiny gaps or complete closure when compensatory articulations were corrected for some cases. Conclusion: Compensatory articulation errors must be corrected for the reliable interpretation of the nasalance scores that are obtained in children with velopharyngeal insufficiency, which would facilitate to make a better decision for further management of these patients.

  • PDF

The Methodology of the Golf Swing Similarity Measurement Using Deep Learning-Based 2D Pose Estimation

  • Jonghyuk, Park
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.1
    • /
    • pp.39-47
    • /
    • 2023
  • In this paper, we propose a method to measure the similarity between golf swings in videos. As it is known that deep learning-based artificial intelligence technology is effective in the field of computer vision, attempts to utilize artificial intelligence in video-based sports data analysis are increasing. In this study, the joint coordinates of a person in a golf swing video were obtained using a deep learning-based pose estimation model, and based on this, the similarity of each swing segment was measured. For the evaluation of the proposed method, driver swing videos from the GolfDB dataset were used. As a result of measuring swing similarity by pairing swing videos of a total of 36 players, 26 players evaluated that their other swing sequence was the most similar, and the average ranking of similarity was confirmed to be about 5th. This ensured that the similarity could be measured in detail even when the motion was performed similarly.

Image Processing Software Development for Detection of Oyster Hinge Lines (굴의 힌지 선 감지를 위한 영상처리 소프트웨어의 개발)

  • So, J.D.;Wheaton, Fred W.
    • Journal of Biosystems Engineering
    • /
    • v.22 no.2
    • /
    • pp.237-246
    • /
    • 1997
  • Shucking(removing the meat from the shell) an oyster requires that the muscle attachments to the two shell valves and the hinge be severed. Described here is the computer vision software needed to locate the oyster hinge line so it can be automatically severed, one step in development of an automated oyster shucker. Oysters are first prepared by washing and trimming off a small shell piece on the oyster hinge end to provide access to the outer hinge surface. A computer vision system employing a color video comera then gabs an image of the hinge end of the oyster shell. This image is Processed by the computer using software. The software is a combination of commercially available and custom written routines that locate the oyster hinge. The software uses four feature variables, circularity, rectangularity, aspect-ration, and Euclidian distance, to distinguish the hinge object from other dark colored objects on the hinge end of the oyster. Several techniques, including shrink-expand, thresholding, and others, were used to secure an image that could be reliably and efficiently processed to locate the oyster hinge line.

  • PDF

A Mismatch-Insensitive 12b 60MS/s 0.18um CMOS Flash-SAR ADC (소자 부정합에 덜 민감한 12비트 60MS/s 0.18um CMOS Flash-SAR ADC)

  • Byun, Jae-Hyeok;Kim, Won-Kang;Park, Jun-Sang;Lee, Seung-Hoon
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.7
    • /
    • pp.17-26
    • /
    • 2016
  • This work proposes a 12b 60MS/s 0.18um CMOS Flash-SAR ADC for various systems such as wireless communications and portable video processing systems. The proposed Flash-SAR ADC alleviates the weakness of a conventional SAR ADC that the operation speed proportionally increases with a resolution by deciding upper 4bits first with a high-speed flash ADC before deciding lower 9bits with a low-power SAR ADC. The proposed ADC removes a sampling-time mismatch by using the C-R DAC in the SAR ADC as the combined sampling network instead of a T/H circuit which restricts a high speed operation. An interpolation technique implemented in the flash ADC halves the required number of pre-amplifiers, while a switched-bias power reduction scheme minimizes the power consumption of the flash ADC during the SAR operation. The TSPC based D-flip flop in the SAR logic for high-speed operation reduces the propagation delay by 55% and the required number of transistors by half compared to the conventional static D-flip flop. The prototype ADC in a 0.18um CMOS demonstrates a measured DNL and INL within 1.33LSB and 1.90LSB, with a maximum SNDR and SFDR of 58.27dB and 69.29dB at 60MS/s, respectively. The ADC occupies an active die area of $0.54mm^2$ and consumes 5.4mW at a 1.8V supply.

A 12b 100MS/s 1V 24mW 0.13um CMOS ADC for Low-Power Mobile Applications (저전력 모바일 응용을 위한 12비트 100MS/s 1V 24mW 0.13um CMOS A/D 변환기)

  • Park, Seung-Jae;Koo, Byeong-Woo;Lee, Seung-Hoon
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.47 no.8
    • /
    • pp.56-63
    • /
    • 2010
  • This work proposes a 12b 100MS/s 0.13um CMOS pipeline ADC for battery-powered mobile video applications such as DVB-Handheld (DVB-H), DVB-Terrestrial (DVB-T), Satellite DMB (SDMB), and Terrestrial DMB (TDMB) requiring high resolution, low power, and small size at high speed. The proposed ADC employs a three-step pipeline architecture to optimize power consumption and chip area at the target resolution and sampling rate. A single shared and switched op-amp for two MDACs removes a memory effect and a switching time delay, resulting in a fast signal settling. A two-step reference selection scheme for the last-stage 6b FLASH ADC reduces power consumption and chip area by 50%. The prototype ADC in a 0.13um 1P7M CMOS technology demonstrates a measured DNL and INL within 0.40LSB and 1.79LSB, respectively. The ADC shows a maximum SNDR of 60.0dB and a maximum SFDR of 72.4dB at 100MS/s, respectively. The ADC with an active die area of 0.92 $mm^2$ consumes 24mW at 1.0V and 100MS/s. The FOM, power/($f_s{\times}2^{ENOB}$), of 0.29pJ/conv. is the lowest of ever reported 12b 100MS/s ADCs.