Search | Korea Science

Candidate Word List and Probability Score Guided for Korean Scene Text Recognition (후보 단어 리스트와 확률 점수에 기반한 한국어 문자 인식 모델)

Lee, Yoonji;Lee, Jong-Min
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.73-75
- /
- 2022
Scene Text Recognition is a technology used in the field of artificial intelligence that requires manless robot, automatic vehicles and human-computer interaction. Though scene text images are distorted by noise interference, such as illumination, low resolution and blurring. Unlike previous studies that recognized only English, this paper shows a strong recognition accuracy including various characters, English, Korean, special character and numbers. Instead of selecting only one class having the highest probability value, a candidate word can be generated by considering the probability value of the second rank as well, thus a method can be corrected an existing language misrecognition problem.
PDF

Ship Number Recognition Method Based on An improved CRNN Model

Wenqi Xu;Yuesheng Liu;Ziyang Zhong;Yang Chen;Jinfeng Xia;Yunjie Chen
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.17 no.3
- /
- pp.740-753
- /
- 2023
Text recognition in natural scene images is a challenging problem in computer vision. The accurate identification of ship number characters can effectively improve the level of ship traffic management. However, due to the blurring caused by motion and text occlusion, the accuracy of ship number recognition is difficult to meet the actual requirements. To solve these problems, this paper proposes a dual-branch network based on the CRNN identification network. The network couples image restoration and character recognition. The CycleGAN module is used for blur restoration branch, and the Pix2pix module is used for character occlusion branch. The two are coupled to reduce the impact of image blur and occlusion. Input the recovered image into the text recognition branch to improve the recognition accuracy. After a lot of experiments, the model is robust and easy to train. Experiments on CTW datasets and real ship maps illustrate that our method can get more accurate results.
https://doi.org/10.3837/tiis.2023.03.004 인용 PDF HTML

Study on the Influence of the Fourth Wall on the Player's Gaming Experience in Side-Scrolling Games

Qi Yi;Jeanhun Chung
- International Journal of Internet, Broadcasting and Communication
- /
- v.15 no.2
- /
- pp.118-123
- /
- 2023
With the continuous development of emerging technologies represented by VR technology, many game developers are declaring that they are constantly trying to break the "fourth wall" and break the boundaries between virtual and reality to create game immersion for players. new game. But for many gamers, a strong sense of immersion is not the focus of their pursuit. The sense of control and safe exploration during the game is also the game experience that many gamers are pursuing. Moreover, there is ambiguity in the definition of the concept of breaking the fourth wall in the field of academic theory. The purpose of breaking the fourth wall was to separate the real world from the virtual world, to remind the audience that the actors and the audience are in two different worlds, and to trigger the audience's thinking about drama and deeper philosophy. But in the current game, it has become a blurring of the boundary between virtual and reality, pulling players into the virtual world, and focusing on the immersive experience. In this paper, we will first sort out the concept of "breaking the fourth wall", and then conduct a comparative analysis of horizontal scroll games and VR games, and conclude that the "fourth wall" has an impact on players Great conclusion.
https://doi.org/10.7236/IJIBC.2023.15.2.118 인용 PDF

AWGN Removal Algorithm using Switching Fuzzy Function and Weight (스위칭 퍼지 함수와 가중치를 사용한 AWGN 제거 알고리즘)

Cheon, Bong-Won;Kim, Nam-Ho
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.10a
- /
- pp.121-123
- /
- 2021
Image processing is being used in various forms in important fields of the 4th industrial revolution, such as artificial intelligence, smart factories, and the IoT industry. In particular, in systems that require data processing such as object tracking, medical images, and object recognition, noise removal is used as a preprocessing step, but the existing algorithm has a drawback in that blurring occurs in the filtering process. Therefore, in this paper, we propose a filter algorithm using switching fuzzy weights. The proposed algorithm switches the fuzzy function by dividing the low-frequency region and the high-frequency region by the standard deviation of the filtering mask, and obtains the final output according to the fuzzy weight. The proposed algorithm showed improved results compared to the existing method, and showed excellent characteristics in the region where the high-frequency component is strong.
PDF

Design of a Contactless Access Security System using Palm Creases and Palm Vein Pattern Matching (손금과 정맥혈관 패턴매칭을 이용한 비접촉 출입 보안시스템 설계)

Ki-Jung Kim
- The Journal of the Korea institute of electronic communication sciences
- /
- v.19 no.1
- /
- pp.327-334
- /
- 2024
In this paper, we developed a system with a near-infrared LED light source with a wavelength of 950nm to acquire palm vein images and a white LED light source to acquire palm creases based on Raspberry Pi. In addition, we implemented a unique pattern-extractable image processing technology that can prevent counterfeiting and enhance security of mixed creases and palmprints through image pre-processing (Gray scaling, Histogram Equalization, Blurring, Thresholding, Thinning) for the acquired vein and palm images, and secured a source technology that can be used in a security-enhanced system.
https://doi.org/10.13067/JKIECS.2024.19.1.327 인용 PDF

Newly-designed adaptive non-blind deconvolution with structural similarity index in single-photon emission computed tomography

Kyuseok Kim;Youngjin Lee
- Nuclear Engineering and Technology
- /
- v.55 no.12
- /
- pp.4591-4596
- /
- 2023
Single-photon emission computed tomography SPECT image reconstruction methods have a significant influence on image quality, with filtered back projection (FBP) and ordered subset expectation maximization (OSEM) being the most commonly used methods. In this study, we proposed newly-designed adaptive non-blind deconvolution with a structural similarity (SSIM) index that can take advantage of the FBP and OSEM image reconstruction methods. After acquiring brain SPECT images, the proposed image was obtained using an algorithm that applied the SSIM metric, defined by predicting the distribution and amount of blurring. As a result of the contrast to noise ratio (CNR) and coefficient of variation evaluation (COV), the resulting image of the proposed algorithm showed a similar trend in spatial resolution to that of FBP, while obtaining values similar to those of OSEM. In addition, we confirmed that the CNR and COV values of the proposed algorithm improved by approximately 1.69 and 1.59 times, respectively, compared with those of the algorithm involving an inappropriate deblurring process. To summarize, we proposed a new type of algorithm that combines the advantages of SPECT image reconstruction techniques and is expected to be applicable in various fields.
https://doi.org/10.1016/j.net.2023.08.042 인용 PDF

Media Experience in multi-Layered Space through Media Art (미디어아트 기반 다층공간에서의 미디어 경험)

Kang, Yoon Jeong
- The Journal of the Convergence on Culture Technology
- /
- v.10 no.3
- /
- pp.635-641
- /
- 2024
Based on Merleau-Ponty and Lakoff's research on the inseparability of body and mind, this thesis seeks to explore the impact of experiences in multi-layered spaces based on media art on the understanding of human existence. The sensory experience provided by media art strengthens the viewer's physical presence and induces a new perception of reality by blurring the boundaries between reality and virtuality. Through this, it shows that it is possible to newly recognize and explore the deep relationship between human existence and the world. This paper analyzes how media art can be an important means of expanding existential experience through the connection between body and mind, and explains how the combination of art and technology contributes to ontological understanding.
https://doi.org/10.17703/JCCT.2024.10.3.635 인용 PDF

Hologram Watermarking Using Fresnel Diffraction Model (Fresnel 회절 모델을 이용한 홀로그램 워터마킹)

Lee, Yoon-Hyuk;Seo, Young-Ho;Kim, Dong-Wook
- Journal of Broadcast Engineering
- /
- v.19 no.5
- /
- pp.606-615
- /
- 2014
This paper is to propose an algorithm for digital hologram watermarking by using a characteristic of the Fresnel diffraction model in 2D image. When 2D image is applied Fresnel transform, the result concentrates center region. When applied to a hologram, on the other hand, the result focused diffraction pattern of 2D form. Using this characteristic, to generate diffraction model by applying 2-th Fresnel transform to the hologram. Corner of diffraction model is mark space. This mark space is embedded watermark and extracted watermark. Experimental results showed that all the extracted watermarks after several kinds of attacks (Gaussian blurring, Sharpening, JPEG compression) showed visibilities good enough to be recognized to insist the ownership of the hologram.
https://doi.org/10.5909/JBE.2014.19.5.606 인용 PDF KSCI KPUBS

A STUDY ON THE APPLICATION OF DYNAMIC TOMOGRAM OF THE HUMAN HEAD (인체 두부에서 Dynamic Tomogram의 응용에 관한 연구)

Choi Eui Whan;Kim Jae Duk
- Journal of Korean Academy of Oral and Maxillofacial Radiology
- /
- v.21 no.2
- /
- pp.317-326
- /
- 1991
The purpose of this study was to establish the principle and the clinical application of dynamic tomogram of a human head by using the dental machine. For this study, a block of wax with details lying at three parallel planes and a human dry skull were used. This experiment was reexamined the dynamic tomogram with specialized radiographic device and view box, and the radiograms taken by the change of exposure time according to the numbers of film used in x-ray taking and taken according to the change of kVp and the types of film were analyzed density with the densitometer. From this study, the obtained results were as follows: 1. When the underexposed radiograms taken by angulation of clockwise and counter-clockwise direction of the film and skull. were superimposed and moved laterally, it was possible to focus on right and left jaws and teeth. 2. The superimposition of the two underexposed radiograms according to each condition of x-ray taking showed some differencies in density visually, and the measurement of density with the densitometer was 1.23 to 1.57 in 75kVp and 1.34 to 1.70 in 90kVp. 3. The superimposition of the two underexposed radiograms according to the kinds of x-ray film showed almost equal density visually, and the measurement of density with the densiometer was 1.34 to 1.37. 4. When seven radiograms taken by each condition of x-ray taking were superimposed on the view box, a intense rear light of view box didn't transilluminate film density regardless of the conditions of x-ray taking. Even though seven radiograms taken according to types of film were superimposed on the view box, a more intense rear light of view box was required to transilluminate total density of films. 6. Long film-object distance resulted in the enlargement and blurring of radiographic images.
PDF

Error Concealment of MPEG-2 Intra Frames by Spatiotemporal Information of Inter Frames (인터 프레임의 시공간적 정보를 이용한 MPEG-2 인트라 프레임의 오류 은닉)

Kang, Min-Jung;Ryu, Chul
- Journal of the Institute of Convergence Signal Processing
- /
- v.4 no.2
- /
- pp.31-39
- /
- 2003
The MPEG-2 source coding algorithm is very sensitive to transmission errors due to using of variable-length coding. When the compressed data are transmitted, transmission errors are generated and error correction scheme is not able to be corrected well them. In the decoder error concealment (EC) techniques must be used to conceal errors and it is able to minimize degradation of video quality. The proposed algorithm is method to conceal successive macroblock errors of I-frame and utilize temporal information of B-frame and spatial information of P-frame In the previous GOP which is temporally the nearest location to I-frame. This method can improve motion distortion and blurring by temporal and spatial errors which cause at existing error concealment techniques. In network where the violent transmission errors occur, we can conceal more efficiently severe slice errors. This algorithm is Peformed in MPEG-2 video codec and Prove that we can conceal efficiently slice errors of I-frame compared with other approaches by simulations.
PDF

Search Result 442, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)