Search | Korea Science

An Implementation of a System for Video Translation on Window Platform Using OCR (윈도우 기반의 광학문자인식을 이용한 영상 번역 시스템 구현)

Hwang, Sun-Myung;Yeom, Hee-Gyun
- Journal of Internet of Things and Convergence
- /
- v.5 no.2
- /
- pp.15-20
- /
- 2019
As the machine learning research has developed, the field of translation and image analysis such as optical character recognition has made great progress. However, video translation that combines these two is slower than previous developments. In this paper, we develop an image translator that combines existing OCR technology and translation technology and verify its effectiveness. Before developing, we presented what functions are needed to implement this system and how to implement them, and then tested their performance. With the application program developed through this paper, users can access translation more conveniently, and also can contribute to ensuring the convenience provided in any environment.
https://doi.org/10.20465/KIOTS.2019.5.2.015 인용 PDF KSCI

Facial Feature Based Image-to-Image Translation Method

Kang, Shinjin
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.12
- /
- pp.4835-4848
- /
- 2020
The recent expansion of the digital content market is increasing the technical demand for various facial image transformations within the virtual environment. The recent image translation technology enables changes between various domains. However, current image-to-image translation techniques do not provide stable performance through unsupervised learning, especially for shape learning in the face transition field. This is because the face is a highly sensitive feature, and the quality of the resulting image is significantly affected, especially if the transitions in the eyes, nose, and mouth are not effectively performed. We herein propose a new unsupervised method that can transform an in-wild face image into another face style through radical transformation. Specifically, the proposed method applies two face-specific feature loss functions for a generative adversarial network. The proposed technique shows that stable domain conversion to other domains is possible while maintaining the image characteristics in the eyes, nose, and mouth.
https://doi.org/10.3837/tiis.2020.12.012 인용 PDF KSCI HTML

Character-Level Neural Machine Translation (문자 단위의 Neural Machine Translation)

Lee, Changki;Kim, Junseok;Lee, Hyoung-Gyu;Lee, Jaesong
- Annual Conference on Human and Language Technology
- /
- 2015.10a
- /
- pp.115-118
- /
- 2015
Neural Machine Translation (NMT) 모델은 단일 신경망 구조만을 사용하는 End-to-end 방식의 기계번역 모델로, 기존의 Statistical Machine Translation (SMT) 모델에 비해서 높은 성능을 보이고, Feature Engineering이 필요 없으며, 번역 모델 및 언어 모델의 역할을 단일 신경망에서 수행하여 디코더의 구조가 간단하다는 장점이 있다. 그러나 NMT 모델은 출력 언어 사전(Target Vocabulary)의 크기에 비례해서 학습 및 디코딩의 속도가 느려지기 때문에 출력 언어 사전의 크기에 제한을 갖는다는 단점이 있다. 본 논문에서는 NMT 모델의 출력 언어 사전의 크기 제한 문제를 해결하기 위해서, 입력 언어는 단어 단위로 읽고(Encoding) 출력 언어를 문자(Character) 단위로 생성(Decoding)하는 방법을 제안한다. 출력 언어를 문자 단위로 생성하게 되면 NMT 모델의 출력 언어 사전에 모든 문자를 포함할 수 있게 되어 출력 언어의 Out-of-vocabulary(OOV) 문제가 사라지고 출력 언어의 사전 크기가 줄어들어 학습 및 디코딩 속도가 빨라지게 된다. 실험 결과, 본 논문에서 제안한 방법이 영어-일본어 및 한국어-일본어 기계번역에서 기존의 단어 단위의 NMT 모델보다 우수한 성능을 보였다.
PDF

A Novel Character Segmentation Method for Text Images Captured by Cameras

Lue, Hsin-Te;Wen, Ming-Gang;Cheng, Hsu-Yung;Fan, Kuo-Chin;Lin, Chih-Wei;Yu, Chih-Chang
- ETRI Journal
- /
- v.32 no.5
- /
- pp.729-739
- /
- 2010
Due to the rapid development of mobile devices equipped with cameras, instant translation of any text seen in any context is possible. Mobile devices can serve as a translation tool by recognizing the texts presented in the captured scenes. Images captured by cameras will embed more external or unwanted effects which need not to be considered in traditional optical character recognition (OCR). In this paper, we segment a text image captured by mobile devices into individual single characters to facilitate OCR kernel processing. Before proceeding with character segmentation, text detection and text line construction need to be performed in advance. A novel character segmentation method which integrates touched character filters is employed on text images captured by cameras. In addition, periphery features are extracted from the segmented images of touched characters and fed as inputs to support vector machines to calculate the confident values. In our experiment, the accuracy rate of the proposed character segmentation system is 94.90%, which demonstrates the effectiveness of the proposed method.
https://doi.org/10.4218/etrij.10.1510.0086 인용 PDF KSCI

SkelGAN: A Font Image Skeletonization Method

Ko, Debbie Honghee;Hassan, Ammar Ul;Majeed, Saima;Choi, Jaeyoung
- Journal of Information Processing Systems
- /
- v.17 no.1
- /
- pp.1-13
- /
- 2021
In this research, we study the problem of font image skeletonization using an end-to-end deep adversarial network, in contrast with the state-of-the-art methods that use mathematical algorithms. Several studies have been concerned with skeletonization, but a few have utilized deep learning. Further, no study has considered generative models based on deep neural networks for font character skeletonization, which are more delicate than natural objects. In this work, we take a step closer to producing realistic synthesized skeletons of font characters. We consider using an end-to-end deep adversarial network, SkelGAN, for font-image skeletonization, in contrast with the state-of-the-art methods that use mathematical algorithms. The proposed skeleton generator is proved superior to all well-known mathematical skeletonization methods in terms of character structure, including delicate strokes, serifs, and even special styles. Experimental results also demonstrate the dominance of our method against the state-of-the-art supervised image-to-image translation method in font character skeletonization task.
https://doi.org/10.3745/JIPS.02.0152 인용 PDF KSCI

Development of Korean-to-English and English-to-Korean Mobile Translator for Smartphone (스마트폰용 영한, 한영 모바일 번역기 개발)

Yuh, Sang-Hwa;Chae, Heung-Seok
- Journal of the Korea Society of Computer and Information
- /
- v.16 no.3
- /
- pp.229-236
- /
- 2011
In this paper we present light weighted English-to-Korean and Korean-to-English mobile translators on smart phones. For natural translation and higher translation quality, translation engines are hybridized with Translation Memory (TM) and Rule-based translation engine. In order to maximize the usability of the system, we combined an Optical Character Recognition (OCR) engine and Text-to-Speech (TTS) engine as a Front-End and Back-end of the mobile translators. With the BLEU and NIST evaluation metrics, the experimental results show our E-K and K-E mobile translation equality reach 72.4% and 77.7% of Google translators, respectively. This shows the quality of our mobile translators almost reaches the that of server-based machine translation to show its commercial usefulness.
https://doi.org/10.9708/jksci.2011.16.3.229 인용 PDF KSCI

Study on Explicitation Strategy in English-Korean Game Translation A Case Study of 'League of Legends' - (영한 게임 번역에서의 명시화에 관한 고찰 게임 '리그 오브 레전드'를 중심으로 -)

Kim, Hong-kyun
- Journal of Korea Game Society
- /
- v.21 no.3
- /
- pp.117-132
- /
- 2021
This paper investigates how information game users needs to play game is offered to game user by applying the notion of explicitation toward translated game texts. By using League of Legends' Character lines, Character Abilities and Equipment Description texts as a case, this paper focused on how 'Insertion(addition)' and 'Replacement' method are applied toward game translation and which information is being explicitated. As a result, this paper found out that translation on Player vs. Player genre game, explicitation occurs by adding or replacing words containing information needed, and information about game control was prioritized among other information related with game universe and culture.
https://doi.org/10.7583/JKGS.2021.21.3.117 인용 PDF KSCI

Jungian Character Network in Growing Other Character Archetypes in Films

Han, Youngsue
- International Journal of Contents
- /
- v.15 no.2
- /
- pp.13-19
- /
- 2019
This research demonstrates a clear visual outline of character influence-relations in creating Jungian character archetypes in films using R computational technology. It contributes to the integration of Jungian analytical psychology into film studies by revealing character network relations in film. This paper handles character archetypes and their influence on developing other character archetypes in films in regards to network analysis drawn from Lynn Schmidt's analysis of 45 master characters in films. Additionally, this paper conducts a character network analysis visualization experiment using R open-source software to create an easily reproducible tutorial for scholars in humanities. This research is a pioneering work that could trigger the academic communities in humanities to actively adopt data science in their research and education.
https://doi.org/10.5392/IJoC.2019.15.2.013 인용 PDF KSCI HTML

Translation,Scale,and Rotation-Invariant Hangul Character Recognition with Ring Projection (고리 투영을 이용한 위치,크기 및 회전 변형에 무관한 한글 문자 인식)

이성환;박희선
- Korean Journal of Cognitive Science
- /
- v.3 no.1
- /
- pp.139-160
- /
- 1991

A Unicode based Deep Handwritten Character Recognition model for Telugu to English Language Translation

BV Subba Rao;J. Nageswara Rao;Bandi Vamsi;Venkata Nagaraju Thatha;Katta Subba Rao
- International Journal of Computer Science & Network Security
- /
- v.24 no.2
- /
- pp.101-112
- /
- 2024
Telugu language is considered as fourth most used language in India especially in the regions of Andhra Pradesh, Telangana, Karnataka etc. In international recognized countries also, Telugu is widely growing spoken language. This language comprises of different dependent and independent vowels, consonants and digits. In this aspect, the enhancement of Telugu Handwritten Character Recognition (HCR) has not been propagated. HCR is a neural network technique of converting a documented image to edited text one which can be used for many other applications. This reduces time and effort without starting over from the beginning every time. In this work, a Unicode based Handwritten Character Recognition(U-HCR) is developed for translating the handwritten Telugu characters into English language. With the use of Centre of Gravity (CG) in our model we can easily divide a compound character into individual character with the help of Unicode values. For training this model, we have used both online and offline Telugu character datasets. To extract the features in the scanned image we used convolutional neural network along with Machine Learning classifiers like Random Forest and Support Vector Machine. Stochastic Gradient Descent (SGD), Root Mean Square Propagation (RMS-P) and Adaptative Moment Estimation (ADAM)optimizers are used in this work to enhance the performance of U-HCR and to reduce the loss function value. This loss value reduction can be possible with optimizers by using CNN. In both online and offline datasets, proposed model showed promising results by maintaining the accuracies with 90.28% for SGD, 96.97% for RMS-P and 93.57% for ADAM respectively.
https://doi.org/10.22937/IJCSNS.2024.24.2.12 인용 PDF

Search Result 63, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)