• Title/Summary/Keyword: character recognition

Search Result 992, Processing Time 0.025 seconds

Study for the Pseudonymization Technique of Medical Image Data (의료 이미지 데이터의 비식별화 방안에 관한 연구)

  • Baek, Jongil;Song, Kyoungtaek;Choi, Wonkyun;Yu, Khiguen;Lee, Pilwoo;In, Hanjin;Kim, Cheoljung;Yeo, Kwangsoo;Kim, Soonseok
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.6 no.6
    • /
    • pp.103-110
    • /
    • 2016
  • The recent frequent cases of damage due to leakage of medical data and the privacy of medical patients is increasing day by day. The government says the Privacy Rule regulations established for these victims, such as prevention. Medical data guidelines can be seen 'national medical privacy guidelines' is only released. When replacing the image data between the institutions it has been included in the image file (JPG, JPEG, TIFF) there is exchange of data in common formats such as being made when the file is leaked to an external file there is a risk that the exposure key identification information of the patient. This medial image file has no protection such as encryption, This this paper, introduces a masking technique using a mosaic technique encrypting the image file contains the application to optical character recognition techniques. We propose pseudonymization technique of personal information in the image data.

Spam Image Detection Model based on Deep Learning for Improving Spam Filter

  • Seong-Guk Nam;Dong-Gun Lee;Yeong-Seok Seo
    • Journal of Information Processing Systems
    • /
    • v.19 no.3
    • /
    • pp.289-301
    • /
    • 2023
  • Due to the development and dissemination of modern technology, anyone can easily communicate using services such as social network service (SNS) through a personal computer (PC) or smartphone. The development of these technologies has caused many beneficial effects. At the same time, bad effects also occurred, one of which was the spam problem. Spam refers to unwanted or rejected information received by unspecified users. The continuous exposure of such information to service users creates inconvenience in the user's use of the service, and if filtering is not performed correctly, the quality of service deteriorates. Recently, spammers are creating more malicious spam by distorting the image of spam text so that optical character recognition (OCR)-based spam filters cannot easily detect it. Fortunately, the level of transformation of image spam circulated on social media is not serious yet. However, in the mail system, spammers (the person who sends spam) showed various modifications to the spam image for neutralizing OCR, and therefore, the same situation can happen with spam images on social media. Spammers have been shown to interfere with OCR reading through geometric transformations such as image distortion, noise addition, and blurring. Various techniques have been studied to filter image spam, but at the same time, methods of interfering with image spam identification using obfuscated images are also continuously developing. In this paper, we propose a deep learning-based spam image detection model to improve the existing OCR-based spam image detection performance and compensate for vulnerabilities. The proposed model extracts text features and image features from the image using four sub-models. First, the OCR-based text model extracts the text-related features, whether the image contains spam words, and the word embedding vector from the input image. Then, the convolution neural network-based image model extracts image obfuscation and image feature vectors from the input image. The extracted feature is determined whether it is a spam image by the final spam image classifier. As a result of evaluating the F1-score of the proposed model, the performance was about 14 points higher than the OCR-based spam image detection performance.

The Lure of the Racial Other: Race and Sexuality in D. H. Lawrence's Quetzalcoatl (인종적 타자의 매혹 -로런스의 『께짤코아틀』에 그려진 인종과 성)

  • Kim, Sungho
    • Journal of English Language & Literature
    • /
    • v.55 no.4
    • /
    • pp.693-718
    • /
    • 2009
  • Kate Burns, a disillusioned Irish woman in Quetzalcoatl, has alternating feelings of fear, repulsion, oppression, compassion, and fascination vis-à-vis Mexican people. Together, these feelings are constitutive of a psychic process in which an imaginary appropriation of the other takes place. In this process white subjectivity represents or reconstructs the dark race precisely as its other. At the same time, Kate's feelings register her anxious recognition of the resistant, unappropriated being of the dark people: their true 'otherness,' or what Žižek calls "the excess of existence over representation." The otherness, frequently racial and sexual, evokes mixed feelings in the white subject. Kate's at once amorous and aggressive response to Ramón's body provides a case in point. Kate's emotional undulation is considerably mitigated in The Plumed Serpent, the revised version of the novel in which the theme of 'blood-mixing' is pushed to the ultimate point. Yet the interracial marriage resolves neither the racial nor the ontologico-sexual issues raised in the first version. Kate is still attracted to Ramón in his sagacious sensuality but goes on to get married to Cipriano, a pure Indian, only to find his mechanical masculinity ever unpalatable. This shows, not just Lawrence's wilful commitment to the 'blood-mixing' theme, but perhaps his lingering taboo against miscegenation as well. Changes in the plot entail those in the narrative voice. In Quetzalcoatl, Owen, a spectatorial and gossipy character, frequently competes for narration with the fully participant third-person narrator. In The Plumed Serpent, the third-person narrator becomes predominant, now attempting with greater confidence to present the reality of the racial other immediately to European readership. While such immediacy is illusional, narrative insistence on it implies a struggle to displace racial stereotypes and offer an experiential understanding of the other.

Enhancing Korean Alphabet Unit Speech Recognition with Neural Network-Based Alphabet Merging Methodology (한국어 자모단위 음성인식 결과 후보정을 위한 신경망 기반 자모 병합 방법론)

  • Solee Im;Wonjun Lee;Gary Geunbae Lee;Yunsu Kim
    • Annual Conference on Human and Language Technology
    • /
    • 2023.10a
    • /
    • pp.659-663
    • /
    • 2023
  • 이 논문은 한국어 음성인식 성능을 개선하고자 기존 음성인식 과정을 자모단위 음성인식 모델과 신경망 기반 자모 병합 모델 총 두 단계로 구성하였다. 한국어는 조합어 특성상 음성 인식에 필요한 음절 단위가 약 2900자에 이른다. 이는 학습 데이터셋에 자주 등장하지 않는 음절에 대해서 음성인식 성능을 저하시키고, 학습 비용을 높이는 단점이 있다. 이를 개선하고자 음절 단위의 인식이 아닌 51가지 자모 단위(ㄱ-ㅎ, ㅏ-ㅞ)의 음성인식을 수행한 후 자모 단위 인식 결과를 음절단위의 한글로 병합하는 과정을 수행할 수 있다[1]. 자모단위 인식결과는 초성, 중성, 종성을 고려하면 규칙 기반의 병합이 가능하다. 하지만 음성인식 결과에 잘못인식된 자모가 포함되어 있다면 최종 병합 결과에 오류를 생성하고 만다. 이를 해결하고자 신경망 기반의 자모 병합 모델을 제시한다. 자모 병합 모델은 분리되어 있는 자모단위의 입력을 완성된 한글 문장으로 변환하는 작업을 수행하고, 이 과정에서 음성인식 결과로 잘못인식된 자모에 대해서도 올바른 한글 문장으로 변환하는 오류 수정이 가능하다. 본 연구는 한국어 음성인식 말뭉치 KsponSpeech를 활용하여 실험을 진행하였고, 음성인식 모델로 Wav2Vec2.0 모델을 활용하였다. 기존 규칙 기반의 자모 병합 방법에 비해 제시하는 자모 병합 모델이 상대적 음절단위오류율(Character Error Rate, CER) 17.2% 와 단어단위오류율(Word Error Rate, WER) 13.1% 향상을 확인할 수 있었다.

  • PDF

Study on the Neural Network for Handwritten Hangul Syllabic Character Recognition (수정된 Neocognitron을 사용한 필기체 한글인식)

  • 김은진;백종현
    • Korean Journal of Cognitive Science
    • /
    • v.3 no.1
    • /
    • pp.61-78
    • /
    • 1991
  • This paper descibes the study of application of a modified Neocognitron model with backward path for the recognition of Hangul(Korean) syllabic characters. In this original report, Fukushima demonstrated that Neocognitron can recognize hand written numerical characters of $19{\times}19$ size. This version accepts $61{\times}61$ images of handwritten Hangul syllabic characters or a part thereof with a mouse or with a scanner. It consists of an input layer and 3 pairs of Uc layers. The last Uc layer of this version, recognition layer, consists of 24 planes of $5{\times}5$ cells which tell us the identity of a grapheme receiving attention at one time and its relative position in the input layer respectively. It has been trained 10 simple vowel graphemes and 14 simple consonant graphemes and their spatial features. Some patterns which are not easily trained have been trained more extrensively. The trained nerwork which can classify indivisual graphemes with possible deformation, noise, size variance, transformation or retation wre then used to recongnize Korean syllabic characters using its selective attention mechanism for image segmentation task within a syllabic characters. On initial sample tests on input characters our model could recognize correctly up to 79%of the various test patterns of handwritten Korean syllabic charactes. The results of this study indeed show Neocognitron as a powerful model to reconginze deformed handwritten charavters with big size characters set via segmenting its input images as recognizable parts. The same approach may be applied to the recogition of chinese characters, which are much complex both in its structures and its graphemes. But processing time appears to be the bottleneck before it can be implemented. Special hardware such as neural chip appear to be an essestial prerquisite for the practical use of the model. Further work is required before enabling the model to recognize Korean syllabic characters consisting of complex vowels and complex consonants. Correct recognition of the neighboring area between two simple graphemes would become more critical for this task.

Curvature stroke modeling for the recognition of on-line cursive korean characters (온라인 흘림체 한글 인식을 위한 곡률획 모델링 기법)

  • 전병환;김무영;김창수;박강령;김재희
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.11
    • /
    • pp.140-149
    • /
    • 1996
  • Cursive characters are written on an economical principle to reduce the motion of a pen in the limit of distinction between characters. That is, the pen is not lifted up to move for writing a next stroke, the pen is not moved at all, or connected two strokes chance their shapes to a similar and simple shape which is easy to be written. For these reasons, strokes and korean alphabets are not only easy to be changed, but also difficult to be splitted. In this paper, we propose a curvature stroke modeling method for splitting and matching by using a structural primitive. A curvature stroke is defined as a substroke which does not change its curvanture. Input strokes handwritten in a cursive style are splitted into a sequence of curvature strokes by segmenting the points which change the direction of rotation, which occur a sudden change of direction, and which occur an excessive rotation Each reference of korean alphabets is handwritten in a printed style and is saved as a sequence of curvature strikes which is generated by splitting process. And merging process is used to generate various sequences of curvature strikes for matching. Here, it is also considered that imaginary strokes can be written or omitted. By using a curvature stroke as a unit of recognition, redundant splitting points in input characters are effectively reduced and exact matching is possible by generating a reference curvature stroke, which consists of the parts of adjacent two korean alphasbets, even when the connecting points between korean alphabets are not splitted. The results showed 83.6% as recognition rate of the first candidate and 0.99sec./character (CPU clock:66MHz) as processing time.

  • PDF

Reconsideration of Acer pictum complex in Korea (한국산(韓國産) 고로쇠분류군(分類群)에 대한 재고(再考))

  • Chang, Chin-Sung
    • Korean Journal of Plant Taxonomy
    • /
    • v.31 no.3
    • /
    • pp.283-309
    • /
    • 2001
  • Acer pictum complex (A. pictum Thunb. ex Murray with varieties, A. okamotoanum Nakai, A. truncatum Bunge) in eastern Asia causes frequent difficulty in identification. One hundred twenty five specimens from A. pictum complex of China, Korea and Japan and A. cappadocicum var. sinicum of China were compared to investigate patterns of intra- and interspecific variation and to evaluate a recognition of several species as well as many varieties using 22 characters for morphometric analysis. The first three PCA accounted for 59% of the total variance. No strong discontinuities existed among taxa with respect to fruit and leaf characters. Much overlap among all taxa occurred the central region of the scatter diagram. Many characters appeared to show some clinal variation with changes from east of China to Japan through Korea. This was true not only when all species as considered as a single taxon, but when characters of individual taxa were compared with geography. As one considers a path from the western part of the ranges to areas to the east, the leaves become larger in most respects and become increasingly many lobed (five to seven or nine). In general, there was a tendency toward larger nutlet with smaller wing in the area toward northeast of China (=A. truncatum), while in the east of ranges (Island Ullung-do), plants were larger with respect to characters of fruit and leaves (=A. okamotoanum). The morphological differentiation between A. okamotoanum and Japanese and Korean individuals of A. pictum was not considered sufficient to warrant recognition of either specific or varietal status and should be treated as con specific under A. pictum var. mono. Since the lectotype of Acer pictum had minute hairs uniformly on the under surface of leaves(A. pictum var. pictum), the glabrous type of A. pictum was called A. pictum var. mono as Ohahsi suggested. The univaraite analysis (the mean and maximum/minium of nutlet size and wing/nutlet length ratio) indicated geographical differentiation of northeastern populations, A. truncatum, was distinctive, but Korean individuals of A. truncatum showed an affinity between Chinese individuals of A. truncatum and Korean individuals of A. Pictum var. mono. The current results, together with qualitative character, trunk features, justify subspecific status for this taxon. The previous varieties of A. mono in Korea were indistinguishable from typical form of A. Pictum var. mono on the basis of the wing angle and nutlet size, rejecting continued recognition of these taxa as distinctive varieties. Therefore, it is recommended that only one polymorphic species of A. pictum be recognized in addition to three varieties.

  • PDF

The Comparative Analysis of Exposure Conditions between F/S and C/R System for an Ideal Image in Simple Abdomen (복부 단순촬영의 이상적 영상구현을 위한 F. S system과 C.R system의 촬영조건 비교분석)

  • Son, Sang-Hyuk;Song, Young-Geun;Kim, Je-Bong
    • Korean Journal of Digital Imaging in Medicine
    • /
    • v.9 no.1
    • /
    • pp.37-43
    • /
    • 2007
  • 1. Purpose : This study is to present effective exposure conditions to acquire the best image of simple abdomen in Film Screen (F.S) system and Computed Radiography (C.R) system. 2. Method : In the F.S system, while an exposure condition was fixed as 70kVp, images of a patients simple abdomen were taken under the different mAs exposure conditions. Among these images, the best one was chosen by radiologists and radiological technologists. In the C.R system, the best image of the same patient was acquired with the same method from the F.S system. Both characteristic curves from F.S system and C.R system were analyzed. 3. Results : In the F.S system, the best exposure condition of simple abdomen was 70kVp and 20mAs. In the CR system, with the fixed condition at 70kVp, the image densities of human organs, such as liver, kidney, spleen, psoas muscle, lumbar spine body and iliac crest, were almost same despite different environments (3.2mAs, 8mAs, 12mAs, 16mAs and 20mAs). However, when the exposure conditions were over or under (below) 12mAs, the images between the abdominal wall and the directly exposed part became blurred because the gap of density was decreased. In the C.R system, while the volume of mAs was decreased, an artifact of quantum mottle was increased. 4. Conclusion : This study shows that the exposure condition in the C.R system can be reduced 40% than in the F.S system. This paper concluded that when the exposure conditions are set in CR environment, after the analysis of equipment character, such as image processing system(EDR : Exposure Data Recognition processing), PACS and so on, the high quality of image with maximum information can be acquired with a minimum exposure dose.

  • PDF

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.

A study on the improving and constructing the content for the Sijo database in the Period of Modern Enlightenment (계몽기·근대시조 DB의 개선 및 콘텐츠화 방안 연구)

  • Chang, Chung-Soo
    • Sijohaknonchong
    • /
    • v.44
    • /
    • pp.105-138
    • /
    • 2016
  • Recently with the research function, "XML Digital collection of Sijo Texts in the Period of Modern Enlightenment" DB data is being provided through the Korean Research Memory (http://www.krm.or.kr) and the foundation for the constructing the contents of Sijo Texts in the Period of Modern Enlightenment has been laid. In this paper, by reviewing the characteristics and problems of Digital collection of Sijo Texts in the Period of Modern Enlightenment and searching for the improvement, I tried to find a way to make it into the content. This database has the primary meaning in the integrating and glancing at the vast amounts of Sijo in the Period of Modern Enlightenment to reaching 12,500 pieces. In addition, it is the first Sijo data base which is provide the variety of search features according to literature, name of poet, title of work, original text, per period, and etc. However, this database has the limits to verifying the overall aspects of the Sijo in the Period of Modern Enlightenment. The title and original text, which is written in the archaic word or Chinese character, could not be searched, because the standard type text of modern language is not formatted. And also the works and the individual Sijo works released after 1945 were missing in the database. It is inconvenient to extract the datum according to the poet, because poets are marked in the various ways such as one's real name, nom de plume and etc. To solve this kind of problems and improve the utilization of the database, I proposed the providing the standard type text of modern language, giving the index terms about content, providing the information on the work format and etc. Furthermore, if the Sijo database in the Period of Modern Enlightenment which is prepared the character of the Sijo Culture Information System could be built, it could be connected with the academic, educational contents. For the specific plan, I suggested as follow, - learning support materials for the Modern history and the national territory recognition on the Modern Age - source materials for studying indigenous animals and plants characters creating the commercial characters - applicability as the Sijo learning tool such as Sijo Game.

  • PDF