• Title/Summary/Keyword: Model Feature Map

Search Result 161, Processing Time 0.031 seconds

A Study On Handwritten Numeral Recognition Using Numeral Shape Grasp and Divided FSOM (숫자의 형태 이해와 분할된 FSOM을 이용한 필기 숫자 인식에 관한 연구)

  • 서석배;김대진;강대성
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.8B
    • /
    • pp.1490-1499
    • /
    • 1999
  • This paper proposes a new handwritten numeral recognition method using numeral shape grasps and FSOM (Fuzzy Self-Organizing Map). The proposed algorithm is based on the idea that numeral input data with similar shapes are classified into the same class. Shapes of numeral data are created using lines of external-contact and the class of numeral data is determined by template matching of the shapes. Each class of numeral data has FSOM and feature extraction method, respectively. In this paper, we divide the numeral database into the 16 classes. The divided FSOM model allows not only an independent learning phase of SOM but also step-by-step learning. Experiments using Concordia University handwritten numeral database proved that the proposed algorithm is effective to improve recognition accuracy.

  • PDF

DP-LinkNet: A convolutional network for historical document image binarization

  • Xiong, Wei;Jia, Xiuhong;Yang, Dichun;Ai, Meihui;Li, Lirong;Wang, Song
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1778-1797
    • /
    • 2021
  • Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.

Confluence shear layer feature extraction method using RGB aerial imagery (RGB 항공영상을 이용한 합류부 전단층 특징 추출법)

  • Noh, Hyoseob;Park, Yong Sung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.277-277
    • /
    • 2021
  • 합류부는 인공수로 또는 자연하천에서 흔히 존재하며 매우 복잡한 흐름 구조가 발생하는 곳이다. 특히 본류와 지류의 유속장의 차이에 따라 발생하는 전단층은 흐름과 물질이 혼합되는 경계면이 되며, 흐름 구조가 전단층을 따라 발달한다는 특징으로 인해 수리학적으로 매우 중요하다. 최근 원격탐사 기법의 발전에 따라 위성이나 드론과 같은 무인 이동체를 이용한 하천 계측법이 수질 및 지형변화 연구들에 광범위하게 적용되고 있다. 그 중 RGB 항공영상은 해상도가 높고 취득 비용이 저렴하여 확장성 및 활용도가 높다. 본 연구에서는 합류부 전단층이 촬영된 RGB 항공 영상을 이용해 합류부 전단층 분석에 활용하는 방법을 제안한다. 제안되는 방법은 RGB 항공 영상에서 본류와 지류의 수체 영역을 각각 추출하기 위해 가우시안 혼합 모형(Gaussian mixture model)을 이용한다. 추출된 수체 영역에는 자기조직화지도(self-organizing map)을 적용하고 좌표 변환을 하여 정량적인 특징을 추출한다. 본 연구에서는 알고리듬의 적용 예로서 구글어스를 통해 확보된 낙동강-남강 합류부의 항공 영상을 분석한다. 본 추출법을 이용하면 접촉식 센서를 이용하는 기존의 전단층 계측 방법들에 비해 경제적이고 안전하며 합류부 흐름의 평면적 분석을 가능하게 할 수 있을 것으로 기대된다.

  • PDF

A Dual-scale Network with Spatial-temporal Attention for 12-lead ECG Classification

  • Shuo Xiao;Yiting Xu;Chaogang Tang;Zhenzhen Huang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.9
    • /
    • pp.2361-2376
    • /
    • 2023
  • The electrocardiogram (ECG) signal is commonly used to screen and diagnose cardiovascular diseases. In recent years, deep neural networks have been regarded as an effective way for automatic ECG disease diagnosis. The convolutional neural network is widely used for ECG signal extraction because it can obtain different levels of information. However, most previous studies adopt single scale convolution filters to extract ECG signal features, ignoring the complementarity between ECG signal features of different scales. In the paper, we propose a dual-scale network with convolution filters of different sizes for 12-lead ECG classification. Our model can extract and fuse ECG signal features of different scales. In addition, different spatial and time periods of the feature map obtained from the 12-lead ECG may have different contributions to ECG classification. Therefore, we add a spatial-temporal attention to each scale sub-network to emphasize the representative local spatial and temporal features. Our approach is evaluated on PTB-XL dataset and achieves 0.9307, 0.8152, and 89.11 on macro-averaged ROC-AUC score, a maximum F1 score, and mean accuracy, respectively. The experiment results have proven that our approach outperforms the baselines.

Land Use Feature Extraction and Sprawl Development Prediction from Quickbird Satellite Imagery Using Dempster-Shafer and Land Transformation Model

  • Saharkhiz, Maryam Adel;Pradhan, Biswajeet;Rizeei, Hossein Mojaddadi;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.1
    • /
    • pp.15-27
    • /
    • 2020
  • Accurate knowledge of land use/land cover (LULC) features and their relative changes over upon the time are essential for sustainable urban management. Urban sprawl growth has been always also a worldwide concern that needs to carefully monitor particularly in a developing country where unplanned building constriction has been expanding at a high rate. Recently, remotely sensed imageries with a very high spatial/spectral resolution and state of the art machine learning approaches sent the urban classification and growth monitoring to a higher level. In this research, we classified the Quickbird satellite imagery by object-based image analysis of Dempster-Shafer (OBIA-DS) for the years of 2002 and 2015 at Karbala-Iraq. The real LULC changes including, residential sprawl expansion, amongst these years, were identified via change detection procedure. In accordance with extracted features of LULC and detected trend of urban pattern, the future LULC dynamic was simulated by using land transformation model (LTM) in geospatial information system (GIS) platform. Both classification and prediction stages were successfully validated using ground control points (GCPs) through accuracy assessment metric of Kappa coefficient that indicated 0.87 and 0.91 for 2002 and 2015 classification as well as 0.79 for prediction part. Detail results revealed a substantial growth in building over fifteen years that mostly replaced by agriculture and orchard field. The prediction scenario of LULC sprawl development for 2030 revealed a substantial decline in green and agriculture land as well as an extensive increment in build-up area especially at the countryside of the city without following the residential pattern standard. The proposed method helps urban decision-makers to identify the detail temporal-spatial growth pattern of highly populated cities like Karbala. Additionally, the results of this study can be considered as a probable future map in order to design enough future social services and amenities for the local inhabitants.

A Baseline Correction for Effective Analysis of Alzheimer’s Disease based on Raman Spectra from Platelet (혈소판 라만 스펙트럼의 효율적인 분석을 위한 기준선 보정 방법)

  • Park, Aa-Ron;Baek, Sung-June
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.49 no.1
    • /
    • pp.16-22
    • /
    • 2012
  • In this paper, we proposed a method of baseline correction for analysis of Raman spectra of platelets from Alzheimer's disease (AD) transgenic mice. Measured Raman spectra include the meaningful information and unnecessary noise which is composed of baseline and additive noise. The Raman spectrum is divided into the local region including several peaks and the spectrum of the region is modeled by curve fitting using Gaussian model. The additive noise is clearly removed from the process of replacing the original spectrum with the fitted model. The baseline correction after interpolating the local minima of the fitted model with linear, piecewise cubic Hermite and cubic spline algorithm. The baseline corrected models extract the feature with principal component analysis (PCA). The classification result of support vector machine (SVM) and maximum $a$ posteriori probability (MAP) using linear interpolation method showed the good performance about overall number of principal components, especially SVM gave the best performance which is about 97.3% true classification average rate in case of piecewise cubic Hermite algorithm and 5 principal components. In addition, it confirmed that the proposed baseline correction method compared with the previous research result could be effectively applied in the analysis of the Raman spectra of platelet.

A Study on Determinants of Commercial Land Values in Gwangju City (광주시 상업지 지가의 형성요인에 관한 연구)

  • Lee, Hyun-Wook
    • Journal of the Korean association of regional geographers
    • /
    • v.2 no.2
    • /
    • pp.159-171
    • /
    • 1996
  • The aim of this study is which factors affect the commercial land values and how they act upon them through distribution of commercial land values by multiple regression analysis in Gwangju city. The major findings of this study are as follows: (1) The changes of commercial land values distribution in $1989{\sim}1996$, We see that the commercial area of higher land values extends following the main arterial road. This is related to urbanization in urban fringe while the decline of commercial land values occurs in city center with long history of commercial region. This is due to unsuitableness in rapid changes of commercial environment because of fragmented lots, old buildings. traffic congestion etc. (2) The regions where commercial land values greatly rose are the west in constructed the new planning city center of Sangmu-dong. and the south west in which is related to the extension of high density apartment and the location of big discount stores. (3) Through the changes in commercial land values distribution map. and road map, topographical map, we know that commercial land values is related to various factors; namely, distance from CBD, convenient traffic, reputation of commercial district, condition of a road, size of supplementary, a degree of commercial land use etc. (4) From the above related factor, six variables are extracted by operational definition. That is the spatial distance from the city center, the walking distance to a stopping place, the road width, the amount of bus traffic, the amount of pedestrian, the number of the shop. (5) Data of seven variables are collected on the highest values point of each Dong. We applicate multiple regression analysis with commercial land values as a dependent variable, extracted six variables as independent variables. (6) As a result of multiple regression on the determinants of commercial land values, the variables which is greatly related to commercial land values are the amount of pedestrain, the spatial distance from city center. We identify that two variables explain variance of the commercial land values by 65%. (7) In order to make clear about not explained 35%. we carry out analysis of residual. In consequence, we see small estimate in downtown area and large estimate in urban fringe. This feature is due to simple core structure of Gwangju city and limits of this regression model.

  • PDF

Improved SIM Algorithm for Contents-based Image Retrieval (내용 기반 이미지 검색을 위한 개선된 SIM 방법)

  • Kim, Kwang-Baek
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.2
    • /
    • pp.49-59
    • /
    • 2009
  • Contents-based image retrieval methods are in general more objective and effective than text-based image retrieval algorithms since they use color and texture in search and avoid annotating all images for search. SIM(Self-organizing Image browsing Map) is one of contents-based image retrieval algorithms that uses only browsable mapping results obtained by SOM(Self Organizing Map). However, SOM may have an error in selecting the right BMU in learning phase if there are similar nodes with distorted color information due to the intensity of light or objects' movements in the image. Such images may be mapped into other grouping nodes thus the search rate could be decreased by this effect. In this paper, we propose an improved SIM that uses HSV color model in extracting image features with color quantization. In order to avoid unexpected learning error mentioned above, our SOM consists of two layers. In learning phase, SOM layer 1 has the color feature vectors as input. After learning SOM Layer 1, the connection weights of this layer become the input of SOM Layer 2 and re-learning occurs. With this multi-layered SOM learning, we can avoid mapping errors among similar nodes of different color information. In search, we put the query image vector into SOM layer 2 and select nodes of SOM layer 1 that connects with chosen BMU of SOM layer 2. In experiment, we verified that the proposed SIM was better than the original SIM and avoid mapping error effectively.

  • PDF

Development of Deep Learning Structure for Defective Pixel Detection of Next-Generation Smart LED Display Board using Imaging Device (영상장치를 이용한 차세대 스마트 LED 전광판의 불량픽셀 검출을 위한 딥러닝 구조 개발)

  • Sun-Gu Lee;Tae-Yoon Lee;Seung-Ho Lee
    • Journal of IKEEE
    • /
    • v.27 no.3
    • /
    • pp.345-349
    • /
    • 2023
  • In this paper, we propose a study on the development of deep learning structure for defective pixel detection of next-generation smart LED display board using imaging device. In this research, a technique utilizing imaging devices and deep learning is introduced to automatically detect defects in outdoor LED billboards. Through this approach, the effective management of LED billboards and the resolution of various errors and issues are aimed. The research process consists of three stages. Firstly, the planarized image data of the billboard is processed through calibration to completely remove the background and undergo necessary preprocessing to generate a training dataset. Secondly, the generated dataset is employed to train an object recognition network. This network is composed of a Backbone and a Head. The Backbone employs CSP-Darknet to extract feature maps, while the Head utilizes extracted feature maps as the basis for object detection. Throughout this process, the network is adjusted to align the Confidence score and Intersection over Union (IoU) error, sustaining continuous learning. In the third stage, the created model is employed to automatically detect defective pixels on actual outdoor LED billboards. The proposed method, applied in this paper, yielded results from accredited measurement experiments that achieved 100% detection of defective pixels on real LED billboards. This confirms the improved efficiency in managing and maintaining LED billboards. Such research findings are anticipated to bring about a revolutionary advancement in the management of LED billboards.

Classification of Music Data using Fuzzy c-Means with Divergence Kernel (분산커널 기반의 퍼지 c-평균을 이용한 음악 데이터의 장르 분류)

  • Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.3
    • /
    • pp.1-7
    • /
    • 2009
  • An approach for the classification of music genres using a Fuzzy c-Means(FcM) with divergence-based kernel is proposed and presented in this paper. The proposed model utilizes the mean and covariance information of feature vectors extracted from music data and modelled by Gaussian Probability Density Function (GPDF). Furthermore, since the classifier utilizes a kernel method that can convert a complicated nonlinear classification boundary to a simpler linear one, he classifier can improve its classification accuracy over conventional algorithms. Experiments and results on collected music data sets demonstrate hat the proposed classification scheme outperforms conventional algorithms including FcM and SOM 17.73%-21.84% on average in terms of classification accuracy.